[CRITICAL] Fix clipboard text operations and UTF-8 cursor handling #841

PavelSharp · 2025-10-18T00:15:19Z

Edit: Improve structure
This pull request effectively repairs the entire clipboard handling system in Nuklear. To achieve this, it addresses three separate bugs:

Note: see #840 for more details and a minimal reproducible example.

PavelSharp · 2025-10-19T18:13:36Z

Hi, @sleeptightAnsiC!
Thank you for your comment - I really appreciate the feedback. I carefully reviewed your notes and started digging into the issue. While doing so, I actually discovered two fantastic bugs in Nuklear that (hopefully!) explain all the clipboard and UTF-8 handling problems. Let me show you what I found:

1 Cursor position issue
The problem is caused by an incorrect cursor offset in line 27194 (nk_textedit_paste).
Obviously, we want to move the cursor by the number of codepoints, not bytes.
So instead of:

state->cursor += len;

it should be:

state->cursor += glyphs;

2 Clipboard copy bug - this one is mind-blowing
Nuklear calls clip->copy only once (line 28172 in nk_do_edit), like this:

edit->clip.copy(edit->clip.userdata, text, end - begin);

The backend must provide its own implementation of this function.
But guess what? It expects the length in bytes, not in codepoints (I double-checked nk_glfw3_clipboard_copy and nk_sdl_clipboard_copy). So that’s the root cause! Luckily, it’s easy to fix.
Here’s the corrected version:

text = nk_str_at_const(&edit->string, begin, &unicode, &glyph_len);
text2 = nk_str_at_const(&edit->string, end, &unicode, &glyph_len);

if (edit->clip.copy)
    edit->clip.copy(edit->clip.userdata, text, text2 - text);

After these changes, everything starts working properly, though this is still just a draft fix.
Would you like to give it a try? 😇
As an optional step towards full unicode support, it will be useful to take another look at #829.

sleeptightAnsiC · 2025-10-19T19:48:20Z

The issues that I mentioned in #841 (comment) does not occur after 3342ffc

But a possibility that you used LLM to fix those worries me a lot...

src/nuklear_string.c

sleeptightAnsiC · 2025-11-07T09:53:18Z

Something is really odd here (and I'm not talking about LLM concerns anymore, although I think my concerns are valid)

Per your comment under #852 (comment), if utf8/clipboard handling is broken everywhere (in every demo), this really feels like we have a regression here. What we should do first, is not trying to fix it, but instead we should find out why the regression has happened, what caused it and so on... Without this we may break something else, cause another regression, or run into risk that this thing will break again.

~~@PavelSharp did you try to git-bisect this issue before submitting PR/Issue ?~~

EDIT: Okey, fine... I think that the next Pavel's comment explains everything. Although, I still think this whole situation is a bit strange.

PavelSharp · 2025-11-07T14:14:07Z

@sleeptightAnsiC
So, let’s talk specifically about the clipboard copy bug. The root cause lies in these lines:

Nuklear/src/nuklear_edit.c

Lines 284 to 291 in fcd64f8

    
           int b = edit->select_start; 
        
           int e = edit->select_end; 
        
           int begin = NK_MIN(b, e); 
        
           int end = NK_MAX(b, e); 
        
           text = nk_str_at_const(&edit->string, begin, &unicode, &glyph_len); 
        
           if (edit->clip.copy) 
        
               edit->clip.copy(edit->clip.userdata, text, end - begin);

Here we clearly see that end - begin is passed as the len argument. Since b and e represent text selection positions in the edit buffer, end - begin gives us the number of Unicode codepoints selected.

However, as I said earlier, all backends treat len as the number of bytes, not the number of codepoints. And that mismatch is the core of this bug.

As for how old this issue is. It’s actually nearly 10 years old. I traced it back to the commit where version 1.0 was released. Before that point, Nuklear was still named zahnrad.

PavelSharp · 2025-11-14T16:30:44Z

@RobLoach I think this PR is finally ready to be merged!
All the requested changes have been addressed, and all merge conflicts have been fully resolved.
Also, as mentioned in our Reviewers Guide (“Have at least one other person review the changes before merging”), an additional review and testing has already been completed by @sleeptightAnsiC in his comment. Thanks.
With this, the clipboard behavior is now properly fixed across all existing backends

RobLoach · 2025-11-14T17:20:24Z

What in the ChatGPT 💀 ... I'll be happy to merge once we get the 👍

Any nit-picks can be fixed as follow ups. Thanks again, all, for all the hard work here.

sleeptightAnsiC · 2025-11-14T17:28:45Z

Given that this PR basically fixes something that was broken since forever, and I'm afraid a lot of people had to make some workarounds related to the bug, it might be safe to mark the fix as a "breaking change" and/or increase the major version (?)

I'm definitely NOT someone who should be reviewing this. A long-time contributor should take an action here and decide.

@rswinkle Would you mind taking a look? or maybe tag someone who you think is appropriate?

PavelSharp · 2025-11-14T17:33:25Z

@sleeptightAnsiC

I didn't test the latest changes yet. The old comment doesn't really apply here, it's outdated.

I'm sorry, but the only real code change since that comment is just this small tweak:
return nk_str_insert_at_rune(str, pos, text, len); to return nk_str_insert_at_rune(str, pos, text, len) ? len : 0;

Everything else was just rebasing and updating the branch.
Honestly, it felt like a tiny detail 🙂

sleeptightAnsiC · 2025-11-16T16:14:05Z

I'll try to test this soon.

the only real code change since that #841 (comment) is just this small tweak

Retested and everything seems fine.

Any nit-picks can be fixed as follow ups.

My only nitpick here is that it might be nice to have this change as one single commit (?) Those two commits don't really work correctly without each other, and the third commit is rebuilding the amalgamation, so it may confuse people. This way, if said change ever causes an issue for someone, it will be easier to find it with git-log or git-bisect. But this is just a minor inconvenience, and @RobLoach can easily solve it with "squash+merge", so no worries.

And my previous comment still applies #841 (comment)

PavelSharp · 2025-11-16T17:32:58Z

Just to note, there have been a couple of previous attempts to address this:

nk_do_edit cut and copy handler passes len as grapheme cluster count to nk_plugin_copy not as count bytes in const char* #764, as @sleeptightAnsiC pointed in his comment
Fix copied string corrupt when contain CJK glyphs #543, where the author describes the same problem but only fixes it for the GLFW backend.

This PR resolves the root cause. Personally, I believe it's obviously that the copy callback should take the number of bytes instead of glyphs.

Changes: (1) length as byte count in copy callback (instead of glyph count) (2) fix nk_str_insert_text_char and nk_str_insert_str_char (3) correct UTF-8 cursor handling Fixes: Immediate-Mode-UI#840 Fixes: Immediate-Mode-UI#764 Obsoletes: Immediate-Mode-UI#543 BREAKING CHANGE: If your custom backed has a workaround for this issue, it will break.

This comment was marked as outdated.

Sign in to view

This comment was marked as off-topic.

Sign in to view

RobLoach reviewed Oct 19, 2025

View reviewed changes

src/nuklear_string.c Outdated Show resolved Hide resolved

This comment was marked as outdated.

Sign in to view

This was referenced Nov 1, 2025

Add SDL3 #779

Closed

Add sdl3_renderer demo V4 #852

Merged

PavelSharp force-pushed the fix-nk_str_insert_text_char branch 3 times, most recently from ae86fca to 27ae085 Compare November 14, 2025 15:56

This comment was marked as outdated.

Sign in to view

sleeptightAnsiC mentioned this pull request Nov 16, 2025

nk_do_edit cut and copy handler passes len as grapheme cluster count to nk_plugin_copy not as count bytes in const char* #764

Open

PavelSharp force-pushed the fix-nk_str_insert_text_char branch 2 times, most recently from 5f6aab0 to 61a0ae1 Compare November 16, 2025 18:20

PavelSharp changed the title ~~Fix memory corruption in nk_str_insert_text_char when pasting UTF-8 text~~ [CRITICAL] Fix clipboard text operations and UTF-8 cursor handling Nov 16, 2025

PavelSharp added 2 commits November 17, 2025 01:41

Update CHANGELOG and Rebuild nuklear.h

5ad5e59

PavelSharp force-pushed the fix-nk_str_insert_text_char branch from 61a0ae1 to 5ad5e59 Compare November 16, 2025 22:45

[CRITICAL] Fix clipboard text operations and UTF-8 cursor handling #841

Are you sure you want to change the base?

[CRITICAL] Fix clipboard text operations and UTF-8 cursor handling #841

Uh oh!

Conversation

PavelSharp commented Oct 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as outdated.

PavelSharp commented Oct 19, 2025

Uh oh!

This comment was marked as off-topic.

sleeptightAnsiC commented Oct 19, 2025

Uh oh!

This comment was marked as off-topic.

This comment was marked as off-topic.

Uh oh!

This comment was marked as outdated.

sleeptightAnsiC commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PavelSharp commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PavelSharp commented Nov 14, 2025

Uh oh!

This comment was marked as outdated.

RobLoach commented Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sleeptightAnsiC commented Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PavelSharp commented Nov 14, 2025

Uh oh!

sleeptightAnsiC commented Nov 16, 2025

Uh oh!

PavelSharp commented Nov 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

PavelSharp commented Oct 18, 2025 •

edited

Loading

sleeptightAnsiC commented Nov 7, 2025 •

edited

Loading

PavelSharp commented Nov 7, 2025 •

edited

Loading

RobLoach commented Nov 14, 2025 •

edited

Loading

sleeptightAnsiC commented Nov 14, 2025 •

edited

Loading