Open
Description
Unicode defines normalization forms (NFC, NFD, NFKC, NFKD) for strings and assigns every character a General_Category.
It might work to normalize strings to NFKD and then remove any characters of class Mn (Nonspacing_Mark) (see table 12).
It might be necessary to handle conversions like ß to ss specially, because ß has no decomposition and NFKD leaves it unchanged.
See also the Python Stack Overflow answer.
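
A minimal Python sketch of the idea above, using only the standard `unicodedata` module. The function name `strip_diacritics` and the `SPECIAL_CASES` table are hypothetical and only for illustration; the table would probably need more entries than ß.

```python
import unicodedata

# Hypothetical table for characters that NFKD does not decompose.
# Only the ß example from this issue is listed; extend as needed.
SPECIAL_CASES = {"ß": "ss"}


def strip_diacritics(text: str) -> str:
    """Return text with nonspacing marks (category Mn) removed."""
    # Handle the special cases first, since normalization leaves them alone.
    for src, dst in SPECIAL_CASES.items():
        text = text.replace(src, dst)
    # NFKD splits e.g. "é" into "e" followed by U+0301 (combining acute).
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop every character whose General_Category is Mn.
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")
```

Because NFKD is a compatibility decomposition, this also rewrites things like ligatures (ﬁ becomes fi), which may or may not be wanted. Letters without a decomposition, such as ø, pass through unchanged unless they are added to the table.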