Open
Description
The BagIt specification lets you specify that UTF-8 encoding be used in tag manifests, but it does not appear to assume a particular Unicode normalization form.
I have a problem where files are bagged on an OS X filesystem (which uses NFD) and then copied to Linux (where filenames are typically NFC). During validation, the NFC-normalized filename from the filesystem is compared against the NFD-normalized filename from the manifest, the two byte sequences differ, and validation fails.
Should a particular normalization form (NFC?) be assumed for Unicode encodings?
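To make the mismatch concrete, here is a minimal Python sketch of the comparison problem. It is only an illustration, not what the specification prescribes or what any BagIt library does: the helper name `paths_match` and the sample filenames are hypothetical, and choosing NFC as the common form is an assumption.

```python
import unicodedata


def paths_match(manifest_path: str, fs_path: str, form: str = "NFC") -> bool:
    """Compare two paths after normalizing both to the same Unicode form.

    Hypothetical helper for illustration; `form` defaults to NFC by assumption.
    """
    return (unicodedata.normalize(form, manifest_path)
            == unicodedata.normalize(form, fs_path))


# The same visible name, "café.txt", in two normalization forms:
# NFC uses the single precomposed code point U+00E9,
# NFD uses "e" followed by the combining acute accent U+0301.
nfc_name = "data/caf\u00e9.txt"
nfd_name = "data/cafe\u0301.txt"

# A direct string (or byte) comparison treats them as different paths.
print(nfc_name == nfd_name)             # False

# Normalizing both sides first makes the comparison succeed.
print(paths_match(nfd_name, nfc_name))  # True
```

A validator that normalized both the manifest entry and the filesystem entry before comparing would accept this bag, at the cost of treating two files whose names differ only in normalization form as the same path.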