Open
Description
In some protocols, a single space is inserted after a ligature, splitting the affected word in two. pdftohtml does, however, correctly translate the ligatures themselves into their individual characters.
Examples: "défi nition", "diffi culté", "spécifi que"
The problem is already present in the XML generated by pdftohtml:
pdftohtml -xml -hidden -f 3 -l 39 -q -stdout -i 20080095.pdf
There, line 73 reads:
défi nition condamnés à de courtes peines ou placés en
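
One way to locate affected words in the extracted text is sketched below. It is only a heuristic, not a fix in pdftohtml itself: it looks for a word ending in a common ligature sequence (`fi`, `fl`, `ff`, `ffi`, `ffl`) followed by one space and another word, and reports the pair only if the joined form is in a known word list. The function name `find_split_candidates` and the `vocabulary` argument are assumptions made for illustration; a real word list would have to come from a dictionary for the language of the protocol.

```python
import re

# A word ending in a typical ligature sequence, exactly one space,
# then the continuation of the word.
CANDIDATE = re.compile(r"\b(\w*(?:ffi|ffl|ff|fi|fl)) (\w+)\b")


def find_split_candidates(text, vocabulary):
    """Return (split form, joined form) pairs whose joined form is a known word.

    `vocabulary` is a hypothetical set of lowercase words; without it,
    legitimate phrases such as "défi national" would be joined by mistake.
    """
    hits = []
    for match in CANDIDATE.finditer(text):
        joined = match.group(1) + match.group(2)
        if joined.lower() in vocabulary:
            hits.append((match.group(0), joined))
    return hits


if __name__ == "__main__":
    words = {"définition", "difficulté", "spécifique"}
    line = "défi nition condamnés à de courtes peines ou placés en"
    for split_form, joined in find_split_candidates(line, words):
        print(f"{split_form!r} -> {joined!r}")
```

The vocabulary check is what keeps the heuristic from merging two genuine words, so its quality decides how many false positives and misses occur.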