Add support for PDF files #2

tempelmann · 2024-01-20T15:11:18Z

PDFs that contain scanned images with text in them can't be parsed with this tool right now because it assumes the input files are images.

I wonder if this can be made to work with PDFs as well. I suspect this would require opening the PDF, rendering each page into an NSImage, feeding each image into the text recognizer, and then merging the per-page results into one.
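A rough sketch of what that could look like, assuming PDFKit for rendering and Vision's `VNRecognizeTextRequest` for recognition. I don't know how this tool is structured internally, so `renderPage`, `recognizeText(in:)`, and `recognizeText(inPDFAt:)` are hypothetical names, and the 2x render scale is a guess:

```swift
import AppKit
import PDFKit
import Vision

/// Hypothetical helper: renders one PDF page to a bitmap.
/// The scale factor is an assumption; scanned pages may need more or less.
func renderPage(_ page: PDFPage, scale: CGFloat = 2.0) -> CGImage? {
    let bounds = page.bounds(for: .mediaBox)
    let size = NSSize(width: bounds.width * scale, height: bounds.height * scale)
    // PDFKit renders the page into an NSImage that fits the requested size.
    let image = page.thumbnail(of: size, for: .mediaBox)
    return image.cgImage(forProposedRect: nil, context: nil, hints: nil)
}

/// Hypothetical helper: runs Vision text recognition on a single image
/// and returns the recognized lines joined by newlines.
func recognizeText(in image: CGImage) throws -> String {
    let request = VNRecognizeTextRequest()
    request.recognitionLevel = .accurate
    let handler = VNImageRequestHandler(cgImage: image, options: [:])
    try handler.perform([request])
    // Assumes a recent SDK where `results` is typed as [VNRecognizedTextObservation]?.
    let observations = request.results ?? []
    return observations
        .compactMap { $0.topCandidates(1).first?.string }
        .joined(separator: "\n")
}

/// Opens the PDF, recognizes each page in order, and merges the page
/// results into one string separated by blank lines.
func recognizeText(inPDFAt url: URL) throws -> String {
    guard let document = PDFDocument(url: url) else {
        throw CocoaError(.fileReadCorruptFile)
    }
    var pageTexts: [String] = []
    for index in 0..<document.pageCount {
        // Pages that cannot be loaded or rendered are skipped silently in this sketch.
        guard let page = document.page(at: index),
              let image = renderPage(page) else { continue }
        pageTexts.append(try recognizeText(in: image))
    }
    return pageTexts.joined(separator: "\n\n")
}
```

Calling `try recognizeText(inPDFAt: URL(fileURLWithPath: "scan.pdf"))` (the path is only a placeholder) would return the text of all pages as one string. Per-page bounding boxes would need the Vision observations kept separately for each page instead of being flattened to text.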

xulihang · 2024-01-21T03:03:13Z

This project is for integrating macOS's OCR capability into other full-fledged OCR tools like ImageTrans, so PDF support will not be considered. You can use ImageTrans for PDF OCR.

xulihang closed this as completed Jan 23, 2024
