Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for PDF files #2

Closed
tempelmann opened this issue Jan 20, 2024 · 1 comment
Closed

Add support for PDF files #2

tempelmann opened this issue Jan 20, 2024 · 1 comment

Comments

@tempelmann
Copy link

PDFs that contain scanned image with text in them can't be parsed with this tool right now because it assumes the files are images.

I wonder if this can be made to work with PDFs as well. I suspect this would require to open the PDF and then render each page into an NSImage, and then feed each image into the text recognizer, and then merge the page results into one.

@xulihang
Copy link
Owner

This project is for integrating macOS's OCR ability to other full-fledge OCR tools like ImageTrans, so PDF support will not be considered. You can use ImageTrans for PDF OCR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants