PDF to text using EasyOCR by Jonathan Germain with thanks to EasyOCR (https://github.com/JaidedAI/EasyOCR), pdf2jpg (https://github.com/pankajr141/pdf2jpg), and https://github.com/hansalemaos/PDFImage2TXT
This is a simple Python script that uses EasyOCR to extract text from a PDF file.
pip install easyocr
Place pdf2txt.py wherever you want it
python pdf2txt.py This software works offline, however it will download any required EasyOCR models that are required on first use.
The output can be piped to other software like any other text output in Linux.
Need an AI-focused business solution? Contact the author via the Compilebunny github page