Skip to content

This little script renames your PDFs based in its contents, it has the ability to extract text from images so don't worry if your PDFs are just prints of something.

Notifications You must be signed in to change notification settings

puzzledbytheweb/pdf-renominator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PDF Renominator

This little script renames your PDFs based in its contents, it has the ability to extract text from images so don't worry if your PDFs are just prints of something.

Prerequisites

It uses OpenAI API so you'll need to create an account and pay them 5$ (they stopped their free tier 😔).

You'll also need to have Poppler and Tesseract installed globally:

Windows

MacOS

brew install poppler
brew install tesseract

Ubuntu

sudo apt update
sudo apt install poppler-utils tesseract-ocr

Setup

  1. Set up a virtual environment to encapsulate your Python packages, otherwise running the next command installs the dependecies globally:

    python -m venv .venv
  2. Install the required dependencies:

    pip install -r requirements.txt
  3. Set your OpenAI API key:

    export OPENAI_API_KEY=YOUR_API_KEY
  4. Run the main script:

    python main.py

About

This little script renames your PDFs based in its contents, it has the ability to extract text from images so don't worry if your PDFs are just prints of something.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages