ThriftAssist - OCR Phrase Detection

A powerful OCR tool for detecting and annotating phrases in images using Google Cloud Vision API with fuzzy matching support.

Features

🔍 Multi-orientation text detection - Handles horizontal, vertical, upside-down, and diagonal text
🎯 Fuzzy phrase matching - Finds phrases even with OCR errors or variations
📦 Spanning detection - Matches phrases that span multiple lines
🎨 Visual annotation - Draws color-coded bounding boxes with smart label placement
⚡ Configurable - Easy configuration for thresholds, angles, and text filtering

Project Structure

.
├── README.md            # This file
├── requirements.txt     # Python package dependencies
├── thrift_assist/       # Source code for ThriftAssist
│   ├── __init__.py
│   ├── cli.py           # Command-line interface
│   ├── config.py        # Configuration handling
│   ├── detector.py      # Core detection logic
│   ├── drawer.py        # Visual annotation logic
│   └── ocr.py           # OCR processing logic
└── tests/               # Unit tests for ThriftAssist
    ├── __init__.py
    ├── test_detector.py
    ├── test_drawer.py
    └── test_ocr.py

Installation

Clone the repository:

git clone https://github.com/yourusername/thrift_assist.git
cd thrift_assist

Install the required Python packages:
```
pip install -r requirements.txt
```
Set up your Google Cloud Vision API credentials:
- Follow the Google Cloud Vision API Quickstart to create a project and obtain credentials.
- Set the GOOGLE_APPLICATION_CREDENTIALS environment variable to the path of your service account key file:
```
export GOOGLE_APPLICATION_CREDENTIALS="/path/to/your/service-account-file.json"
```

Usage

Run the command-line interface to start detecting phrases in images:

python -m thrift_assist.cli --help

Contributing

Contributions are welcome! Please follow these steps:

Fork the repository.
Create a new branch for your feature or bugfix.
Make your changes and commit them.
Push your branch and create a pull request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 174 Commits
backend		backend
config		config
image		image
public		public
tests		tests
utils		utils
vision		vision
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
Makefile		Makefile
README.md		README.md
conftest.py		conftest.py
debug_angles.py		debug_angles.py
main.py		main.py
pytest.ini		pytest.ini
requirements-test.txt		requirements-test.txt
requirements.txt		requirements.txt
run_api.py		run_api.py
thriftassist_colors.png		thriftassist_colors.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ThriftAssist - OCR Phrase Detection

Features

Project Structure

Installation

Usage

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ThriftAssist - OCR Phrase Detection

Features

Project Structure

Installation

Usage

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages