Skip to content

firaslamouchi21/Job-Scraper02

Repository files navigation

Job Scraper Tool by Firas Lamouchi

CI License Python Docker

A local-first job scraping stack with Docker Compose by Firas Lamouchi.

Services

Quick StartScreenshot 2026-02-03 193941

  1. Configure

Edit files in ./config:

  • sites.txt: one site per line
  • keywords.txt: one keyword or job title per line
  • cv.txt: your CV text
  • .env.example: copy to .env and set GROQ_API_KEY (optional)
  1. Run
docker compose up --build
  1. Use

Developer CLI

Run without the UI:

python cli.py run --lite
python cli.py run --api-key YOUR_GROQ_KEY

Export saved results from the local SQLite database:

python cli.py export --format json --limit 200
python cli.py export --format csv --out jobs.csv

You can override paths:

python cli.py --config ./config --data ./data run --lite

Makefile

make build
make up
make down
make logs
make ps

BYOK and Lite Mode

  • Provide a Groq API key in the UI to enable AI scoring.
  • Toggle Lite Mode to use keyword-only scoring without an API key.
  • Scraper can also be triggered from n8n via POST http://scraper:8000/run

Persistence

All state is stored in ./data:

  • SQLite database and logs
  • n8n workflows and database

Running docker compose down will not delete ./data.

Scraper API

Trigger a run:

Read results:

Export:

Rate limiting and retries

The scraper supports basic delay and retry tuning via environment variables:

  • REQUEST_DELAY_SECONDS (default 0.6)
  • RETRY_MAX_ATTEMPTS (default 4)
  • RETRY_BASE_SECONDS (default 0.6)
  • RETRY_MAX_SECONDS (default 8)

n8n Trigger

Send a POST to http://scraper:8000/run with JSON:

{"api_key":"your_key","lite_mode":false}

About

is a local-first, Dockerized engine that automates job hunting. It scrapes DuckDuckGo using Playwright, ranks listings via Groq AI, and manages everything through a FastAPI/Streamlit stack. Integrated with n8n for scheduling, it turns the job search into a fully automated pipeline

Topics

Resources

License

Contributing

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors