DataHawk is an OSINT tool that scrapes websites to identify sensitive information. It is designed for researchers and security professionals.
- Supports searching for emails, usernames, phone numbers, URLs, IP addresses, or custom regex patterns.
- Multithreading for faster crawling.
- Output results in TXT, CSV, or JSON formats.
- Proxy support for anonymous scraping.
- Dynamic user-agent rotation.
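To make the pattern-search feature concrete, here is a minimal sketch of regex-based extraction from page text. The `PATTERNS` table and the `extract_matches` helper are hypothetical names introduced for illustration, and the expressions are simplified assumptions; DataHawk's own patterns may differ.

```python
import re

# Hypothetical, simplified patterns for illustration only.
# They are not taken from DataHawk's source.
PATTERNS = {
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "ipv4": re.compile(
        r"\b(?:(?:25[0-5]|2[0-4]\d|1?\d?\d)\.){3}(?:25[0-5]|2[0-4]\d|1?\d?\d)\b"
    ),
    "url": re.compile(r"https?://[^\s\"'<>]+"),
}


def extract_matches(text, kind, custom_pattern=None):
    """Return unique matches in first-seen order.

    `kind` selects a built-in pattern; `custom_pattern` (a regex string)
    overrides it, mirroring the idea of a user-supplied custom pattern.
    """
    pattern = re.compile(custom_pattern) if custom_pattern else PATTERNS[kind]
    found = (m.group(0) for m in pattern.finditer(text))
    # dict.fromkeys removes duplicates while preserving order.
    return list(dict.fromkeys(found))


if __name__ == "__main__":
    sample = (
        "Contact admin@example.com or admin@example.com from 192.168.1.10, "
        "see https://example.com/page for details"
    )
    for kind in PATTERNS:
        print(kind, extract_matches(sample, kind))
```

Deduplicating while preserving order keeps the output stable between runs, which helps when comparing results across crawls.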
- Clone the repository:
git clone https://github.com/ScribeAegis/DataHawk.git
cd DataHawk
pip install -r requirements.txt
python DataHawk.py
python DataHawk.py -q username
python DataHawk.py --proxy http://proxyserver:port
python DataHawk.py --output csv
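As a rough illustration of how scraped results might be written in the TXT, CSV, or JSON formats, the sketch below serializes a list of findings to a string. The `serialize_results` function and the `type`/`value` record layout are assumptions made for this example, not DataHawk's actual output schema.

```python
import csv
import io
import json


def serialize_results(results, fmt):
    """Serialize a list of {"type": ..., "value": ...} records.

    Hypothetical helper: the record layout is an assumption for illustration.
    """
    if fmt == "json":
        return json.dumps(results, indent=2)
    if fmt == "csv":
        buffer = io.StringIO()
        writer = csv.DictWriter(
            buffer, fieldnames=["type", "value"], lineterminator="\n"
        )
        writer.writeheader()
        writer.writerows(results)
        return buffer.getvalue()
    if fmt == "txt":
        # One finding per line, e.g. "email: admin@example.com".
        return "".join(f"{r['type']}: {r['value']}\n" for r in results)
    raise ValueError(f"unsupported format: {fmt}")


if __name__ == "__main__":
    findings = [{"type": "email", "value": "admin@example.com"}]
    print(serialize_results(findings, "csv"))
```

Returning a string rather than writing a file directly keeps the function easy to test; the caller can decide where the output goes.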
DataHawk is for educational and research purposes only. Ensure compliance with legal and ethical guidelines when using this tool.
Licensed under the MIT License.