DocsToTxt is a lightweight Python script that converts .docx files into plain .txt format locally, without requiring an internet connection. This ensures privacy, speed, and reliability when processing documents.
✅ Convert .docx to .txt without losing content
✅ Works offline – no internet required
✅ Simple and lightweight Python script
✅ Batch processing (convert multiple files in a directory)
✅ Cross-platform support: Windows, macOS, Linux
Make sure you have Python 3.x installed on your system.
git clone https://github.com/eltonbaidoo/DocsToTxt.git
cd DocsToTxt
This project requires the python-docx library for reading .docx files. Install it using:
pip install python-docx
Run the script with the file path:
python docx_to_txt.py input.docx
This will generate input.txt in the same directory.
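The naming rule above (input.docx becomes input.txt in the same directory) can be sketched with pathlib. This is an illustration only: the helper name output_path_for is hypothetical and is not taken from docx_to_txt.py, and rejecting non-.docx input is an assumption about how the script might behave.

```python
from pathlib import Path


def output_path_for(input_path):
    """Return the .txt path that sits next to the given .docx file.

    Hypothetical helper for illustration; the real script may differ.
    """
    source = Path(input_path)
    # Assumption: anything that is not a .docx file (for example .doc) is rejected.
    if source.suffix.lower() != ".docx":
        raise ValueError(f"expected a .docx file, got: {source.name}")
    # with_suffix swaps only the final extension and keeps the directory.
    return source.with_suffix(".txt")


if __name__ == "__main__":
    print(output_path_for("input.docx"))
```

Because only the final extension is replaced, a name such as q1.final.docx would map to q1.final.txt.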
Run:
python docx_to_txt.py /path/to/folder/
All .docx files in the folder will be converted to .txt.
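A minimal sketch of how a folder run could be organized, assuming a per-file converter is supplied by the caller. The names find_docx_files, convert_folder, and convert_one are hypothetical and not taken from docx_to_txt.py. Skipping files that start with "~$" (the lock files Word creates while a document is open) and continuing after a failed file are design assumptions, not documented behavior.

```python
from pathlib import Path


def find_docx_files(folder):
    """Return the .docx files directly inside folder, sorted by path."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file()
        and p.suffix.lower() == ".docx"
        and not p.name.startswith("~$")  # skip Word lock files
    )


def convert_folder(folder, convert_one):
    """Call convert_one on each .docx file and return (converted, failed).

    convert_one is any callable that takes a Path; it stands in for the
    script's real single-file conversion.
    """
    converted, failed = [], []
    for path in find_docx_files(folder):
        try:
            converted.append(convert_one(path))
        except Exception as exc:
            # Record the failure and keep going so one bad file
            # does not stop the rest of the batch.
            failed.append((path, exc))
    return converted, failed
```

Returning the failures alongside the successes lets the caller print a summary at the end instead of stopping at the first unreadable document.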
- Uses python-docx to read .docx content
- Extracts the text content (formatting such as bold or italic is not carried over)
- Saves it as a .txt file in the same location
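The steps above can be sketched roughly as follows. This is not the script's actual source: the function names are hypothetical, and writing one paragraph per line in UTF-8 is an assumption. The python-docx calls used (Document, its paragraphs property, and each paragraph's text attribute) are part of that library's public API.

```python
from pathlib import Path


def paragraphs_to_text(paragraph_texts):
    """Join paragraph strings into plain text, one paragraph per line."""
    return "\n".join(paragraph_texts)


def convert_docx_to_txt(docx_path):
    """Write the text of docx_path to a .txt file next to it; return that path."""
    # Imported here so the helper above can be used without python-docx installed.
    from docx import Document

    source = Path(docx_path)
    document = Document(str(source))
    # paragraph.text drops run-level formatting such as bold and italic.
    text = paragraphs_to_text(p.text for p in document.paragraphs)
    target = source.with_suffix(".txt")
    target.write_text(text, encoding="utf-8")
    return target
```

Note that document.paragraphs covers only body paragraphs; text inside tables would need to be read separately through document.tables.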
📄 Sample.docx
Title: Meeting Notes
Date: 2025-02-13
- Discussed project deadlines
- Assigned tasks to team members
- Next meeting: Monday
📜 Sample.txt
Meeting Notes
Date: 2025-02-13
- Discussed project deadlines
- Assigned tasks to team members
- Next meeting: Monday
Q: Does this support .doc files?
A: Not yet (support is in progress). For now this only works with .docx files. You can convert .doc to .docx using Microsoft Word or Google Docs.
Q: Will formatting be preserved?
A: Only text content is extracted. Formatting (bold, italic) is not retained in .txt files.
Q: Can I use this on Mac/Linux?
A: Yes! As long as Python is installed, it works on Windows, macOS, and Linux.
Want to improve DocsToTxt? Contributions are welcome!
- Fork the repository
- Create a new branch (feature-name)
- Commit and push changes
- Submit a pull request
This project is licensed under the MIT License. See the LICENSE file for details.
Developed by Elton Baidoo
📇 Contact: LinkedIn