Skip to content

A local translation of a Docs file to Raw Txt format

Notifications You must be signed in to change notification settings

eltonbaidoo/DocsToTxt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

📜 DocsToTxt - Convert DOCX to TXT Locally

Python
License
Repo Stars

🔹 About

DocsToTxt is a lightweight Python script that converts .docx files into plain .txt format locally, without requiring an internet connection. This ensures privacy, speed, and reliability when processing documents.


🚀 Features

✅ Convert .docx to .txt without losing content
✅ Works offline – no internet required
✅ Simple and lightweight Python script
Batch processing (convert multiple files in a directory)
✅ Cross-platform support: Windows, macOS, Linux


📌 Installation

Make sure you have Python 3.x installed on your system.

1️⃣ Clone the Repository

git clone https://github.com/eltonbaidoo/DocsToTxt.git  
cd DocsToTxt  

2️⃣ Install Dependencies

This project requires the python-docx library for reading .docx files. Install it using:

pip install python-docx  

🛠 Usage

Convert a Single File

Run the script with the file path:

python docx_to_txt.py input.docx  

This will generate input.txt in the same directory.

Convert All .docx Files in a Folder

Run:

python docx_to_txt.py /path/to/folder/  

All .docx files in the folder will be converted to .txt.


🔧 How It Works

  • Uses python-docx to read .docx content
  • Extracts text while preserving formatting
  • Saves it as a .txt file in the same location

Example Output

Original DOCX File

📄 Sample.docx

Title: Meeting Notes  
Date: 2025-02-13  

- Discussed project deadlines  
- Assigned tasks to team members  
- Next meeting: Monday  

Converted TXT File

📜 Sample.txt

Meeting Notes  
Date: 2025-02-13  

- Discussed project deadlines  
- Assigned tasks to team members  
- Next meeting: Monday  

FAQ

Q: Does this support .doc files?
A: No, Not yet (Working on it) this only works with .docx files. You can convert .doc to .docx using Microsoft Word or Google Docs.

Q: Will formatting be preserved?
A: Only text content is extracted. Formatting (bold, italic) is not retained in .txt files.

Q: Can I use this on Mac/Linux?
A: Yes! As long as Python is installed, it works on Windows, macOS, and Linux.


🌟 Contributing

Want to improve DocsToTxt? Contributions are welcome!

  1. Fork the repository
  2. Create a new branch (feature-name)
  3. Commit and push changes
  4. Submit a pull request

📜 License

This project is licensed under the MIT License. See the LICENSE file for details.


👤 Author

Developed by Elton Baidoo
📇 Contact: LinkedIn


About

A local translation of a Docs file to Raw Txt format

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages