Important
LexiClean v2 is currently being released. While it is currently testable, an updated video walk through is still pending and will be re-released ASAP. In the meantime, please consult the original systems demonstration or documentation for more information.
LexiClean is a rapid annotation tool for acquiring parallel corpora for lexical normalisation built with MongoDB, React and FastAPI. <<<<<<< HEAD
📌 Quick Links:
- Live Demo
- Demo Video
39dc557 (chore: update README for clarity and formatting improvements)
To run LexiClean using Docker, you'll need:
- Docker Engine (see: https://docs.docker.com/engine/install/)
- Docker Compose (see: https://docs.docker.com/compose/install/)
- Clone the repository:
git clone https://github.com/nlp-tlp/lexiclean.git
cd lexiclean
- Start the application:
docker compose up --build
Service | URL | Description |
---|---|---|
Frontend | http://localhost:3000 | User interface |
Backend API | http://localhost:8000 | API server |
Documentation | http://localhost:4000 | User documentation |
MongoDB | localhost:27018 | Database |
The application consists of four main services:
- Frontend (React): User interface running on port 3000
- Backend (FastAPI): API server running on port 8000
- MongoDB: Database running on port 27018
- Documentation (React, Docasaurus): Service running on port 4000
MONGODB__URI=mongodb://root:example@mongodb:27017/lexiclean?authSource=admin
MONGODB__DB_NAME=lexiclean
AUTH__SECRET_KEY=secret
AUTH__ALGORITHM=HS256
AUTH__ACCESS_TOKEN_EXPIRE_MINUTES=360
API__PREFIX=/api
VITE_API_URL=http://localhost:8000/api
VITE_DOCS_URL=http://localhost:4000
VITE_GITHUB_URL=https://github.com/nlp-tlp/lexiclean
NODE_ENV=development
MONGO_INITDB_ROOT_USERNAME=root
MONGO_INITDB_ROOT_PASSWORD=example
If you prefer to run the application without Docker, follow these steps:
- Install MongoDB (v4.4.6 or later):
- Follow the official MongoDB installation guide
- Verify MongoDB is running:
service mongod status
- Create and activate a Python virtual environment:
# Create a virtual environment in the server directory
cd server
python -m venv venv
# Activate the virtual environment
# On Windows:
venv\Scripts\activate
# On macOS/Linux:
source venv/bin/activate
- Install dependencies:
# Install backend dependencies
cd server
pip install -r requirements.txt
# Install frontend dependencies
cd ../client
npm install
# Optional: Install documentation dependencies
cd ../docs
npm install
-
Set up environment variables using the
.env.example
files as examples -
Start the services manually:
# Start backend
cd server
uvicorn main:app --reload
# Start frontend (in a new terminal)
cd client
npm run dev
# Optiona: Start the documentation server (in a new terminal)
cd docs
npm run start
To run the application in development mode with hot-reloading enabled:
# Start all services in development mode
docker compose -f docker-compose.dev.yml up
# To run in detached mode (background)
docker compose -f docker-compose.dev.yml up -d
# To rebuild containers after dependency changes
docker compose -f docker-compose.dev.yml up --build
The development configuration (docker-compose.dev.yml
) enables:
- Hot-reloading for both frontend and backend changes
- Volume mounting of local files into containers
- Development server configurations
- Exposed ports for debugging
Your changes to the following directories will automatically reflect in the running containers:
./client
- Frontend React application./server
- Backend FastAPI application./docs
- Documentation site
To stop the development servers:
# If running in foreground, use Ctrl+C
# If running in detached mode:
docker compose -f docker-compose.dev.yml down
Please cite our [conference paper] if you find it useful in your research:
@inproceedings{bikaun2021lexiclean,
title={LexiClean: An annotation tool for rapid multi-task lexical normalisation},
author={Bikaun, Tyler and French, Tim and Hodkiewicz, Melinda and Stewart, Michael and Liu, Wei},
booktitle={Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations},
pages={212--219},
year={2021}
}
Please email any feedback or questions to Tyler Bikaun ([email protected])
This project is licensed under the MIT License - see the LICENSE file for details.
- Open an issue using the appropriate template
- Provide clear description and steps to reproduce (for bugs)
- Include relevant environment details or examples
- Fork and create a branch
- Make changes following our code style
- Test your changes
- Submit a PR with a clear description