Token-Visualizer

Token-Visualizer is a visual tool designed to analyze and display the tokenization process of text inputs using various NLP models. The tool allows users to input text and visualize how different pre-trained models (like BERT, RoBERTa, and DistilBERT) tokenize the text. The tokens are then displayed with color coding based on their embeddings, providing an insightful view into the tokenization and attention mechanism of the selected model.

Features

Text Tokenization: Enter any text, and the tool will tokenize it using the selected NLP model.
Multiple Model Support: Choose from popular NLP models such as:
- BERT Base Uncased
- RoBERTa Base
- DistilBERT Base
Token Visualization: Tokens are displayed with color-coding based on their embedding magnitudes, providing a clear visual representation of their significance.
Real-time Feedback: The tool updates the tokenized view and embeddings as you interact with the interface.
Interactive Interface: The tool features an easy-to-use UI with options to input text, select models, and see the tokenization results instantly.

Tech Stack

Frontend: HTML, CSS, JavaScript (Widgets for UI components)
Backend: Python, Flask (Server-side logic and tokenization)
Machine Learning Models: Hugging Face's transformers library (BERT, RoBERTa, DistilBERT)

Prerequisites

Before you can run the Token-Visualizer tool locally, make sure you have the following installed:

Python 3.6+
pip (Python package installer)

Additionally, you will need to install the required Python libraries. These can be installed using pip by running the following command:

pip install -r requirements.txt

Deployment Instructions -- Windows

1.Set Environment Variables: Set the environment variables for Flask. In the command prompt, run:

set FLASK_APP=app.py
set FLASK_ENV=development

2.Run the Flask Application: Start the Flask application by running the following command:

flask run

This will start the Flask server, and you can access your application in the browser at

http://127.0.0.1:5000/.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
__pycache__		__pycache__
static		static
templates		templates
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
wsgi.py		wsgi.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Token-Visualizer

Features

Tech Stack

Prerequisites

Deployment Instructions -- Windows

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Token-Visualizer

Features

Tech Stack

Prerequisites

Deployment Instructions -- Windows

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages