Skip to content

halannhile/topic-modeling-app

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

117 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

EasyTopics

A web app to detect, analyze and visualize topics from your documents using BERTopic.

Developers: Chris Tam, Ian Bulovic, Nhi Le (Brandeis University)

Link to app

Project report

Table of contents:

Table of contents generated with markdown-toc

App overview

Note: Some of the pages in the app might take a few minutes to load because of the underlying BERTopic model.

Homepage

This page shows all the documents you have uploaded, classified into two upload types: documents (for individual file uploads), and dataset (for zip uploads).

You can filter and delete documents by various criteria.

Upload Documents page

You can either:

  1. Upload a single document and run a pretrained topic model on it (BERTopic Wikipedia in this case):

  1. Upload a dataset in the form of a zip folder and train a topic model. Note: this may take a while depending on the size of your dataset.

Topic Modeling Results page

On this page, you are given the option to visualize topic modeling results on document uploads or dataset uploads, using either the pretrained BERTopic Wikipedia model, or your own model.

Custom Model Visualizations page

You can also view different topic visualizations of your trained model:

Running the app

Option 1 (recommended): Docker

Note: it may take a few minutes for the build process to complete.

  1. Building the image:
docker build -t topic-modeling-app .
  1. Running the app:
 docker run -p 8501:8501 topic-modeling-app

Navigate to http://localhost:8501 to access app.

Option 2: Running from source code

  1. Setup virutal environment:
conda create -n topic-modeling-app python=3.11
conda activate topic-modeling-app
python -m pip install -r requirements.txt
  1. cd into app directory and run the streamlit app:
python -m streamlit run 1_Your_Documents.py

Running unit tests

cd into app directory:

python -m unittest discover -s unit_tests

About

a full-stack web app to detect, analyze and visualize topics from your documents using BERTopic 📄

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors