This folder contains scripts for downloading, reading and preprocessing data for chat-bot training:
download_cornell.sh
- downloads Cornell movie dialogues dataset (small size)download_opensubs.sh
- downloads Opensubs movie subtitles dataset (large size)datasets.py
- module to be imported in your scripts, that exports functions for reading a datasetexample.py
- example of reading the dataset