Skip to content

Files

This branch is 13 commits behind hse-aml/natural-language-processing:master.

honor

Utils to download and read data for chat-bot training

This folder contains scripts for downloading, reading and preprocessing data for chat-bot training:

  • download_cornell.sh - downloads Cornell movie dialogues dataset (small size)
  • download_opensubs.sh - downloads Opensubs movie subtitles dataset (large size)
  • datasets.py - module to be imported in your scripts, that exports functions for reading a dataset
  • example.py - example of reading the dataset