colab notebook: https://colab.research.google.com/drive/1HFpva6olZ551Dru7P42VcXlcsVOP_FwE?usp=sharing Preprocessing of data is being done locally, so that we don't run into colab usage limits