This is the code for my various submissions at the data challenge [TO BE REVEALED]
- Preprocessing the entire training set before cross-validation is slightly cheating - maybe implement a new wrapper class and use pipelines?
- Subsample pre-computed KS features to enable usage