This is my submission for Tech In Asia's Data Engineering/Scientist recruitment assignment.
- Data scraping from the API is in `data_collection.py`.
- Data analysis/visualization was done in `TIA_data_analysis.ipynb`. For easy viewing without the need to download Jupyter Notebook, view the file on GitHub directly, or download the `TIA_data_analysis.html` file and view it in your internet browser.
- Scraped data is stored in the `data` folder.
- The `output` folder contains output from data visualizations.
- The `test` folder contains tests for models built.
- Install `anaconda`. URL: https://docs.continuum.io/anaconda/install/
- Open `environment.yml` with a text editor and change the `prefix` path to the path of your anaconda distribution.
- Create the environment: `conda env create -f environment.yml`

You should now be able to activate that environment with `source activate tia` (Mac / Linux) or `activate tia` (Windows).
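
If you would rather not edit `environment.yml` by hand, the `prefix` change can be scripted. The following is only a minimal sketch, not part of this repository: the helper name `set_conda_prefix`, the sample file contents, and both paths are hypothetical, and it assumes `prefix:` sits on its own top-level line in the file.

```python
import re


def set_conda_prefix(env_yaml: str, new_prefix: str) -> str:
    """Return the YAML text with its top-level `prefix:` line set to new_prefix.

    Hypothetical helper for illustration; it works on plain text and does not
    parse YAML, so it assumes `prefix:` starts at the beginning of a line.
    """
    line = f"prefix: {new_prefix}"
    pattern = re.compile(r"^prefix:.*$", flags=re.MULTILINE)
    if pattern.search(env_yaml):
        # A function replacement keeps backslashes in Windows paths literal.
        return pattern.sub(lambda _match: line, env_yaml, count=1)
    # No prefix line present: append one at the end of the file.
    return env_yaml.rstrip("\n") + "\n" + line + "\n"


# Made-up file contents and paths, for illustration only.
sample = (
    "name: tia\n"
    "dependencies:\n"
    "  - python\n"
    "prefix: /Users/someone/anaconda3\n"
)
updated = set_conda_prefix(sample, "/home/me/anaconda3")
```

To apply it to the real file, read `environment.yml` with `pathlib.Path.read_text()`, pass the text and your own anaconda path to the helper, and write the result back with `write_text()` before running `conda env create -f environment.yml`.
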
- Celeste Ang (LinkedIn)