The data is publicly available from the National Science Foundation.
Awards up to 2022 were manually downloaded in accordance with NSFs robots.txt file. Files were unzipped by year and placed in the data/ subdirectory but not included in the repository.
We then use Gensim to run topic modeling via LDA. The analysis is contained in the 'topic-modeling.ipynb' notebook.