Remove PII from BigQuery tables and export them as CSV files
Ensure you have Python 3.10.4 installed (as specified in .python-version), configure your Poetry environment to use the correct Python interpreter, and install the dependencies:
pyenv install 3.10.4
poetry env use $(pyenv which python)
poetry installDownload an appropriate Google Cloud service account key to your file system, then copy .env.dist as .env and fill in the values.
Copy config.toml.dist as config.toml and enter the course IDs and queries you want to perform using the template.
- Enter as many
course_idsand[[table]]blocks as required. - Select from
full_table_namein the query. The fully-qualified table name will be interpolated at runtime.
poetry run python app.pyThe requested CSV files will be written to <date>/exported/<course_id>/<table_name>.csv.