VD data pipeline

Code to create tabular versions of the VD17 and VD18 bibliographic databases.

Runnning the pipeline

Install Python dependencies using your favourite method. Then, just run make -j. This will:

Download the VD17 and VD18 data from the respective websites.
Download authority data referred to from the VD17 and VD18 data (along with a few additional GND ids) from the GND.
Convert the downloaded data into Parquet and TSV formats using bibxml2.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
data		data
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
.lintr		.lintr
.pre-commit-config.yaml		.pre-commit-config.yaml
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml