Skip to content

hsci-r/vd-data-pipeline

Repository files navigation

VD data pipeline

Code to create tabular versions of the VD17 and VD18 bibliographic databases.

Runnning the pipeline

Install Python dependencies using your favourite method. Then, just run make -j. This will:

  1. Download the VD17 and VD18 data from the respective websites.
  2. Download authority data referred to from the VD17 and VD18 data (along with a few additional GND ids) from the GND.
  3. Convert the downloaded data into Parquet and TSV formats using bibxml2.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •