This script is turning the shopify taxonomy into a .csv format with mappings that can be used for SQL databases.
For each type (vertical, category, attribute, attribute value, extended attribute) there is a .csv file with the English translations. Each file has an additional localizations_XX.csv that includes all the languages codes specified in the main.py script.
- Drop the latest distribution from Shopify/product-taxonomy/tree/main/dist into
data/input/{version_name}whereversion_nameis the version fromtaxonomy.json(e.g.2025-06-unstable). - Update the version in
main.pyand specify the target languages for localization. - Run the script and see the output files in
data/output/{version_name}.
It uses different .json files from the shopify dist folder as input to create .csv files with a serial id in addition to the shopify gid indentifier. These id are used to create _mappings.csv files that can be used for junction tables.
The script uses .txt files (where available) for translations since the structure is simpler. Since verticals and extended attributes don't have .txt files, these translations come from the .json files instead.