
Idea: performance improvement: Use DuckDB as an intermediate layer. #70

@matthiasfeist

DuckDB is exceptionally fast at reading CSV files, and it can also write data directly into Postgres.

For the Node-GTFS project, I ran a quick benchmark here: BlinkTagInc/node-gtfs#148 (comment)
With DuckDB as the intermediate layer, the imported files could be modified and cleaned up directly in DuckDB and then exported to Postgres: https://duckdb.org/docs/extensions/postgres.html#writing-data-to-postgresql
