GitHub - embryo-labs/dynamic-predicate-transfer: [VLDB'26] This repository provides a DuckDB implementation of RPT+, following a Yannakakis-style execution framework.

Dynamic Predicate Transfer (RPT+)

This repository contains the implementation of Dynamic Predicate Transfer (RPT+), built on top of DuckDB v1.3.0.

Compared to the original Robust Predicate Transfer (RPT), RPT+ introduces several key optimizations. For technical details, please refer to the paper:

Robust Predicate Transfer with Dynamic Execution (PVLDB 2026, to appear) · Yiming Qiao, Peter Boncz, Huanchen Zhang

Quick Start

Follow these steps to build RPT+ and run a demonstration of the multi-way join optimization.

1. Build & Launch

Compile the project and start the DuckDB shell:

make release
./build/release/duckdb

2. Run Example

Run the following SQL to create three tables of 5M rows each and perform a 3-way join. RPT+ automatically transfers the filters across the joined tables to reduce the work done by the joins.

-- 1. Setup Data
CREATE TABLE A AS SELECT i AS id1, i AS id2 FROM range(1, 5000001) AS t(i);
CREATE TABLE B AS SELECT i AS id1, i AS id2 FROM range(1, 5000001) AS t(i);
CREATE TABLE C AS SELECT i AS id1, i AS id2 FROM range(1, 5000001) AS t(i);

-- 2. Execute Query
EXPLAIN ANALYZE        -- Run the query and show the profiled execution plan
SELECT count(*)
FROM A JOIN B ON A.id1 = B.id1 JOIN C ON B.id1 = C.id1
WHERE A.id2 % 2 = 0    -- Filter on A
  AND B.id2 % 7 = 0    -- Filter on B
  AND C.id2 % 13 = 0;  -- Filter on C

The query plan demonstrates a chain forward pass (C → B → A) followed by a broadcast backward pass (A → B & C).
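
The two passes can be illustrated outside DuckDB with a small Python sketch. It is only a toy model, under the assumption that each transfer behaves like an exact semi-join on the join key; it does not reproduce how RPT+ builds or applies its filters inside the engine. The helper names (`make_table`, `semi_join`, `keys_of`) and the scaled-down table size `N` are illustrative and do not come from the repository.

```python
# Toy model of the forward and backward passes on the example query.
# Assumption: each transfer is modeled as an exact semi-join on the join key.
N = 10_000  # scaled down from the 5M rows used in the SQL example


def make_table(n):
    """Rows are (id1, id2) pairs with id1 == id2, as in the SQL setup."""
    return [(i, i) for i in range(1, n + 1)]


def keys_of(rows):
    """Collect the join keys (id1) of a table."""
    return {r[0] for r in rows}


def semi_join(rows, keys):
    """Keep only rows whose join key id1 appears in `keys`."""
    return [r for r in rows if r[0] in keys]


# Apply the local filters from the WHERE clause first.
a = [r for r in make_table(N) if r[1] % 2 == 0]
b = [r for r in make_table(N) if r[1] % 7 == 0]
c = [r for r in make_table(N) if r[1] % 13 == 0]

# Forward pass along the chain C -> B -> A.
b = semi_join(b, keys_of(c))
a = semi_join(a, keys_of(b))

# Backward pass: A is now fully reduced and is broadcast to B and C.
a_keys = keys_of(a)
b = semi_join(b, a_keys)
c = semi_join(c, a_keys)

# Keys are unique in every table, so the join output size equals the
# size of each fully reduced table (ids divisible by 2 * 7 * 13 = 182).
join_count = len(a)
print(len(a), len(b), len(c), join_count)
```

After both passes, every table holds only rows that contribute to the join result. By the same reasoning, at the full 5M-row scale the query's count(*) should be 5,000,000 // 182 = 27,472.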

Advanced Build Configuration

RPT+ follows the same build process as DuckDB. For customized builds:

make                   # Build optimized release version
make release           # Same as 'make'
make debug             # Build with debug symbols
GEN=ninja make         # Use Ninja as backend
BUILD_BENCHMARK=1 make # Build with benchmark support

For more flags, see the DuckDB Build Configuration Guide.

Original RPT Support

To use the original RPT implementation on DuckDB 1.3.0, apply the vanilla patch:

git apply APPLY_ME_TO_GET_VANILLA_RPT.patch

Benchmark

DuckDB ships with built-in implementations of several benchmarks. You can build and run them as follows.

TPC-H (SF=100)

BUILD_BENCHMARK=1 BUILD_TPCH=1 BUILD_TPCDS=1 BUILD_HTTPFS=1 CORE_EXTENSIONS='tpch' make -j$(nproc)
build/release/benchmark/benchmark_runner "benchmark/large/tpch-sf100/.*.benchmark" --threads=8

Join Order Benchmark (JOB)

BUILD_BENCHMARK=1 BUILD_TPCH=1 BUILD_TPCDS=1 BUILD_HTTPFS=1 CORE_EXTENSIONS='tpch' make -j$(nproc)
build/release/benchmark/benchmark_runner "benchmark/imdb/.*.benchmark" --threads=8

Appian Benchmark

BUILD_BENCHMARK=1 BUILD_TPCH=1 BUILD_TPCDS=1 BUILD_HTTPFS=1 CORE_EXTENSIONS='tpch' make -j$(nproc)
build/release/benchmark/benchmark_runner "benchmark/appian_benchmarks/.*.benchmark" --threads=8
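
The three run commands above differ only in the benchmark pattern, so a small helper can assemble them for scripted runs. The following is a minimal sketch: the runner path, patterns, and `--threads` flag are copied from this README, while `SUITES`, the suite keys, and `runner_command` are hypothetical names introduced for illustration.

```python
# Sketch: assemble the benchmark_runner command line for each suite above.
# Paths, patterns, and the --threads flag are taken from this README;
# the dictionary keys and function name are illustrative only.
import shlex

RUNNER = "build/release/benchmark/benchmark_runner"

SUITES = {
    "tpch-sf100": "benchmark/large/tpch-sf100/.*.benchmark",
    "job": "benchmark/imdb/.*.benchmark",
    "appian": "benchmark/appian_benchmarks/.*.benchmark",
}


def runner_command(suite, threads=8):
    """Return the argument list for one suite; raises KeyError if unknown."""
    return [RUNNER, SUITES[suite], f"--threads={threads}"]


for name in SUITES:
    # shlex.join quotes the pattern so a shell would not expand the '*'.
    print(shlex.join(runner_command(name)))
```

The returned list can be passed directly to `subprocess.run(cmd, check=True)` from the repository root once the build has finished.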

SQLStorm

To run the SQLStorm benchmark:

1. Clone and set up the benchmark framework from the SQLStorm repository.
2. Download the StackOverflow Math dataset and load it according to SQLStorm’s setup instructions.
3. The list of queries that are executable with DuckDB is available here.

To build a DuckDB executable for SQLStorm, run:

BUILD_BENCHMARK=1 BUILD_TPCH=1 BUILD_TPCDS=1 BUILD_HTTPFS=1 CXXFLAGS="-DUSE_LOCK_BF=1 -DUSE_SQLSTORM_DP_CONDITION=1" make -j$(nproc)

The resulting DuckDB executable is placed at ./build/release/duckdb.

Note:
The following section is the unmodified original README of DuckDB.

DuckDB

DuckDB is a high-performance analytical database system. It is designed to be fast, reliable, portable, and easy to use. DuckDB provides a rich SQL dialect, with support far beyond basic SQL. DuckDB supports arbitrary and nested correlated subqueries, window functions, collations, complex types (arrays, structs, maps), and several extensions designed to make SQL easier to use.

DuckDB is available as a standalone CLI application and has clients for Python, R, Java, Wasm, etc., with deep integrations with packages such as pandas and dplyr.

For more information on using DuckDB, please refer to the DuckDB documentation.

Installation

If you want to install DuckDB, please see our installation page for instructions.

Data Import

For CSV files and Parquet files, data import is as simple as referencing the file in the FROM clause:

SELECT * FROM 'myfile.csv';
SELECT * FROM 'myfile.parquet';

Refer to our Data Import section for more information.

SQL Reference

The documentation contains a SQL introduction and reference.

Development

For development, DuckDB requires CMake, Python3 and a C++11 compliant compiler. Run make in the root directory to compile the sources. For development, use make debug to build a non-optimized debug version. You should run make unit and make allunit to verify that your version works properly after making changes. To test performance, you can run BUILD_BENCHMARK=1 BUILD_TPCH=1 make and then perform several standard benchmarks from the root directory by executing ./build/release/benchmark/benchmark_runner. The details of benchmarks are in our Benchmark Guide.

Please also refer to our Build Guide and Contribution Guide.

Support

See the Support Options page.
