This repository provides a reference implementation of the Relational Transformer architecture from the paper: Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data.
- Install [pixi](https://pixi.sh/latest/#installation).
- Clone and install the repository:

  ```bash
  git clone https://github.com/snap-stanford/relational-transformer
  cd relational-transformer
  pixi install

  # compile and install the Rust sampler
  cd rustler
  pixi run maturin develop --uv --release
  ```
- Download the datasets and tasks from RelBench:

  ```bash
  cd ..  # back to the root of the repository
  pixi run python scripts/download_relbench.py
  ```

- Link the cache directory:

  ```bash
  mkdir ~/scratch
  ln -s ~/.cache/relbench ~/scratch/relbench
  ```
- Preprocessing (per database):

  ```bash
  cd rustler
  pixi run cargo run --release -- pre rel-f1
  ```

- Text embedding (per database):

  ```bash
  pixi run python -m rt.embed rel-f1
  ```
Note: the preprocessing and text embedding steps should be run for all databases: `rel-amazon`, `rel-avito`, `rel-event`, `rel-f1`, `rel-hm`, `rel-stack`, `rel-trial` (a convenience loop is sketched below).
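A small shell loop along these lines can run both steps for every database. This is an illustrative sketch, not a script shipped with the repository; it assumes it is started from the repository root and that the embedding command works from there, mirroring the per-database commands above.

```bash
# Sketch: preprocess and text-embed every RelBench database in turn.
# Assumes the current directory is the repository root; adjust paths if your setup differs.
for db in rel-amazon rel-avito rel-event rel-f1 rel-hm rel-stack rel-trial; do
  (cd rustler && pixi run cargo run --release -- pre "$db")  # preprocessing
  pixi run python -m rt.embed "$db"                          # text embedding
done
```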
This project's preprocessed data is hosted at `hvag976/relational-transformer` on the Hugging Face Hub. You can use this data directly and skip the data preparation steps above.

- Install the CLI:

  ```bash
  pip install -U huggingface_hub
  ```

- Create the destination:

  ```bash
  mkdir -p ~/scratch/pre
  ```

- Download the repository contents into `~/scratch/pre`:

  ```bash
  huggingface-cli download hvag976/relational-transformer \
    --repo-type dataset \
    --local-dir ~/scratch/pre \
    --local-dir-use-symlinks False
  ```
Set up wandb with:

```bash
pixi run wandb login
```

or disable it with:

```bash
pixi run wandb disabled
```
The following example commands replicate the results for `rel-amazon/user-churn`.

Pretrain on all datasets with `rel-amazon` held out (takes about 2 hours on 8xA100 GPUs):

```bash
pixi run torchrun --standalone --nproc_per_node=8 scripts/pretrain.py
```

Continue pretraining from the best checkpoint obtained above on all `rel-amazon` tasks with `user-churn` held out (takes about 15 minutes on 8xA100 GPUs):

```bash
pixi run torchrun --standalone --nproc_per_node=8 scripts/contd_pretrain.py
```

Finetune from the best checkpoint obtained above on the `rel-amazon/user-churn` task only (takes about 1.5 hours on 8xA100 GPUs):

```bash
pixi run torchrun --standalone --nproc_per_node=8 scripts/finetune.py
```
Pretrained checkpoints can be downloaded from: TODO. To use one, pass the path to the checkpoint via the `load_ckpt_path` argument of the training scripts.
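For example, to finetune from a downloaded checkpoint (illustrative only: the checkpoint path is a placeholder, and the flag form assumes the scripts accept `load_ckpt_path` as a command-line argument):

```bash
# Illustrative sketch: point finetuning at a pretrained checkpoint (placeholder path).
pixi run torchrun --standalone --nproc_per_node=8 scripts/finetune.py \
  --load_ckpt_path /path/to/pretrained_checkpoint.pt
```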
```bibtex
@misc{ranjan2025relationaltransformer,
  title={Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data},
  author={Rishabh Ranjan and Valter Hudovernik and Mark Znidar and Charilaos Kanatsoulis and Roshan Upendra and Mahmoud Mohammadi and Joe Meyer and Tom Palczewski and Carlos Guestrin and Jure Leskovec},
  year={2025},
  eprint={2510.06377},
  archivePrefix={arXiv},
  primaryClass={cs.LG},
  url={https://arxiv.org/abs/2510.06377},
}
```