TabDiff easy run on just numerical features

Environment Setup

Create the main environment with tabdiff.yaml. Then update pytorch version xd.

conda env create -f tabdiff.yaml
conda activate tabdiff
pip install torch==2.5.0

Dataset preparation

Save your tabular data as <NAME_OF_YOUR_DATASET>.csv file.

The first row should be the header indicating the name of each column, and the remaining rows are records. Place you data's csv file in the directory named "datacsv".

Run the following command to process your dataset:

python process_dataset.py --dataname <NAME_OF_YOUR_DATASET>

Training TabDiff and sampling

Config path: tabdiff/configs/tabdiff_configs.toml

model will generate a sample every check_eval_every epochs

To train a model across the entire table, run

python main.py --dataname <NAME_OF_DATASET> --mode train --exp_name <experiment_name>

Results will be saved in tabdiff/<NAME_OF_YOUR_DATASET>/<experiment_name>/

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
datacsv		datacsv
eval		eval
src		src
tabdiff		tabdiff
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
eval_impute.py		eval_impute.py
main.py		main.py
process_dataset.py		process_dataset.py
synthcity.yaml		synthcity.yaml
tabdiff.yaml		tabdiff.yaml
utils_train.py		utils_train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TabDiff easy run on just numerical features

Environment Setup

Dataset preparation

Training TabDiff and sampling

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TabDiff easy run on just numerical features

Environment Setup

Dataset preparation

Training TabDiff and sampling

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages