Skip to content

Add reproducibility when fitting a synthesizer #2022

Open
@srinify

Description

@srinify

Problem Description

I want to improve my ability to evaluate synthesizers with different parameters, in different environments, and against each other.

Expected behavior

As a user, I'd like the Synthesizer models to be fit in the same way so I can generate the same synthetic data every time.

Potential API

There are situations when you want a slightly different model to be trained. So reproducibility may be something we try to incorporate with a parameter:

synthesizer.fit(original_data, random_state=1)

Additional context

Originally raised here: sdv-dev/CTGAN#380 (comment)

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions