Updated Dataset #119

@GriffinYoung

Hi,
The original PLINDER dataset was a really rigorous way of assessing leakage and constructing benchmarks. With the inevitable march of time, however, it is now out of date and cannot be used to assess, e.g., the latest Protenix model, which has been trained on data up to mid-2025. Are there plans to release an updated version of the dataset? Or, in lieu of that, can someone point me to how I would use the code in this repo to generate an updated version myself?
