Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to combine OC22 and OC20 dataset for model training using fairchem? #1026

Open
zouzihan123 opened this issue Feb 23, 2025 · 0 comments
Open

Comments

@zouzihan123
Copy link

What would you like to report?

Dear fairchem staff, I want to make a dataset by mixing OC20 and OC22. But, in the Open Catalysis project, I found that the data formats provided for OC20 and OC22 are not the same. Where, OC22 is provided as .lmdb, but OC20 (training set) is provided as .extxyz.xz. I would like to ask if fairchem project has code that reads and combines the two formats of the dataset together, I have only found code for loading .lmdb before.
Sincerely

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant