Uniprot #470
is there something you still need help with?
data/tabular/uniprot/transform.py
Outdated
def load_dataset() -> pd.DataFrame:
    uniprot = pd.read_csv("reactions_sentences.csv")
    uniprot.to_csv("data_clean.csv", index=False)
this would still need to be updated to read from something we can download
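One possible way to address this, as a sketch only: `pd.read_csv` accepts a URL as well as a local path, so `load_dataset` could take its source as a parameter. The `DATA_URL` value and the `source`/`out_path` parameters below are assumptions for illustration, not the location or signature used in this PR.

```python
import pandas as pd

# Hypothetical placeholder: replace with wherever reactions_sentences.csv is hosted.
DATA_URL = "https://example.com/reactions_sentences.csv"


def load_dataset(source: str = DATA_URL, out_path: str = "data_clean.csv") -> pd.DataFrame:
    """Read the raw table from a downloadable source and write the cleaned CSV."""
    uniprot = pd.read_csv(source)  # accepts a URL or a local file path
    uniprot.to_csv(out_path, index=False)
    return uniprot
```

Passing a local path as `source` keeps the function usable offline while the hosted copy is being set up.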
data/tabular/uniprot/meta.yaml
Outdated
- id: sentences
  description: sentences describing the catalytic activity of a protein
  names:
    - noun: catalytic activity
- id: reactions
  description: biochemical reactions catalyzed by a protein
  names:
    - noun: chemical reactions
    - noun: biochemical reactions
can you also get something about the binding sites and the function description?
thanks for giving this a look!
Co-authored-by: Michael Pieler <[email protected]>
some minor changes, please run export to verify, then we can merge! 👍
Co-authored-by: Kevin M Jablonka <[email protected]>
ok, I'll try to fix the CI and try sampling once I'm back from groceries
Thanks for the updates, @AdrianM0! Really appreciate that you squeezed this in at the last minute!
There is a bit of leakage in some of the descriptions
dropped duplicates now too, cleaned up some strings, but now looks nice to me!
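For readers following along, a minimal sketch of this kind of cleanup in pandas, assuming the text lives in string columns. The `clean_strings` helper and the column names in the usage note are hypothetical and are not the code committed in this PR.

```python
import pandas as pd


def clean_strings(df: pd.DataFrame, text_columns: list[str]) -> pd.DataFrame:
    """Trim and collapse whitespace in the given text columns, then drop duplicate rows."""
    out = df.copy()
    for col in text_columns:
        # Strip leading/trailing whitespace and collapse internal runs to one space.
        out[col] = out[col].str.strip().str.replace(r"\s+", " ", regex=True)
    # Normalizing first lets rows that differed only in whitespace be deduplicated.
    return out.drop_duplicates().reset_index(drop=True)
```

It could be called as `clean_strings(uniprot, ["sentences", "reactions"])`, assuming those are the column names in the table.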