Question about Uniprot IDs #1

GainGod-Xu · 2024-11-05T06:34:31Z

Hi Zhaohan,

FusionDTI is fantastic work. I am trying to test it for drug-protein interaction prediction, but I am unsure how to obtain the UniProt IDs for all the protein sequences in the bindingdb.csv file. Could you give more guidance on this step?

"The first step, if you do not have Uniprot IDs, you will need to obtain them from the UniProt website based on existing amino acid sequences, protein names, etc. Then save them as a comma-delimited text file."

Thanks for your help in advance!

ZhaohanM · 2024-11-05T10:02:23Z

Thank you for your interest in our work. Below I outline how to obtain the UniProt IDs for the three publicly available datasets:

BindingDB Dataset: The BindingDB website provides the files "BindingDB_UniProt.txt" and "BindingDBTargetSequences.fasta". By matching the entries in these two files, you can extract the UniProt IDs for all amino acid sequences.
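A minimal sketch of this matching step, not the exact script used for FusionDTI. It assumes the mapping file is tab-separated with a target key in the first column and a UniProt ID in the second, and that the first token of each FASTA header is that same key. Both layouts are assumptions, so check the real files and adjust the column positions. The function names and the toy data are hypothetical.

```python
import csv
import io


def read_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA file-like object."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def sequence_to_uniprot(fasta_handle, mapping_handle):
    """Join FASTA records to a two-column ID table, keyed by sequence."""
    # ASSUMPTION: tab-separated rows of (target key, UniProt ID).
    key_to_uniprot = {}
    for row in csv.reader(mapping_handle, delimiter="\t"):
        if len(row) >= 2:
            key_to_uniprot[row[0].strip()] = row[1].strip()

    seq_to_uniprot = {}
    for header, seq in read_fasta(fasta_handle):
        # ASSUMPTION: the first token of the FASTA header is the target key.
        tokens = header.split()
        key = tokens[0] if tokens else ""
        if key in key_to_uniprot:
            # Keep the first ID seen for a given sequence.
            seq_to_uniprot.setdefault(seq.upper(), key_to_uniprot[key])
    return seq_to_uniprot


# Toy data with placeholder IDs; real files would be opened with open(...).
demo_fasta = io.StringIO(">t1 demo target\nMKTAYIAK\nQRQISFVK\n>t2 other\nMSDNE\n")
demo_map = io.StringIO("t1\tUNIPROT_A\nt3\tUNIPROT_C\n")
demo_result = sequence_to_uniprot(demo_fasta, demo_map)
print(demo_result)
```

The resulting dictionary can then be used to look up each sequence in bindingdb.csv. Sequences without a match are simply absent from the dictionary.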

BioSNAP Dataset: The current BioSNAP dataset already includes the corresponding UniProt IDs, so no additional processing is required.

Human Dataset: First, download the human amino acid sequences from the UniProt database; these should also have corresponding 3D structures on AlphaFoldDB (roughly 8,500 sequences). You can then match the sequences in the Human dataset against the downloaded UniProt sequences to obtain the respective UniProt IDs.
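As a rough illustration of the matching for the Human dataset, the sketch below indexes a UniProt FASTA download by sequence and looks up each dataset sequence by exact match. It relies on the standard UniProt FASTA header layout (`>sp|ACCESSION|ENTRY_NAME ...`). The function names and the demo records (including the accessions `ACC001` and `ACC002`) are hypothetical placeholders.

```python
import re

# UniProt FASTA headers look like "sp|ACCESSION|ENTRY_NAME description".
HEADER_RE = re.compile(r"^(?:sp|tr)\|([^|]+)\|")


def index_uniprot_fasta(text):
    """Map each sequence in a UniProt FASTA string to its accession."""
    index = {}
    # Split on ">" only at the start of a line, so ">" inside a
    # description does not break a record apart.
    for record in re.split(r"^>", text, flags=re.MULTILINE)[1:]:
        header, _, body = record.partition("\n")
        match = HEADER_RE.match(header)
        if match:
            sequence = "".join(body.split()).upper()
            index.setdefault(sequence, match.group(1))
    return index


def annotate(sequences, index):
    """Return (sequence, accession or None) pairs via exact matching."""
    return [(s, index.get(s.strip().upper())) for s in sequences]


# Placeholder records; a real run would read the downloaded FASTA file.
demo_text = (
    ">sp|ACC001|DEMO1_HUMAN Demo protein one\nMKTAYIAK\nQRQISFVK\n"
    ">sp|ACC002|DEMO2_HUMAN Demo protein two\nMSDNE\n"
)
demo_index = index_uniprot_fasta(demo_text)
demo_pairs = annotate(["MKTAYIAKQRQISFVK", "AAAA"], demo_index)
print(demo_pairs)
```

Exact matching is the simplest choice here. Sequences that return `None` have no identical entry in the download and would need a separate search, for example by protein name.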

In addition, if the UniProt ID or the corresponding 3D structure file (.cif) is not available, the 3D structure can be predicted directly from the amino acid sequence with the AlphaFold model.

GainGod-Xu · 2024-11-05T14:39:23Z

Thanks for your quick response!

Now I understand that UniProt IDs allow us to retrieve 3D structures from AlphaFoldDB. I just want to confirm: is this primarily to save time?

ZhaohanM · 2024-11-05T14:50:51Z

Thank you for your question! Yes, precisely: the UniProt IDs serve as a bridge to quickly retrieve the 3D structures from AlphaFoldDB, which makes the process more efficient.
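To make the "bridge" concrete, here is a small sketch of fetching a .cif file from AlphaFoldDB by UniProt ID. It assumes the public file URL pattern `AF-<ID>-F1-model_v<version>.cif`. The version number (4 here) is an assumption that changes between AlphaFoldDB releases, and the helper names and the `structures` output directory are hypothetical.

```python
import urllib.error
import urllib.request
from pathlib import Path

AFDB_FILES = "https://alphafold.ebi.ac.uk/files"


def alphafold_cif_url(uniprot_id, version=4):
    """Build the AlphaFoldDB .cif URL for a UniProt ID."""
    # ASSUMPTION: the model version suffix; check the current AlphaFoldDB release.
    return f"{AFDB_FILES}/AF-{uniprot_id}-F1-model_v{version}.cif"


def download_cif(uniprot_id, out_dir="structures", version=4):
    """Download the .cif file once; return its path, or None if unavailable."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    target = out / f"{uniprot_id}.cif"
    if target.exists():
        return target  # already downloaded
    try:
        urllib.request.urlretrieve(alphafold_cif_url(uniprot_id, version), str(target))
    except urllib.error.HTTPError:
        return None  # no entry for this ID (or a different model version)
    return target


# P69905 is the UniProt accession of human hemoglobin subunit alpha.
demo_url = alphafold_cif_url("P69905")
print(demo_url)
```

Calling `download_cif("P69905")` would fetch the file over the network. A `None` result marks the proteins for which the structure would have to be predicted from the sequence instead.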

GainGod-Xu · 2024-11-05T14:51:33Z

Thanks, Zhaohan!

GainGod-Xu closed this as completed Nov 5, 2024

ZhaohanM pinned this issue Nov 18, 2024

ZhaohanM unpinned this issue Nov 18, 2024

ZhaohanM reopened this Nov 18, 2024
