How do I deal with wild cards when creating reaction templates? #21

gnsrivastava · 2021-01-14T10:44:05Z

Hello,

I am trying to create reaction templates but there are some reactions that have wildcard notations for some of the atoms. I am not sure how to deal with them.
There is a paper that talks about converting them into C, N, P , S, Se, As if molecules is aromatic but doesn't say anything about aliphatic molecules.
Can you please guide me as to how to deal with them?

Gopal

connorcoley · 2021-01-14T12:11:54Z

Can you provide an example?

On Thu, Jan 14, 2021 at 05:44 Gopal Srivastava ***@***.***> wrote: Hello, I am trying to create reaction templates but there are some reactions that have wildcard notations for some of the atoms. I am not sure how to deal with them. There is a paper that talks about converting them into C, N, P , S, Se, As if molecules is aromatic but doesn't say anything about aliphatic molecules. Can you please guide me as to how to deal with them? Gopal — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#21>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ABAEXJWYTB4T4J7ZTOQKHWDSZ3DILANCNFSM4WCHK43Q> .

-- Sent from my phone; please excuse my brevity

gnsrivastava · 2021-01-14T13:39:44Z

These are smile strings. I am planning on converting them to SMARTS using Reaction Decoder.

*C@HCC(=O)SCCNC(=O)CCNC(=O)C@HC(C)(C)COP(=O)(O)OP(=O)(O)OC[C@H]1OC@@H C@H[C@@h]1OP(=O)
C(=O)CC(=O)SCCNC(=O)CCNC(=O)C@HC(C)(C)COP(=O)(O)OP(=O)(O)OC[C@H]1OC@@H C@H[C@@h]1OP(=O)(O)O>>C@HCC(=O)SCCNC(=O)CCNC(=O)C@HC(C)(C)COP(=O)(O)OP(=O)(O)OC[C@H]1OC@@H C@H[C@@h]1OP(=O)(O)O

*: Wildcards => can be R group or different substitution groups.
Thanks for the help.

Gopal

connorcoley · 2021-01-14T20:19:36Z

Hi Gopal,

These don't look like well-formed, valid SMILES and it's odd that you've got a mix of molecules and reactions there. Where did you get these from? I'm a little confused about what you're trying to do here, sorry

Reaction template extraction is outside of the scope of this repo, but is something that another one of our codes can do, rdchiral. The code isn't written to deal with these wildcards.

gnsrivastava · 2021-01-15T04:45:07Z

I am sorry for not properly posting the reaction. I think you answered my question. I have just one last question I was hoping you can help me with.
In [CH2:23]1[O:24][CH2:25][CH2:26][CH2:27]1.[F:1][c:2]1c:3[cH:4]c:5c:6[cH:7]1.[H-:22].[NH2:13][c:14]1[s:15][cH:16][cH:17][c:18]1[C:19]#[N:20].[Na+:21] | 1 | 2-13-1.0 | 2.196 | 1-2-0.0 | 1.877 what are 2.196 and 1.877. This line was taken from legend.csv in rexgen-master/template_comparison/cases.

connorcoley · 2021-01-15T14:54:50Z

These are scores associated with the graph edit they are adjacent to (i.e., a measure of how likely the model perceives them to be)

gnsrivastava · 2021-01-16T06:00:47Z

Thank you so much. These answers were a huge help.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How do I deal with wild cards when creating reaction templates? #21

How do I deal with wild cards when creating reaction templates? #21

gnsrivastava commented Jan 14, 2021

connorcoley commented Jan 14, 2021 via email

gnsrivastava commented Jan 14, 2021

connorcoley commented Jan 14, 2021

gnsrivastava commented Jan 15, 2021

connorcoley commented Jan 15, 2021

gnsrivastava commented Jan 16, 2021

How do I deal with wild cards when creating reaction templates? #21

How do I deal with wild cards when creating reaction templates? #21

Comments

gnsrivastava commented Jan 14, 2021

connorcoley commented Jan 14, 2021 via email

gnsrivastava commented Jan 14, 2021

connorcoley commented Jan 14, 2021

gnsrivastava commented Jan 15, 2021

connorcoley commented Jan 15, 2021

gnsrivastava commented Jan 16, 2021