Add a `concept_id` column to drug and gene claims #573

jsstevenson · 2025-02-26T15:00:23Z

Some ingested resources provide concept IDs instead of, or in addition to, drug and gene names. We are relatively inconsistent about how we handle this, e.g. whether we make the label or the concept ID an alias.

I think it'd be preferable to parse out the concept ID when it's available. This would help grouping (better to search by concept ID first). If a claim doesn't provide a label otherwise, we could just copy the ID over to the label as well.

This becomes relevant in cases like this: #567. Here, an NCI row simply provides the drug name "ADM" and NCIt concept code "C1326". We store the name as the name of the claim and the concept code as an xref. This becomes problematic during normalization because "ADM" is ambiguous (acellular dermal matrix vs adriamycin).

Other possibilities:

Always make the concept ID the label if available. This would obviate the need for a schema change but I think it's less ideal if we want to make UI views for claims (which I think we should, this was a feature in the original UI if I'm not mistaken).

mcannon068nw · 2025-02-26T15:25:55Z

As mentioned from our discussion, I think this is a good idea but we should probably do a test run of this just to see how it impacts grouping. Also, we should consider another name for this column besides concept_id so as to avoid confusion/differentiate it a little more. Maybe source_concept_id?

jsstevenson added the backend Changes to the backend only label Feb 26, 2025

jsstevenson assigned mcannon068nw Feb 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a `concept_id` column to drug and gene claims #573

Add a `concept_id` column to drug and gene claims #573

jsstevenson commented Feb 26, 2025

mcannon068nw commented Feb 26, 2025

Add a concept_id column to drug and gene claims #573

Add a concept_id column to drug and gene claims #573

Comments

jsstevenson commented Feb 26, 2025

mcannon068nw commented Feb 26, 2025

Add a `concept_id` column to drug and gene claims #573

Add a `concept_id` column to drug and gene claims #573