Make the reference configurable #35

Open
leoisl opened this issue May 18, 2022 · 10 comments

Comments

@leoisl
Collaborator

leoisl commented May 18, 2022

This is a request from Floriane and Pedro from our meeting on 18/05/2022.

H37Rv is not always the reference they might want to describe the variants with respect to, so the reference has to be configurable (e.g. through the parameters). Right now it is always forced to be H37Rv.

@mbhall88
Owner

As long as they're aware that using different references means they won't be able to compute SNP distances between isolates that were called against different references.

@leoisl
Collaborator Author

leoisl commented May 19, 2022 via email

@mbhall88
Owner

@iqbal-lab I think we need to stress this issue. I don't think we want to be detecting when they're providing consensus sequences based on different references...

@iqbal-lab
Collaborator

It's called tbpore, I don't see why we need to support different references. Will follow up

@leoisl
Collaborator Author

leoisl commented May 20, 2022

It is still a TB reference, but a different one. Sometimes they want to compare with an African (if I remember correctly) reference strain rather than with H37Rv.

@mbhall88
Owner

Yep. But the genome will be a different size and therefore SNP distances won't be applicable

@leoisl
Collaborator Author

leoisl commented Aug 18, 2022

So, should we reconsider this or should we close as not planned?

@mbhall88
Owner

We can make it configurable. It shouldn't be too difficult, right? We just need to make sure we add a disclaimer in the help menu and docs that, when using a different reference, you can't compare SNP distances to samples that were processed with another reference.

@leoisl
Collaborator Author

leoisl commented Aug 19, 2022

It is not hard to make it configurable. We could also easily check that tbpore cluster is called with consensus sequences that were all generated from the same ref: just add the ref's SHA-256 (or a similar hash) to the first FASTA header of the consensus. Of course this check can't be enforced if the consensus was not made by tbpore, but then we could just have a --ignore-ref-check flag.
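
A rough sketch of how that check could look, not actual tbpore code. The `ref_sha256=` header tag, the function names, and the `ignore_ref_check` parameter (standing in for the proposed `--ignore-ref-check` flag) are all assumptions made for illustration.

```python
import hashlib
import re

# Hypothetical tag appended to the first FASTA header of a consensus.
REF_TAG = "ref_sha256="


def sha256_of_file(path):
    """Return the hex SHA-256 digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def tag_first_header(fasta_text, ref_hash):
    """Append the reference hash to the first header line of a FASTA string."""
    lines = fasta_text.splitlines()
    for i, line in enumerate(lines):
        if line.startswith(">"):
            lines[i] = f"{line} {REF_TAG}{ref_hash}"
            break
    return "\n".join(lines) + "\n"


def read_ref_hash(fasta_text):
    """Return the hash stored in the first header, or None if it is absent."""
    pattern = re.escape(REF_TAG) + r"([0-9a-f]{64})"
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            match = re.search(pattern, line)
            return match.group(1) if match else None
    return None


def check_same_reference(consensus_texts, ignore_ref_check=False):
    """Raise ValueError unless all consensus sequences share one reference hash."""
    if ignore_ref_check:
        return  # user opted out, e.g. the consensus was not made by tbpore
    hashes = [read_ref_hash(text) for text in consensus_texts]
    if None in hashes:
        raise ValueError("a consensus has no reference hash in its first header")
    if len(set(hashes)) > 1:
        raise ValueError("consensus sequences were built from different references")
```

Hashing the raw file bytes means two copies of the same reference with different line wrapping or header text would get different hashes; hashing only the normalised sequence would avoid that, at the cost of a slightly more involved implementation.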

@mbhall88
Owner

Sounds like a good idea to me
