I'm using MIMIC2 and have adjusted the "Prevalence filtering threshold for shortlisting genomes" to allow specific strains to be included in the selection. However, by lowering the prevalence threshold, I end up with over 70 genomes being selected for the next scoring step. This causes a significant increase in memory usage (%MEM), leading to performance issues.
Is there a way to reduce memory usage while still ensuring that these specific strains are included in the selection, or any other approaches to handle large numbers of genomes without overwhelming the memory?
Thanks in advance for your help!
Best regards