Hotfix/multilib annotate#109

Merged
nickp60 merged 7 commits into release/0.5.0 from hotfix/multilib-annotate on Apr 25, 2025

@nickp60 nickp60 commented Apr 24, 2025

Contributor

Multiple fixes to the annotation app:

  • adds a parameter for the number of contigs in each shard of the annotation (defaults to 200)
  • adds an extra argument to the test script to make it easier to run with --dryrun or to target a specific file during testing
  • bumps the metaerg version, which will hopefully fix the minpath issues
  • fixes the make_assembly_split_names function and adds tests to catch off-by-one errors
  • makes several annotation files temporary to cut down on storage space
  • runs alignment on the whole metagenome rather than on shards; alignment runtime is limited by the number of reads, so this is more efficient than aligning all the reads to each shard individually
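
For readers unfamiliar with the sharding step, here is a minimal Python sketch of splitting contig names into fixed-size shards, and of the boundary cases where an off-by-one error typically appears. It is not the repository's make_assembly_split_names implementation; the names `shard_contigs`, `shard_count`, and `contigs_per_shard` are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def shard_contigs(names: Sequence[str], contigs_per_shard: int = 200) -> List[List[str]]:
    """Split contig names into consecutive shards (hypothetical helper)."""
    if contigs_per_shard < 1:
        raise ValueError("contigs_per_shard must be at least 1")
    # Stepping range() by the shard size gives every shard start index;
    # slicing clamps at the end, so the last shard may be shorter.
    return [
        list(names[start:start + contigs_per_shard])
        for start in range(0, len(names), contigs_per_shard)
    ]


def shard_count(n_contigs: int, contigs_per_shard: int = 200) -> int:
    """Number of shards needed, using ceiling division (hypothetical helper)."""
    # Plain floor division would drop a partial final shard;
    # adding 1 unconditionally would create an empty extra shard.
    return -(-n_contigs // contigs_per_shard)
```

The cases worth testing are the ones around a multiple of the shard size: with the default of 200, an assembly of 400 contigs should give exactly two shards, and one of 401 contigs should give three, the last holding a single contig.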

@nickp60 nickp60 requested a review from miraep8 April 24, 2025 16:43

@miraep8 miraep8 left a comment

Contributor

Thank you Nick! Looks great to me for now - shall we wait to merge until the latest round of annotate test apps goes through?

Comment thread workflow/rules/annotate.smk Outdated
outdir=temp(directory("annotation/annotation_{batch}/")),
gff=temp("annotation/annotation_{batch}.either_all_or_master.gff"),
ffn=temp("annotation/annotation_{batch}.cds.ffn"),
faa=("annotation/annotation_{batch}.cds.faa"),

Contributor

Should the batch level faa also be a temp file?

gff="annotation/annotation_{batch}/data/either_all_or_master.gff",
ffn="annotation/annotation_{batch}/data/cds.ffn",
faa="annotation/annotation_{batch}/data/cds.faa",
outdir=temp(directory("annotation/annotation_{batch}/")),

Contributor

I think probably the outdir in the command below should be updated to make use of the $output.outdir variable?

Comment thread workflow/rules/annotate.smk Outdated
output:
length_file="annotation/annotation_{batch}/seqkit.length",
bed_file="annotation/annotation_{batch}/seqkit.bed",
length_file=f"{config['sample']}_metaerg.seqkit.length",

Contributor

These could also probably be temp files

nickp60 commented Apr 25, 2025

Contributor Author

Jobs are all succeeding so I'll merge

@nickp60 nickp60 merged commit c0d3187 into release/0.5.0 Apr 25, 2025
@nickp60 nickp60 deleted the hotfix/multilib-annotate branch April 25, 2025 02:45

2 participants