Skip to content

Releases: vmikk/NextITS

1.0.0

24 Mar 08:43
Compare
Choose a tag to compare

NextITS v.1.0.0 [2025-03-24]

  • Added support of asymmetric barcoding scheme for demultiplexing of PacBio data
  • Added support of BAM files (CCS) as input
  • Added support for SWARM d=1 pre-clustering
  • Changed the selection of representative sequences (sequence with the highest quality score is taken as the representative; using phredsort)
  • Refactored sequence quality estimation
  • Improved processing speed (using DuckDB and Parquet format)
  • Improved tag valiadion for demultiplexing
  • Improved compression speed for output files (runs in parallel using pigz)
  • Update of the database for reference-based chimera detection (using the EUKARYOME database)
  • New parameters added:
    • step (specifies which pipeline step to run - "Step1" or "Step2")
    • storagemode (Adjusts how files are directed to the results folder)
    • gzip_compression (Controls GZIP compression level in output files)
    • categorical lima_barcodetype replaces boolean lima_dualbarcode
    • lima_minendscore (For asymmetric and dual barcoding scheme)
    • lima_minrefspan (Controls barcode coverage)
    • lima_minscoringregions (Controls the number of reqired barcodes for demultiplexing using dual barcodes)
  • Added auxilarry output files:
    • [Step-1] All rRNA parts extracted by ITSx (pooled within sequencing run - useful for extracting these regions for representative sequences)
    • [Step-1] File with quality scores for full-length sequences (after QC and trimming)
    • [Step-2] File with joined sequence memebership (dereplication, pre-clustering, and clustering)
  • Primer trimming prior ITSx is now default (sequence quality is also estimated on trimmed sequence)
  • Fixed VSEARCH clustering on denoised reads
  • Resolved an issue where no de novo chimeras were detected
  • Reconfigured parameter specification
  • Introduced a parameter schema and enhanced parameter validation
  • Container updates to included the latest versions of dependencies
  • New dependencies - specialized tools written in Go to speed up the processing:

v.0.5.0 [2023-08-08]

25 Aug 12:56
Compare
Choose a tag to compare
  • New seqstats sub-workflow (only dereplication, primer validation, and basic run stats)
  • Add SWARM clustering (Mahé et al., 2022 DOI:10.1093/bioinformatics/btab493)
  • Add post-clustering curation with LULU (Frøslev et al., 2017 DOI:doi.org/10.1038)
  • Add barcode validation step
  • Add SSU and LSU region-based output sequences
  • Add support for UNOISE-only output (without clustering)
  • Add merge_replicates parameter (Step-2) for merging or keeping separate sample replicates
  • Update Step-1 run summary (add homopolymer stats)
  • Deprecate taxonomy annotation workflow at Step-1
  • Fixed different extensions in demultiplexed input
  • Experimental: UNITE-style dereplication (allows query sequences to vary in length at 100% similarity)
  • Experimental: support of alternative alignment penalty scores (ITS-specific feature)

v.0.4.0 [2023-05-08]

03 Aug 13:12
Compare
Choose a tag to compare

v.0.3.0 [2023-03-02]

05 May 12:44
Compare
Choose a tag to compare
  • Add support for pre-demultiplexed data as input
  • Add option for semi-full-length ITS (especially useful when forward primer is located at the very end of SSU and the HMM site can not be recognized by ITSx)
  • Add removal of long homopolymer artifacts at QC stage
  • Correct handling of a case with no valid sequences at primer checking step (thank to Taavi Riit for reporting the bug)
  • Bug fixed in assemble_its (thanks to Kadri Põldmaa for discovering the error)
  • Addition of ITSx detailed results (with information on the HMM profile used for ITS extraction)
  • Fixed sample names for the rescued chimeric sequences
  • Minor fixes related with the Singularity container, output directory, help message, and single-end QC

v.0.2.0 [2022-09-30]

05 May 12:41
Compare
Choose a tag to compare
  • Add Ilumina-based workflow (see --seqplatform flag)
  • Publish Singularity image to Singularity library
  • Minor bugfixes in primer_check (multiprimer artefacts), pool_seqs (sequence headers), and prep_asvtab (aggregation of non-unique joined Illumina sequences) processes
  • New logo design (thanks to Olesya Dulya)