Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix/fix preprocessing 20211119 #9

Open
wants to merge 36 commits into
base: master
Choose a base branch
from

Conversation

cschu
Copy link
Member

@cschu cschu commented Nov 19, 2021

No description provided.

cschu and others added 30 commits November 10, 2021 12:46
* R1/R2 lengths are now assessed and homogenised separately (i.e. they can have different lengths as supported by Figaro)
* initial read length distribution assessment is now performed by fastqc
* fastq filenames are normalised to R1/R2 naming scheme
* removed pair_id from sample meta information
added gaga2 workflow diagram
added workflow diagram
* paired-end reads are now filtered by whether they're spanning the amplicon length (figaro-requirement)
* single-end reads are not filtered
* fixed issue with homogeneous length trimming
* removed check for preprocessed reads that would prevent preprocessing for 'garbage' data
* read length distributions are now obtained from bbduk histograms (non-binned)
* experimental: allow shorter reads instead of forcing to completely cover amplicon
* version 0.5
* if reads are not covering the full amplicon size, a shortened amplicon size is provided to figaro (experimental)
* bbduk length histograms (instead of fastqc) are now provided to read length assessment
* sample classification was (temporarily?) moved into the fastq-collection Channel (due to issues with nf 21.10+)
* resolved a data flow issue that would allow dada2 processes to start before the preprocessing is finished
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant