Fix/fix preprocessing 20211119 #9

cschu · 2021-11-19T22:23:04Z

No description provided.

…ingle sample

* R1/R2 lengths are now assessed and homogenised separately (i.e. they can have different lengths as supported by Figaro) * initial read length distribution assessment is now performed by fastqc * fastq filenames are normalised to R1/R2 naming scheme * removed pair_id from sample meta information

added gaga2 workflow diagram

added workflow diagram

…lab/gaga2 into feature/add_preprocessing

* paired-end reads are now filtered by whether they're spanning the amplicon length (figaro-requirement) * single-end reads are not filtered

* fixed issue with homogeneous length trimming * removed check for preprocessed reads that would prevent preprocessing for 'garbage' data

…pseq db

* read length distributions are now obtained from bbduk histograms (non-binned) * experimental: allow shorter reads instead of forcing to completely cover amplicon

…dapters/primers

* version 0.5 * if reads are not covering the full amplicon size, a shortened amplicon size is provided to figaro (experimental) * bbduk length histograms (instead of fastqc) are now provided to read length assessment * sample classification was (temporarily?) moved into the fastq-collection Channel (due to issues with nf 21.10+) * resolved a data flow issue that would allow dada2 processes to start before the preprocessing is finished

cschu and others added 30 commits November 10, 2021 12:46

added preprocessing of raw reads

dddfc9c

version bump -> 0.4

e187ad6

added --preprocessed flag + changed flow-logic; minor cleanup

b229a70

Update README.md

c0727c7

fixed some that would generate the wrong table format when having a s…

31cd58b

…ingle sample

cleanup

b80eaaa

added bbduk label

56a6a6d

readlen homogenisation is now done by bbduk; added bbduk/dada2 labels

192e030

tidy directories

ab67729

added missing curly bracket

8b049da

added mapseq-profiling of asv sequences

a21e31f

added docs folder

5edcff8

Add files via upload

46b3ec9

added gaga2 workflow diagram

Update README.md

ec96e3b

added workflow diagram

added parameters for dada2 chimera removal

0c78457

Merge branch 'feature/add_preprocessing' of https://github.com/zeller…

91d0038

…lab/gaga2 into feature/add_preprocessing

removed vortex-code (prevalence check) as it fails for certain samples

75429dd

Update flow diagram

4c996cb

removed accidentally created file

da64390

added illumina 16S technical sequences

ebf395d

changed read length assessment

d413236

* paired-end reads are now filtered by whether they're spanning the amplicon length (figaro-requirement) * single-end reads are not filtered

changed adapter/primer removal to 2-step fwd/rev clipping

6676044

main workflow changes

e1a824e

* fixed issue with homogeneous length trimming * removed check for preprocessed reads that would prevent preprocessing for 'garbage' data

version bump -> 0.4.1

416d79b

Merge branch 'master' into fix/fix_preprocessing_20211119

2801221

Update README.md

277b56d

Update README.md

6ee07ba

fixed post-merge main.nf

66bb7ce

increased run time for dada2 processes, added custom mapseq db

38858f0

cschu added 6 commits November 22, 2021 13:51

changed mapseq output to simple, allowed custom mapseq databases

4aecd92

changed publish_mode to copy, upped memory for dada2, added custom ma…

9a35410

…pseq db

improved read length resolution during assessment

8eddf2c

* read length distributions are now obtained from bbduk histograms (non-binned) * experimental: allow shorter reads instead of forcing to completely cover amplicon

added stepwise read preprocessing, individually cleaning r1/r2 from a…

a716191

…dapters/primers

added --long_reads documentation

35cf558

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix/fix preprocessing 20211119 #9

Fix/fix preprocessing 20211119 #9

cschu commented Nov 19, 2021

Fix/fix preprocessing 20211119 #9

Are you sure you want to change the base?

Fix/fix preprocessing 20211119 #9

Conversation

cschu commented Nov 19, 2021