Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Final bug fixes before submitting the paper #7

Open
wants to merge 318 commits into
base: epiGBS2v2
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
318 commits
Select commit Hold shift + click to select a range
362fd08
Reference hack doesn't seem to be necessary!
Jan 20, 2021
b507fb6
Add merge step
Jan 20, 2021
f785971
Snakemake syntax??
Jan 20, 2021
ea48c14
Typo
Jan 20, 2021
da2e3cf
Typo
Jan 20, 2021
27bf7a9
Removed field that stopped merging from being possible
Jan 20, 2021
39135fe
Added filter for sites with no alt allele and for which the reference…
Jan 20, 2021
b7d9a34
Added reference branch
Jan 20, 2021
542c4ab
Added reference branch (SNP_calling does not work yet)
Jan 20, 2021
88f1132
Update reference.rules
Jan 20, 2021
ca232de
Update reference.rules
Jan 20, 2021
fd812ad
test
Jan 20, 2021
0efe897
should work now
Jan 20, 2021
8bed584
Should actually work now
Jan 20, 2021
7c9cb09
typo
Jan 20, 2021
0b449d7
added correct output file to alignment en meth calling with bismark
Jan 21, 2021
b92aee9
Made SNP calling even more conservative in threads used...
Jan 21, 2021
c26bcff
Simplified snakemake structure
Jan 22, 2021
647e1dd
Missed commas
Jan 22, 2021
ddbcec2
Dictionary was in the wrong place
Jan 22, 2021
4956d88
Updated config. Renamed legacy to legacy
Jan 22, 2021
217785e
Re-added QC
Jan 22, 2021
210dd2e
Changed rule multiQC to work with new pipeline
Jan 25, 2021
001250e
Update Snakefile
Jan 25, 2021
1825f15
Missing comma
Jan 25, 2021
ffa787e
Forgot rules/
Jan 25, 2021
2b0b06e
Added Fleurs environments + removed removed old reference branch
Jan 25, 2021
d9f72ed
Added output to demultiplex.rules so it works with the new version
Jan 25, 2021
9f982a2
Removed duplicate lines from demultiplex.rules as they should be in t…
Jan 25, 2021
2589836
Synthaaaaaaaaaax
Jan 25, 2021
c3be4d4
Update demultiplex.rules
Jan 25, 2021
8ebcc75
Removed unused rules files
Jan 26, 2021
3e250ad
Attempted fix not all tmp files deleted. Adjusted default denovo para…
Jan 26, 2021
7e5f61f
Maybe do not delete these files?
Jan 26, 2021
e7ebe31
Removed QC from legacy branch (temporary?)
Jan 27, 2021
f413ce1
Update Snakefile
Jan 27, 2021
6e1501f
Update Snakefile
Jan 27, 2021
bf0d20b
Added sexy new multiqc
Jan 29, 2021
f141b65
Fixed parse log input
Jan 29, 2021
479f141
???
Jan 29, 2021
1a9fc38
?!
Jan 29, 2021
a7f9d9d
????
Jan 29, 2021
5259399
Fixed outdir
Jan 29, 2021
51fcda5
added QC to the legacy branch
Jan 29, 2021
c77cb27
Fixed temp files not being deleted
Feb 1, 2021
8330ef4
?
Feb 1, 2021
690b126
New attempt for proper temp file management
Feb 1, 2021
4088648
More temp file changes
Feb 1, 2021
2a969fd
Reference branch should work the same as the denovo branch... Denovo …
Feb 1, 2021
7e59ae0
Syntax
Feb 1, 2021
6c535ea
Fixed variable names not correct
Feb 1, 2021
2bfc9d0
Slightly better tmpdir structure for demultiplexing
Feb 1, 2021
93189da
removed SNP calling parameters from the legacy branch
Feb 2, 2021
6f59865
typo
Feb 3, 2021
644fe10
Changed rule all so you don't create the watson/crick files that you …
Feb 3, 2021
5d9f34c
typo
Feb 3, 2021
9891c57
Update fastqc-ref.rules
Feb 3, 2021
6c653de
Renamed rules to reference instead of denovo
Feb 3, 2021
c1e0416
New improved snp_calling?
Feb 4, 2021
848eb47
Update ProperSNPCallingReference.rules
Feb 4, 2021
179bec0
Update ProperSNPCallingReference.rules
Feb 4, 2021
19d0218
Update ProperSNPCallingReference.rules
Feb 4, 2021
5ccc435
Update ProperSNPCallingReference.rules
Feb 4, 2021
e5339eb
Update ProperSNPCallingReference.rules
Feb 4, 2021
c25db97
Update ProperSNPCallingReference.rules
Feb 4, 2021
6279ea1
Update ProperSNPCallingReference.rules
Feb 4, 2021
632dbf4
Update ProperSNPCallingReference.rules
Feb 4, 2021
586f5f9
Update ProperSNPCallingReference.rules
Feb 4, 2021
cf43ad6
Update ProperSNPCallingReference.rules
Feb 4, 2021
3f6d065
Update ProperSNPCallingReference.rules
Feb 4, 2021
e9a1dcd
Update ProperSNPCallingReference.rules
Feb 4, 2021
fe6b9f1
Update ProperSNPCallingReference.rules
Feb 4, 2021
081b516
Update ProperSNPCallingReference.rules
Feb 4, 2021
7697bf4
Update ProperSNPCallingReference.rules
Feb 4, 2021
a0f945c
Added improved SNP calling + moved multqc creation to before SNP call…
Feb 5, 2021
de3abcf
Synthax
Feb 5, 2021
0292690
Correct environment location
Feb 5, 2021
04174e2
Commas and removed un used script
Feb 5, 2021
7a328e8
Incorrect environment path
Feb 5, 2021
c220de4
Fixed dictionary not being created correctly when . are present in ge…
Feb 11, 2021
a9c482b
Update README.md
Feb 11, 2021
bac5d03
Update README.md
Feb 11, 2021
7b5ff6f
Minor update to when fastqc is started
Feb 15, 2021
04a2aec
Branch Test
Feb 22, 2021
d7fd705
Non-directional mapping
Feb 22, 2021
aa855ef
Attempt to get trimming to wrok
Feb 22, 2021
3dcb556
To many brackets
Feb 22, 2021
c022dfa
Removed part that I now skip
Feb 22, 2021
56931cd
Fixed synthax error
Feb 22, 2021
7680939
All rules get their own name
Feb 22, 2021
3a70088
Added adams python script
Feb 22, 2021
9adcd80
Create benchmarking.txt
Feb 23, 2021
e5c777b
Added better SNP calling to reference branch
Feb 23, 2021
e7fc391
Forgot comma
Feb 23, 2021
61a55fb
Out_dir != output_dir
Feb 23, 2021
8466f95
Snakemake synthax many files in few files out is complex
Feb 23, 2021
fcf8da4
Conda environment names
Feb 23, 2021
c1e20b6
Added new SNP calling to denovo branch
Feb 23, 2021
949197b
The denovo branch does not have a reference before you run it...
Feb 23, 2021
b79e215
Update denovo.rules
Feb 23, 2021
e4e2fc5
Update denovo.rules
Feb 23, 2021
61650e8
Better threads usage + config update + denovo creation parameters change
Feb 23, 2021
0ecb696
Changed calmd to run on the big file
Feb 24, 2021
a15a037
Update reference.rules
Feb 24, 2021
1e60d3c
Update reference.rules
Feb 24, 2021
faf0a6b
Fixed multiQC not working as intended
Feb 24, 2021
c7a32cb
Added multicoreness to samtool commands were ncessariy
Feb 24, 2021
981a666
Input != output
Feb 24, 2021
852ba32
Added index after calmd
Feb 24, 2021
6f88eb4
Fixed missing comma's
Feb 24, 2021
8759271
Added index + removed uncessary temp() input lines in trimming
Feb 24, 2021
e55d876
Learn to spel parralel
Feb 24, 2021
b82cfdf
Index prepared bam
Feb 24, 2021
a261e59
make tmp paths make sense
Feb 24, 2021
0b76ba8
Update reference.rules
Feb 24, 2021
42a2ab1
More paths
Feb 24, 2021
8908a72
Added RG properly in the bam files
Feb 24, 2021
0d4a640
No more cut adapt logs in the rule all
Feb 24, 2021
135e9eb
Typo
Feb 24, 2021
db2a858
location of mutliqc config file
Feb 24, 2021
777d8c9
Update fastqc-ref.rules
Feb 24, 2021
1d86c14
Give the correct things the correct name and not the incorrect things…
Feb 24, 2021
f68e1c9
No multicore for denovo branch
Feb 25, 2021
3f87662
Update to ensure samtools merge doesn't output 10.000 header lines
Feb 25, 2021
b4c4595
Denovo can ben multithread again!
Feb 25, 2021
90629bc
Now fastqc always works
Feb 25, 2021
c122c42
Added -Q parameter to ensure Qualities are masked
Feb 25, 2021
cb96106
Adding mapping quality to snp calling
Mar 2, 2021
4053e51
Update fastqc-ref.rules
Mar 2, 2021
61f275c
Update fastqc-ref.rules
Mar 2, 2021
c96afc6
(hopefully) changed module order in multiQC
Mar 3, 2021
2cd7857
added "running multiple lanes"
Mar 4, 2021
1d16ba4
Update reference.rules
Mar 4, 2021
1ceb3cd
updated Read me
Mar 4, 2021
113b854
Merge branch 'SneakyImplementationOfAdamsSNPcaller' of https://gitlab…
Mar 4, 2021
02aaaad
Correct qualities in SNP calling
Mar 4, 2021
d2fb1fd
added --no-population-priors
Mar 4, 2021
21f276d
Update reference.rules
Mar 10, 2021
ec23c5a
Update fastqc-ref.rules
Mar 10, 2021
fd012c6
Update denovo.rules
Mar 16, 2021
b90af7d
Added trimmed fastqc?
Mar 18, 2021
a0ef617
Typo
Mar 18, 2021
85fda3a
Cleaning
Mar 22, 2021
b919378
typo
Mar 22, 2021
d4bc94b
Typo's
Mar 22, 2021
3e7cf95
Update README.md
Mar 22, 2021
b6bd9c3
Added parameters of SNP calling to rules files
Mar 22, 2021
df33ca7
Merge branch 'SneakyImplementationOfAdamsSNPcaller' of https://gitlab…
Mar 22, 2021
b9eb577
add exception in parsing script to allow reference branch.
Mar 24, 2021
70a7f6b
Remove the old report which tends to break a lot....
Mar 30, 2021
ba047eb
Merge branch 'SneakyImplementationOfAdamsSNPcaller' of https://gitlab…
Mar 30, 2021
13f94ef
added untrimmed to the multiqc exclusion list
Mar 30, 2021
99ae067
Added parameterTest Things
Apr 12, 2021
3928c7c
Update paramTest.rules
Apr 12, 2021
7067c28
Update paramTest.rules
Apr 12, 2021
cf9c1d7
Update paramTest.rules
Apr 12, 2021
41e52b7
Update paramTest.rules
Apr 12, 2021
94055c4
Update paramTest.rules
Apr 12, 2021
0a02e7b
Update paramTest.rules
Apr 12, 2021
5cae095
Update paramTest.rules
Apr 12, 2021
f763b28
Update paramTest.rules
Apr 12, 2021
fd0d11a
Update paramTest.rules
Apr 12, 2021
1bb5dbc
Update paramTest.rules
Apr 12, 2021
367cf9c
Update paramTest.rules
Apr 12, 2021
ae6b521
Update Snakefile
Apr 12, 2021
7fe1f06
Update Snakefile
Apr 12, 2021
c9c879a
Update Snakefile
Apr 12, 2021
1990473
Update Snakefile
Apr 12, 2021
1c59afb
Update Snakefile
Apr 12, 2021
20109d5
Update paramTest.rules
Apr 12, 2021
9e7ae56
Jaaaaaaaaaa hooooooooor
Apr 12, 2021
b06e7da
Update paramTest.rules
Apr 12, 2021
4e03b75
Update paramTest.rules
Apr 12, 2021
3e28153
Update paramTest.rules
Apr 12, 2021
98d4a72
Fixed temp file issues + clustering ID incorrect values
Apr 13, 2021
2023da2
TAB seperated file
Apr 13, 2021
bc6353e
Update paramTest.rules
Apr 13, 2021
52439f4
Update paramTest.rules
Apr 13, 2021
572828a
Update paramTest.rules
Apr 13, 2021
e54ddcf
Update paramTest.rules
Apr 13, 2021
41639ac
Update paramTest.rules
Apr 13, 2021
0cae001
Update paramTest.rules
Apr 13, 2021
b0c14e7
Update paramTest.rules
Apr 13, 2021
edbc18d
Update paramTest.rules
Apr 13, 2021
4a8a5fa
Update paramTest.rules
Apr 13, 2021
14359a7
This works
Apr 13, 2021
9d14e8c
test
Apr 13, 2021
47d3233
Update paramTest.rules
Apr 13, 2021
87ad0dc
Fixed the issue!!
Apr 13, 2021
b0e5474
???!!!
Apr 13, 2021
177eedc
Fixed the issues with the tmp files
Apr 14, 2021
9ba96b5
Update paramTest.rules
Apr 14, 2021
30c8634
Update paramTest.rules
Apr 14, 2021
f6d8256
Update paramTest.rules
Apr 14, 2021
547fdd1
Update paramTest.rules
Apr 14, 2021
20d49a4
Update paramTest.rules
Apr 14, 2021
a2ad439
Update paramTest.rules
Apr 14, 2021
4451680
Update paramTest.rules
Apr 14, 2021
6a709d6
Update paramTest.rules
Apr 14, 2021
e80e587
Update paramTest.rules
Apr 14, 2021
d88d006
Update paramTest.rules
Apr 14, 2021
6a4534c
Update paramTest.rules
Apr 14, 2021
75f5951
Update paramTest.rules
Apr 14, 2021
27efc56
Update paramTest.rules
Apr 14, 2021
8cb61ee
Update paramTest.rules
Apr 14, 2021
ea739a9
Update paramTest.rules
Apr 14, 2021
29669c5
Update paramTest.rules
Apr 14, 2021
7aeb59e
Update paramTest.rules
Apr 14, 2021
bc47ca9
Update paramTest.rules
Apr 14, 2021
b7c31b4
New idea
Apr 14, 2021
b5e33a6
hahahaha
Apr 14, 2021
1cf7999
Update paramTest.rules
Apr 14, 2021
bbf6bb6
added stacks logs to multiqc in reference branch
Apr 19, 2021
00d0552
added mulitqc stacks support
Apr 19, 2021
b5a0961
Update README.md
Apr 19, 2021
786bb24
Making datasets for plots
Apr 20, 2021
6d8d8c6
Update paramTest.rules
Apr 20, 2021
09e6e07
Update paramTest.rules
Apr 20, 2021
242263e
Update paramTest.rules
Apr 20, 2021
1603e0a
Update paramTest.rules
Apr 20, 2021
b380491
Update paramTest.rules
Apr 20, 2021
f825bdb
Update paramTest.rules
Apr 20, 2021
4d2410c
Update paramTest.rules
Apr 20, 2021
96246e3
Update paramTest.rules
Apr 20, 2021
cce8e7b
Update paramTest.rules
Apr 20, 2021
6615f32
Added plot script, cleaned up the file structure
Apr 20, 2021
c9e76ef
Update paramTestPlot.R
Apr 20, 2021
e3ae928
Update paramTestPlot.R
Apr 20, 2021
8ff36aa
Merge branch 'SneakyImplementationOfAdamsSNPcaller' of https://gitlab…
Apr 20, 2021
ac86b20
Typo
Apr 20, 2021
b2e77fa
Typo
Apr 22, 2021
62d2910
Update multiqc.yaml
Apr 22, 2021
83e8cee
Stop annoying warning message from appearing
Apr 22, 2021
35a2f16
Add .gitlab-ci.yml to test quality of the code
Apr 30, 2021
5701697
Merge branch 'SneakyImplementationOfAdamsSNPcaller' into 'master'
Apr 30, 2021
22d2c8b
Update README.md
May 5, 2021
7b0fa58
Merge branch 'master' into 'master'
May 5, 2021
6e0c2f9
Update reference.rules
Jun 9, 2021
feafc2e
Removed incorrect information in comment
Jun 11, 2021
04f143c
Removed incorrect information from comment
Jun 11, 2021
d86b837
Update bismark.yaml so it does work on the NIOO server! (And probably…
Jun 16, 2021
915e1cb
Better default parameters for parameter_test
Jul 20, 2021
7ece398
Fixed bugs in paramTest.rules that caused the removal of temp files w…
Jul 20, 2021
413e8f2
Added removal of "bismark" files to ensure only bedgraphs are output
MaartenPostuma Oct 12, 2021
869875a
There was not as much output as I expected
MaartenPostuma Oct 12, 2021
ba66817
Improved performance of clone_filter by using a newer version of stacks
MaartenPostuma Nov 3, 2021
ffa7385
Added polyG problem and clone_filter memory issues to the error secti…
MaartenPostuma Jan 19, 2022
80b2b96
Update README.md
MaartenPostuma Jan 20, 2022
3948a2a
Final touches to readme
MaartenPostuma Jan 20, 2022
9ccb4b1
added some minor changes
FleurGaBru Jan 25, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions .gitlab-ci.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
# This file is a template, and might need editing before it works on your project.
include:
template: Jobs/Code-Quality.gitlab-ci.yml
261 changes: 139 additions & 122 deletions README.md

Large diffs are not rendered by default.

90 changes: 74 additions & 16 deletions Snakefile
Original file line number Diff line number Diff line change
@@ -1,47 +1,105 @@

configfile: "config.yaml"
from snakemake.utils import Paramspace
import pandas as pd
import os
import random
df = pd.read_csv(os.path.join(config["input_dir"],config["barcodes"]), sep='\t', dtype="object").set_index('Sample')
SAMPLES = df.index
SAMPLE = SAMPLES[0]
flowCell = df.Flowcell[0]
lane = df.Lane[0]
projectName=random.randint(1,1000000) #To ensure non overlapping tmp directories


paramspace = Paramspace(pd.read_csv("src/parameter_test/paramTest.tsv", sep="\t"))



if config["mode"] == "reference":
include: "src/rules/fastqc-ref.rules"
include: "src/rules/with_reference.rules"
include: "src/rules/reference.rules"
include: "src/rules/demultiplex.rules"
include: "src/rules/report-ref.rules"
include: "src/rules/trimming.rules"


if config["mode"] == "denovo":
include: "src/rules/denovo_reference.rules"
include: "src/rules/demultiplex.rules"
include: "src/rules/denovo.rules"
include: "src/rules/trimming.rules"
include: "src/rules/fastqc.rules"
include: "src/rules/report.rules"

if config["mode"] == "paramTest":
include: "src/rules/demultiplex.rules"
include: "src/rules/paramTest.rules"
include: "src/rules/trimming.rules"


if config["mode"]== "reference":
rule all:
input: expand("{out}/output_demultiplex/clone-stacks/{sample}-Watson.1.fq.gz \
{out}/output_demultiplex/clone-stacks/{sample}-Watson.2.fq.gz \
{out}/output_demultiplex/clone-stacks/{sample}-Crick.1.fq.gz \
{out}/output_demultiplex/clone-stacks/{sample}-Crick.2.fq.gz \
{out}/fastqc/ \
{out}/multiQC_report.html \
{out}/cutadapt/{sample}_trimmed_filt_merged.1.fq.gz \
{out}/cutadapt/{sample}_trimmed_filt_merged.2.fq.gz \
{out}/alignment/{sample}_trimmed_filt_merged.1_bismark_bt2_pe.bam \
{out}/methylation_calling/{sample}_bismark_bt2_pe.CX_report.txt.gz \
{out}/snp_calling/snp.vcf.gz".split(),out=config["output_dir"],sample=SAMPLES)

if config["mode"]== "denovo":
rule all:
input: expand("{out}/output_demultiplex/Watson_R2.fq.gz \
{out}/output_demultiplex/Watson_R1.fq.gz \
{out}/output_demultiplex/Crick_R2.fq.gz \
{out}/output_demultiplex/Crick_R1.fq.gz \
{out}/multiQC_report.html \
{out}/fastqc/ \
{out}/trimmed/Watson_R1_trim_fastqc.html \
{out}/trimmed/Crick_R2_trim_fastqc.html \
{out}/report.html \
{out}/mapping/watson.bam \
{out}/mapping/crick.bam \
{out}/mapping/methylation.bed \
{out}/mapping/snp.vcf.gz".split(), out=config["output_dir"])
{out}/multiQC_report.html \
{out}/output_denovo/consensus_cluster.renamed.fa \
{out}/cutadapt/{sample}_trimmed_filt_merged.1.fq.gz \
{out}/cutadapt/{sample}_trimmed_filt_merged.1.fq.gz \
{out}/alignment/{sample}_trimmed_filt_merged.1_bismark_bt2_pe.bam \
{out}/methylation_calling/{sample}_bismark_bt2_pe.CX_report.txt.gz \
{out}/snp_calling/snp.vcf.gz".split(),out=config["output_dir"],sample=SAMPLES)

if config["mode"]== "paramTest":
rule all:
input: expand("{out}/output_demultiplex/clone-stacks/{samples}-Watson.1.fq.gz \
{out}/output_demultiplex/clone-stacks/{samples}-Watson.2.fq.gz \
{out}/output_demultiplex/clone-stacks/{samples}-Crick.1.fq.gz \
{out}/output_demultiplex/clone-stacks/{samples}-Crick.2.fq.gz \
{out}/output_demultiplex/Watson_R2.fq.gz \
{out}/output_demultiplex/Watson_R1.fq.gz \
{out}/output_demultiplex/Crick_R2.fq.gz \
{out}/output_demultiplex/Crick_R1.fq.gz \
{out}/paramTest/{params}/alignment/{sample}_trimmed_filt_merged.1_bismark_bt2_pe.bam \
{out}/paramTest/{params}/output_denovo/consensus_cluster.renamed.fa \
{out}/paramTest/Assembled.txt \
{out}/paramTest/averageDepth.txt \
{out}/paramTest/denovoParameter.tsv \
{out}/paramTest/denovoParameter.tiff \
{out}/cutadapt/{sample}_trimmed_filt_merged.1.fq.gz \
{out}/cutadapt/{sample}_trimmed_filt_merged.2.fq.gz".split(),out=config["output_dir"],params=paramspace.instance_patterns,sample=SAMPLE,samples=SAMPLES)


if config["mode"]== "denovo":
if config["mode"] == "legacy":
include: "src/rules/legacy.rules"
include: "src/rules/demultiplex.rules"
include: "src/rules/report.rules"


if config["mode"]== "legacy":
rule all:
input: expand("{out}/output_demultiplex/Watson_R2.fq.gz \
{out}/output_demultiplex/Watson_R1.fq.gz \
{out}/output_demultiplex/Crick_R2.fq.gz \
{out}/output_demultiplex/Crick_R1.fq.gz \
{out}/fastqc/ \
{out}/multiQC_report.html \
{out}/report.html \
{out}/mapping/watson.bam \
{out}/mapping/crick.bam \
{out}/output_denovo/consensus_cluster.renamed.fa \
{out}/mapping/methylation.bed \
{out}/mapping/snp.vcf.gz".split(), out=config["output_dir"])


21 changes: 4 additions & 17 deletions config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -17,10 +17,7 @@ barcodes: "barcodes.tsv"
# the pipeline produces some temporary files. Please indicate the tmp location on your server (in most cases /tmp)
tmpdir : "/tmp"

# some of the steps can be run in parallel. Please set the number of available computing threads on your system
threads: "12"

# mode of running pipeline (set denovo or reference)
# mode of running pipeline (set denovo, reference or legacy. PLEASE NOTE: legacy is not supported)
mode: "reference"

# genome directory (leaave it blank in denovo mode)
Expand All @@ -32,20 +29,10 @@ genome: "consensus_cluster.renamed.fa"
# advanced users have the possibility to change different parameter, leave them blank or write "default" to run them in default mode

# parameters in the denovo reference creation:
# identity: percentage of sequence identity in the last clustering step, in decimal number e.g. for 90% identity write 0.90, default 0.95
# min-depth: minimal cluster depth in the first clustering step to include a cluster, default 0
# max-depth: maximal cluster depth in the first clustering step to include a cluster, default 0
# identity: percentage of sequence identity in the last clustering step, in decimal number e.g. for 95% identity write 0.95, default 0.97
# min-depth: minimal cluster depth in the first clustering step to include a cluster, default 10
# max-depth: maximal cluster depth in the first clustering step to include a cluster, default 10000
param_denovo:
identity: ""
min-depth: ""
max-depth: ""

# parameters in the mpileup step (variant callin)
# max-depth: At a position, read maximally INT reads per input file, default 10000000, check if mpileup automatically reduced the maximal depth even more. You might have to re-run your analysis with a lower max-depth value to avoid this.
# min-MQ: Minimum mapping quality for an alignment to be used, default 0
# min-BQ: Minimum base quality for a base to be considered, default 15

param_SNPcalling:
max-depth: ""
min-MQ: ""
min-BQ: ""
1 change: 0 additions & 1 deletion data/README.md

This file was deleted.

2 changes: 1 addition & 1 deletion src/demultiplex/barcode_stacks.py
Original file line number Diff line number Diff line change
Expand Up @@ -118,7 +118,7 @@ def run_subprocess(cmd, args, log_message):
cmd = "process_radtags -1 %s -2 %s " % (read1,
read2)
cmd += "-b %s -o %s " % (os.path.join(args.outputdir, "barcode_stacks.tsv"), os.path.join(args.outputdir, "clone-stacks"))
cmd += "-r -D --inline_inline --barcode_dist_2 0 "
cmd += "-r -D --inline_inline --barcode_dist_1 0 "
cmd += "--renz_1 %s --renz_2 %s --retain_header --barcode_dist_2 0" % (line2split[bc_dict["ENZ_R1"][0]],
line2split[bc_dict["ENZ_R2"][0]])
log = "Run process_radtags from stacks"
Expand Down
29 changes: 29 additions & 0 deletions src/env/SNP-calling.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
name: SNP-calling
channels:
- conda-forge
- bioconda
- defaults
- r
dependencies:
- _libgcc_mutex=0.1=conda_forge
- _openmp_mutex=4.5=1_gnu
- ca-certificates=2020.12.5=ha878542_0
- certifi=2020.12.5=py36h5fab9bb_1
- libgcc-ng=9.3.0=h2828fa1_18
- libgomp=9.3.0=h2828fa1_18
- ncurses=5.9=10
- openssl=1.0.2u=h516909a_0
- pigz=2.3.4=hed695b0_1
- pip=20.3.3=pyhd8ed1ab_0
- python=3.6.3=4
- python_abi=3.6=1_cp36m
- readline=7.0=0
- samtools=1.3.1=0
- setuptools=49.6.0=py36h5fab9bb_3
- sqlite=3.20.1=2
- tabix=0.2.6=ha92aebf_0
- tk=8.6.10=h21135ba_1
- wheel=0.36.2=pyhd3deb0d_0
- xz=5.2.5=h516909a_1
- zlib=1.2.11=h516909a_1010
prefix: /home/NIOO.INT/fleurg/.conda/envs/SNP-calling
126 changes: 126 additions & 0 deletions src/env/SNP_calling_epiDiverse.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,126 @@
name: snps
channels:
- conda-forge
- bioconda
- defaults
dependencies:
- _libgcc_mutex=0.1=conda_forge
- _openmp_mutex=4.5=1_gnu
- _r-mutex=1.0.1=anacondar_1
- bcftools=1.10.2=h4f4756c_3
- binutils_impl_linux-64=2.34=h2122c62_9
- binutils_linux-64=2.34=h47ac705_27
- bwidget=1.9.14=0
- bz2file=0.98=py_0
- bzip2=1.0.8=h516909a_3
- c-ares=1.16.1=h516909a_3
- ca-certificates=2020.6.20=hecda079_0
- cairo=1.16.0=h3fc0475_1005
- certifi=2020.6.20=py38h32f6830_0
- curl=7.71.1=he644dc0_5
- cycler=0.10.0=py_2
- expat=2.2.9=he1b5a44_2
- fontconfig=2.13.1=h1056068_1002
- freebayes=1.3.2=py38h40864fe_2
- freetype=2.10.2=he06d7ca_0
- fribidi=1.0.10=h516909a_0
- gcc_impl_linux-64=7.5.0=hda68d29_13
- gcc_linux-64=7.5.0=hf34d7eb_27
- gettext=0.19.8.1=hc5be6a0_1002
- gfortran_impl_linux-64=7.5.0=h1104b78_16
- gfortran_linux-64=7.5.0=ha781d05_27
- glib=2.66.0=h0dae87d_0
- graphite2=1.3.13=he1b5a44_1001
- graphviz=2.42.3=h0511662_0
- gsl=2.6=h294904e_0
- gxx_impl_linux-64=7.5.0=h64c220c_13
- gxx_linux-64=7.5.0=ha781d05_27
- harfbuzz=2.7.2=hee91db6_0
- htslib=1.10.2=hd3b49d5_1
- icu=67.1=he1b5a44_0
- jpeg=9d=h516909a_0
- kernel-headers_linux-64=2.6.32=h77966d4_13
- khmer=3.0.0a3=py38hfcc51a0_1
- kiwisolver=1.2.0=py38hbf85e49_0
- krb5=1.17.1=hfafb76e_3
- kwip=0.2.0=h8b12597_3
- lcms2=2.11=hbd6801e_0
- ld_impl_linux-64=2.34=hc38a660_9
- libblas=3.8.0=17_openblas
- libcblas=3.8.0=17_openblas
- libcurl=7.71.1=hcdd3856_5
- libdeflate=1.6=h516909a_0
- libedit=3.1.20191231=he28a2e2_2
- libev=4.33=h516909a_1
- libffi=3.2.1=he1b5a44_1007
- libgcc-ng=9.3.0=h24d8f2e_16
- libgfortran-ng=7.5.0=hdf63c60_16
- libgomp=9.3.0=h24d8f2e_16
- libiconv=1.16=h516909a_0
- liblapack=3.8.0=17_openblas
- libnghttp2=1.41.0=h8cfc5f6_2
- libopenblas=0.3.10=pthreads_hb3c22a3_4
- libpng=1.6.37=hed695b0_2
- libssh2=1.9.0=hab1572f_5
- libstdcxx-ng=9.3.0=hdf63c60_16
- libtiff=4.1.0=hc7e4089_6
- libtool=2.4.6=h516909a_1005
- libuuid=2.32.1=h14c3975_1000
- libwebp-base=1.1.0=h516909a_3
- libxcb=1.13=h14c3975_1002
- libxml2=2.9.10=h68273f3_2
- lz4-c=1.9.2=he1b5a44_3
- make=4.3=h516909a_0
- matplotlib=3.3.1=1
- matplotlib-base=3.3.1=py38h91b0d89_1
- ncurses=6.2=he1b5a44_1
- numpy=1.19.1=py38hbc27379_2
- olefile=0.46=py_0
- openssl=1.1.1g=h516909a_1
- pango=1.42.4=h7062337_4
- parallel=20200722=0
- pcre=8.44=he1b5a44_0
- pcre2=10.35=h2f06484_0
- perl=5.30.3=h516909a_1
- pillow=7.2.0=py38h9776b28_1
- pip=20.2.3=py_0
- pixman=0.38.0=h516909a_1003
- pthread-stubs=0.4=h14c3975_1001
- pyparsing=2.4.7=pyh9f0ad1d_0
- pysam=0.16.0.1=py38hbdc2ae9_1
- python=3.8.5=h1103e12_7_cpython
- python-dateutil=2.8.1=py_0
- python_abi=3.8=1_cp38
- r-base=4.0.2=he766273_1
- r-dotcall64=1.0_0=r40h9bbef5b_1005
- r-fields=11.4=r40h9bbef5b_0
- r-maps=3.3.0=r40hcdcec82_1004
- r-spam=2.5_1=r40hbf399a0_2
- readline=8.0=he28a2e2_2
- samtools=1.10=h2e538c0_3
- screed=1.0.4=pyh9f0ad1d_0
- sed=4.8=hbfbb72e_0
- setuptools=49.6.0=py38h32f6830_0
- six=1.15.0=pyh9f0ad1d_0
- sqlite=3.33.0=h4cf870e_0
- sysroot_linux-64=2.12=h77966d4_13
- tk=8.6.10=hed695b0_0
- tktable=2.10=h555a92e_3
- tornado=6.0.4=py38h1e0a361_1
- wheel=0.35.1=pyh9f0ad1d_0
- xorg-kbproto=1.0.7=h14c3975_1002
- xorg-libice=1.0.10=h516909a_0
- xorg-libsm=1.2.3=h84519dc_1000
- xorg-libx11=1.6.12=h516909a_0
- xorg-libxau=1.0.9=h14c3975_0
- xorg-libxdmcp=1.1.3=h516909a_0
- xorg-libxext=1.3.4=h516909a_0
- xorg-libxpm=3.5.13=h516909a_0
- xorg-libxrender=0.9.10=h516909a_1002
- xorg-libxt=1.1.5=h516909a_1003
- xorg-renderproto=0.11.1=h14c3975_1002
- xorg-xextproto=7.3.0=h14c3975_1002
- xorg-xproto=7.0.31=h14c3975_1007
- xz=5.2.5=h516909a_1
- zlib=1.2.11=h516909a_1009
- zstd=1.4.5=h6597ccf_2
37 changes: 37 additions & 0 deletions src/env/bcftools.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,37 @@
name: bcftools
channels:
- conda-forge
- bioconda
- r
- defaults
dependencies:
- _libgcc_mutex=0.1=conda_forge
- _openmp_mutex=4.5=1_gnu
- bcftools=1.11=h7c999a4_0
- bzip2=1.0.8=h7f98852_4
- c-ares=1.11.0=h470a237_1
- ca-certificates=2020.12.5=ha878542_0
- gsl=2.6=he838d99_2
- htslib=1.11=hd3b49d5_2
- krb5=1.17.2=h926e7f8_0
- libblas=3.9.0=7_openblas
- libcblas=3.9.0=7_openblas
- libcurl=7.71.1=hcdd3856_8
- libdeflate=1.7=h7f98852_5
- libedit=3.1.20191231=he28a2e2_2
- libev=4.33=h516909a_1
- libgcc-ng=9.3.0=h2828fa1_18
- libgfortran-ng=9.3.0=hff62375_18
- libgfortran5=9.3.0=hff62375_18
- libgomp=9.3.0=h2828fa1_18
- libnghttp2=1.41.0=hab1572f_1
- libopenblas=0.3.12=pthreads_h4812303_1
- libssh2=1.9.0=hab1572f_5
- libstdcxx-ng=9.3.0=h6de172a_18
- ncurses=6.2=h58526e2_4
- openssl=1.1.1i=h7f98852_0
- perl=5.32.0=h36c2ea0_0
- tk=8.6.10=h21135ba_1
- xz=5.2.5=h516909a_1
- zlib=1.2.11=h516909a_1010
prefix: /home/NIOO.INT/maartenp/.conda/envs/bcftools
Loading