Snakemake workflow: smallRNA Pipeline

Workflow for the analysis of small RNA-seq data for Yan et al., "An endogenous small RNA-binding protein safeguards prime editing" (in press).

The workflow is written using Snakemake and Quarto.

Dependencies are installed using Bioconda where possible.

The workflow consists of two parts: one written in Snakemake, the other composed of Quarto notebooks.

Snakemake workflow

Setup environment and run workflow

Clone workflow into working directory
```
git clone <repository> <dir>
cd <dir>
```
Download input data (or skip and use demo-data)

Copy the FASTQ files into the data directory
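
As a minimal sketch of this step, the commands below copy gzipped FASTQ files into `data/`. The source directory `FASTQ_SRC` and the `.fastq.gz` extension are assumptions for illustration, not part of the workflow; adjust both to match your download, and keep the resulting paths consistent with `config/units.yaml`.

```shell
# Hypothetical download location; override by exporting FASTQ_SRC beforehand.
FASTQ_SRC="${FASTQ_SRC:-$HOME/Downloads/fastq}"

# Create the data directory if it does not exist yet.
mkdir -p data

# Copy only when at least one file matches, so an unexpanded glob
# is never passed to cp.
if ls "$FASTQ_SRC"/*.fastq.gz >/dev/null 2>&1; then
    cp "$FASTQ_SRC"/*.fastq.gz data/
else
    echo "No .fastq.gz files found in $FASTQ_SRC" >&2
fi
```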

Edit the configuration as needed (not needed if using demo-data)

```
# Edit location of fastq files
nano config/units.yaml
# Generally, these can remain unchanged
nano config/samples.yaml
nano config/config.yaml
```
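
Before running the workflow, it can help to confirm that the three configuration files named above are present. The helper below is a hypothetical convenience function, not part of the repository; it only checks that the files exist and does not validate their contents.

```shell
# Hypothetical helper: report which config files exist under a workflow
# directory (defaults to the current directory). Returns 1 if any is missing.
check_config_files() {
    root="${1:-.}"
    status=0
    for f in config/units.yaml config/samples.yaml config/config.yaml; do
        if [ -f "$root/$f" ]; then
            echo "found: $f"
        else
            echo "missing: $f"
            status=1
        fi
    done
    return "$status"
}

check_config_files . || echo "Some config files are missing; run this from the cloned directory." >&2
```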

Install dependencies into isolated environment

```
conda env create -n <project> --file environment.yml
```

Activate environment
```
source activate <project>
```
Execute main workflow (using cluster options is recommended)
```
snakemake --cores 1
```
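
Before a full run, a dry run can show which jobs Snakemake plans to execute. This is a sketch rather than a required step: `--dry-run` and `--printshellcmds` are standard Snakemake options, and the guard around the call is only there so the snippet fails gently when the environment is not active.

```shell
# Options for a preview run: list the planned jobs and their shell
# commands without executing anything.
DRY_RUN_ARGS="--dry-run --printshellcmds --cores 1"

if command -v snakemake >/dev/null 2>&1; then
    # Word splitting of DRY_RUN_ARGS is intentional here.
    snakemake $DRY_RUN_ARGS \
        || echo "Dry run failed; check the config files and input paths." >&2
else
    echo "snakemake not found; activate the conda environment first." >&2
fi
```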

Quarto notebooks

The Quarto notebooks utilize R and are run separately.

Run the workflow as above
Load the R project pe-small-rna-seq-analysis.Rproj in RStudio.
This project uses renv to keep track of installed packages. Install renv if not installed and load dependencies with renv::restore().
Load one of the Quarto notebooks below and either run all of the cells or use the "Render" button in RStudio.
- biotype-comparison.qmd
- fragment-size-distributions.qmd
- alignment_statistics.qmd
- coverage-plots.qmd
- three-prime-quantification.qmd
Some of the notebooks use parameters to generate a few different versions of the plots. If Quarto and all of the required R packages are installed, you can use the render_quarto_reports.sh script to render all of the quarto notebooks.
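
For readers working outside RStudio, the steps above can be sketched from the command line. This is an illustration under stated assumptions, not a replacement for `render_quarto_reports.sh`: the CRAN mirror URL is an arbitrary choice, and the loop renders each notebook with its default parameters only.

```shell
# Restore the R packages recorded in renv.lock, installing renv first if needed.
# The CRAN mirror URL is an assumption; any mirror works.
if command -v Rscript >/dev/null 2>&1 && [ -f renv.lock ]; then
    Rscript \
        -e 'if (!requireNamespace("renv", quietly = TRUE)) install.packages("renv", repos = "https://cloud.r-project.org")' \
        -e 'renv::restore(prompt = FALSE)' \
        || echo "renv restore failed" >&2
fi

# Notebooks listed above, rendered one at a time with default parameters.
NOTEBOOKS="biotype-comparison.qmd fragment-size-distributions.qmd alignment_statistics.qmd coverage-plots.qmd three-prime-quantification.qmd"

if command -v quarto >/dev/null 2>&1; then
    for nb in $NOTEBOOKS; do
        if [ -f "$nb" ]; then
            quarto render "$nb" || echo "Rendering $nb failed" >&2
        else
            echo "Skipping $nb (not found in the current directory)" >&2
        fi
    done
else
    echo "quarto not found; install Quarto or render from RStudio." >&2
fi
```

Quarto's `-P name:value` option can set a notebook parameter at render time; the parameter names themselves have to come from each notebook's YAML header and are not listed here.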

Repository: Princeton-LSI-ResearchComputing/PE-small-RNA-seq-analysis
