Skip to content

Latest commit

 

History

History
 
 

1_download_metadata

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 

Download RNAseq metadata

This folder contains scripts to download and format all RNA-seq metadata for an organism from NCBI Sequence Read Archive. To simplify the process, we have created a docker container with all pre-requisite software.

Example usage

Docker

The following code finds all RNA-seq data for Bacillus subtilis and saves the data to the file Bacillus_subtilis.tsv. Note that the species name must be enclosed in quotes.

docker run --rm -it avsastry/get-all-rnaseq:latest "Bacillus subtilis" > Bacillus_subtilis.tsv

Local Installation

Local installation requires Entrez direct and Python 3.6 or higher with Pandas. These are included in the conda environment file in the home directory.

./download_metadata.sh "Bacillus subtilis" > Bacillus_subtilis.tsv