This repository contains the code for performing the analyses in:
- R (version 3.6.3 was used in our study)
- R packages
- ImNotaGit/my.utils
- ruppinlab/Rcplex2: required to run genome-scale metabolic modeling (GEM); otherwise may be omitted
- ruppinlab/gembox
- other needed packages can be obtained from CRAN or Bioconductor, please see the R scripts
- IBM ILOG CPLEX Optimization Studio or Gurobi: required to run GEM; otherwise may be omitted (CPLEX 12.10 was used in our study)
The required data can be downloaded by running the dnload.sh
bash script in the data
folder. dnload.sh
uses wget
and gdown
(the latter used for downloading files from Google Drive and can be installed with pip install gdown
). Data files containing results from the various analyses are also included in the download.
Alternatively, one can manually download the data files from this Google Drive link then decompress them into the data
folder. The two single-cell RNA-sequencing datasets from Liao et al. and Chua et al. are quite large and not included in this download. Check dnload.sh
for details on how these two datasets should be downloaded.
The scripts should be run in the same order as they are introduced below. Each script should be run from the directory it is located in as the working directory.
All required data should be saved in this folder.
dnload.sh
: a bash script for downloading the required datacollect.validation.data.R
: prepare the data for validating the MTA prediction
Scripts for gene expression-level analysis:
de.and.gsea.R
: differential expression (DE) and gene set enrichment analysis (GSEA) between the SARS-CoV-2-infected samples and non-infected controls in each datasetde.and.gsea.new.R
: DE and GSEA between the SARS-CoV-2-infected samples and non-infected controls in each dataset -- modified such that DESeq2 was consistently used for most of the bulk RNA-seq datasetsde.and.gsea.remdesivir.R
: DE and GSEA analysis for the Vero E6 cell remdesivir treatment data
Scripts for genome-scale metabolic modeling (GEM) analysis with Recon 1:
prepare.data.R
: prepare data for GEMimat.and.mta.R
: run iMAT and rMTA on each datasetdflux.R
: differential flux analysis between the SARS-CoV-2-infected samples and non-infected controls in each datasetcollect.results.R
: collect the differential flux analysis and rMTA results across datasets
Scripts for GEM analysis with Recon 3D:
prepare.data.R
: prepare data for GEMimat.and.mta.R
: run iMAT and rMTA on each datasetdflux.R
: differential flux analysis between the SARS-CoV-2-infected samples and non-infected controls in each datasetcollect.results.R
: collect the differential flux analysis and rMTA results across datasets
Scripts for GEM analysis on the Vero E6 cell remdesivir treatment data with Recon 1:
prepare.data.R
: prepare data for GEMimat.R
: run iMAT for each experimental groupmta.R
: run rMTA for pairs of experimental groupsdflux.R
: differential flux analysis between pairs of experimental groupscollect.results.R
: collect the differential flux analysis results
Scripts for GEM analysis on the Vero E6 cell remdesivir treatment data with Recon 3D:
prepare.data.R
: prepare data for GEMimat.R
: run iMAT for each experimental groupmta.R
: run rMTA for pairs of experimental groupsdflux.R
: differential flux analysis between pairs of experimental groupscollect.results.R
: collect the differential flux analysis results
Scripts and R notebooks for inspecting the results from various analyses and for validating the rMTA predictions:
functions.R
: various functions, will be sourced in check.mta.results.R and check.mta.results.remdesivir.Rcheck.mta.results.R
: various validations and pathway enrichment analysis of rMTA resultscheck.mta.results.remdesivir.R
: various validations and pathway enrichment analysis of rMTA results for predicting combinatory target with remdesivir- Various
.Rmd
files: R notebook for inspecting and visualizing results from different analyses, see their contents for details