This repository contains all scripts used to generate the results for "Nepali oral microbiomes reflect a gradient of lifestyles from traditional to industrialized." All analyses start in the same base directory. All analyses are conducted in R, unless otherwise specified.
Note: all scripts assume that the working directory is set to a directory called Nepal_oral_microbiome\
. Within Nepal_oral_microbiome\
, there are sub-directories called data\
, output\
, and figures\
. Within data\
, there are directories called fwd\
and filt_path\
. Data is stored in the fwd\
directory, while filt_path\
is empty. Note that, with the exception of the metadata table oralmicrobiome_surveydata.csv
, data files are not included in the repo due to size but are included as supplemental tables in the publication or on SRA under BioProject PRJNA1098228.
Nepal_oral_microbiome
|- README # Description of analysis scripts
|
|- data/ # Any data put into analyses - may be raw or processed (note: not version controlled currently due to size)
| |- fwd/
| |- filt_path/
|
|- output/ # Will contain output from scripts after they are run
|
|- figures/ # Will contain figures generated from scripts after they are run
+
All package info is in the R script package_info.R
oral_phyloseq.Rmd
- clean 16S rRNA gene amplicon sequence data and generate phyloseq objectdecontam.Rmd
- remove potential contaminantsqc.Rmd
- additional QC and cleaning of the phyloseq objectextraction_comparison.Rmd
- compare Qiagen and PowerSoil extraction kit datamicrobiome_characterization.Rmd
- examine metrics for standard microbiome characteristics (alpha and beta diversity)random_forest.Rmd
- use Random Forests to predict lifestyle based on lifestyle survey and microbiome datadifferential_abundance.Rmd
- perform differential abundance analysis with ALDEx2 to identify taxa that differ based on lifestylemicrobiome_trend.Rmd
- perform trend test on all genera to see which microbial abundances follow the lifestyle trendCCA.Rmd
- conduct CCA to identify which specific lifestyle factors correlate with microbiome compositiontaxa_lifestyle.Rmd
- identify significant associations between specific lifestyle factors and DA microbes identified from the trend test.picrust2_prep.Rmd
- prepping data for PICRUSt2picrust_stratified.sh
(shell) - run stratified version of PICRUSt2 to predict pathway abundances.picrust_analysis.Rmd
- analyze PICRUSt2 output. All PICRUSt2 output frompicrust_stratified.sh
is assumed to be stored in its own directorypicrust2_qiagen_output\
network_analysis.Rmd
- conduct network analysis of the microbiome using SparCCgut_oral_comparison.Rmd
- examine the relationship between the oral and gut microbiomes