-
Notifications
You must be signed in to change notification settings - Fork 3
Running sort results on your data
Once you have run hominid
and hominid_stability_selection
, you've
found the SNPs that are significantly correlated with the microbiome
abundances, and the taxa (or other covariates) that
are associated with those SNPs.
To make graphing of those associated taxa's abundances easier
(e.g., to make a boxplot of the transformed abundances
for the three different SNP alleles, like figure 4 from the HOMINID paper)
use hominid_sort_results
to pull out the (transformed) abundances
of those taxa.
hominid_sort_results
runs on a single processor.
Command-line arguments 1 through 6 are all required and are expected in this order.
-
the output file from
hominid_stability_selection
-
the input OTU/taxon table that was used as input to
hominid
andhominid_stability_selection
-
transformation of the input abundance data. Use the same value as in
hominid
andhominid_stability_selection
-
cutoff for R2:
hominid_sort_results
will print out results for SNPs whose R2 is greater than the cutoff. -
cutoff for the stability score.
hominid_sort_results
prints out results for the taxa whose stability score are greater than or equal to the cutoff. -
SNP count: print out results for this many SNPs.
-
optionally print out extra columns from the
hominid_stability_selection
output. If you want to retain the SNP annotation columns in the output ofhominid_sort_results
, include those columns with this optional command-line argument. For example, if you want to include the CHROM, POS, ID, REF, and ALT columns in the output, command-line argument 7 would be specified like this:--extra-columns=CHROM,POS,ID,REF,ALT
To see a sample hominid_sort_results
command, see test_sort_results.sh
The output file is in this format:
GENE_ID SNP_ID rsq_median
stability_score OTU/taxon/covariate
abundance variant_allele_count genotype sample_id [additional columns specified by "--extra-columns" command-line option]
[data row for sample A]
[data row for sample B]
[data row for sample C]
...
The values under the abundance column are the transformed abundances, not the abundances in the input OTU/taxon table.