DeNovoWEST
is a simulation-based method to test for a statistically significant enrichment of damaging de novo mutations (DNMs) in individual genes. This method scores all classes of variants (e.g. nonsense, missense, splice site) on a unified severity scale based on the empirically-estimated positive predictive value of being pathogenic, and incorporates a gene-based weighting derived from the deficit of protein truncating variants in the general population.
More information on the method can be found in Kaplanis, Samocha, Wiel, Zhang et al Nature 2020.
There are three main components of this repository:
DeNovoWEST
code- Input files needed to recreate the figures in the manuscript
- Code to recreate figures in the manuscript
- A directory to recreate the weights used in
DeNovoWEST
To run DeNovoWEST
, you will need Python 3.x and R. More details are provided in the README
in the DeNovoWEST
directory.
This directory contains the files needed to recreate some of the main text figures from the Kaplanis, Samocha, Wiel, Zhang et al manuscript. Specifically, the following files are provided:
- De novo mutations from 31,058 individuals with a developmental disorder
- Extended
DeNovoWEST
results file with information about per gene mutation rates - Sex information for probands in the study
- Positive predictive value weight files that were generated as part of
DeNovoWEST
- Results of downsampling analysis
- Results from simulations of modelling the number of remaining HI DD-associated genes
- Classifications for likelihood of structural malformation of ultrasound
- Gene features used to generate Figure 2(b) in manuscript
- Results file from DeNovoNear
- Results file from all variant enrichment part of
DeNovoWEST
- Results file from missense variant enrichment part of
DeNovoWEST
- List of diagnostic developmental disorders according to DDG2P
For each of the main text figures, there is a separate Rmarkdown file or directory with the code to generate the figures from the provided input files.
We have provided an Rmarkdown that gives an overview of how to create the positive predictive value (PPV) weights that are used as part of DeNovoWEST
. We have included some warnings about applying this aspect of the pipeline to other datasets in that directory's README.