Code to reproduce analyses in Biswas, Khimulya, Alley et al (2021). Nature Methods.
We use docker to provide a consistent environment and facilitate reprodubility. First build an image:
To use GPU acceleration:
$ docker build -f docker/Dockerfile.gpu -t low-n-gpu .
To use CPU-only:
$ docker build -f docker/Dockerfile.cpu -t low-n-cpu .
Note you will need a GPU in order to completely reproduce the results, especially where UniRep inference is required.
The above has been tested on p3.2xlarge
(which have a NVIDIA V100) AWS instances running Deep Learning AMI (Ubuntu 18.04) Version 30.0 - ami-029510cec6d69f121
.
Here's the repository layout:
-
In
analysis
you'll find the code needed to reproduce the analyses in the paper. Every sub-directory withinanalysis
contains a README for what analyses and figures are covered. Sorry in advance for the sometimes cryptic subdirectory names! We have a few instances of (deeply) hardcoded paths in the code. We didn't change these in order to minimize changes to the original code we wrote, figuring this would help with reproducibility. -
docker
contains files needed to build the docker environment (more instructions below) needed to reproduce analyses and figures. -
requirements
contains the requirements that are pulled into the docker environment.