Copy .env.sample
to .env
and fill in with your own s3 credentials. Modify the S3 remote in .dvc/config
to point to your own s3 bucket. This is used to store intermediate results and pipeline outpus (e.g. models and metrics).
This repo uses vscode devcontainers. If you are using vscode and have the devcontainers extension, there's potentially nothing you have to do to setup an environment. Just open the repo in vscode.
If you have nvidia containers setups (--gpus=all
support), there's nothing to do.
If you don't have nvidia container support, remove "--gpus=all",
option from .devcontainer/devcontainer.json
.
Without devcontainers, just run the following in whatever environment you use.
# Requirements
$ pip install -r requirements.txt
# Install local package
$ pip install -e .
# Get the data
$ dvc pull data/external/*
All training and metrics can be reproduced by running dvc repro
.