Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
balhafni authored Feb 7, 2024
1 parent 55664c9 commit 9f1e6cb
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -16,10 +16,10 @@ pip install -r requirements.txt
We make the data we use to train and test our models publicly available in this [release](). Details on how the data was obtained are described [here](data).

## Training:
Replicating our 1B Pythia baseline and Prefix models can be done using the [scripts/baseline.sh](scripts/baseline.sh) and [scripts/prefix.sh](scripts/prefix.sh) scripts, respectively.
Replicating our 1B Pythia baseline and Prefix models can be done using the [scripts/baseline.sh](scripts/baseline.sh) and [scripts/prefix.sh](scripts/prefix.sh) scripts, respectively. Both scripts can be used to also replicate the smaller Pythia models we report on in the paper.

## Inference and Evaluation:

Once the models are trained, we run the inference on the Dev set using all the models' checkpoints can be done using the [scripts/inference_checkpoints.sh](scripts/inference_checkpoints.sh). We pick the best checkpoint using the [eval/get_best_checkpoint.py](eval/get_best_checkpoint.py) based on the performance on the Dev set. We then run the inference on the Test set using the best checkpoint by using the [scripts/infernece.sh](scripts/inference.sh) script.


# Hugging Face Integration:
Expand Down

0 comments on commit 9f1e6cb

Please sign in to comment.