Commit

Update kaggle.md
woodthom2 authored Feb 7, 2024
1 parent 8a124ac commit ba61335
Showing 1 changed file with 22 additions and 2 deletions.
24 changes: 22 additions & 2 deletions content/en/blog/kaggle.md
@@ -38,9 +38,29 @@ kaggle competitions download -c harmony-pdf-and-word-questionnaires-extract
 unzip harmony-pdf-and-word-questionnaires-extract.zip
 ```

-5. Run [create_sample_submission.py](https://github.com/harmonydata/pdf-questionnaire-extraction/blob/main/data/create_sample_submission.py) in the folder containing your data. It will create a CSV file `submission.csv`.
+5. Run [create_sample_submission.py](https://github.com/harmonydata/pdf-questionnaire-extraction/blob/main/data/create_sample_submission.py) in the folder containing your data to create your train and test predictions:

-6. Submit your CSV file to Kaggle
+To generate predictions for the training data and write to train_predictions.csv:
+
+```
+python create_sample_submission.py train
+```
+
+To evaluate the train predictions:
+
+```
+python evaluate_train_results.py
+```
+
+6. To modify the prediction logic or inject your own model, you can edit the function `dummy_extract_questions`.
+
+7. To generate predictions for the test data and write to submission.csv:
+
+```
+python create_sample_submission.py test
+```
+
+8. Submit your CSV file to Kaggle

 ```
 kaggle competitions submit -c harmony-pdf-and-word-questionnaires-extract -f submission.csv -m "Message"
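
Step 6 of the updated instructions says the prediction logic lives in `dummy_extract_questions`, but this commit does not show that function's signature or return type. The sketch below is only an illustration of the kind of logic one might swap in. It assumes, hypothetically, that the function receives the plain text of one questionnaire and returns a list of question strings; the name `extract_questions_by_question_mark` and the heuristic itself are made up for this example and are not taken from the repository.

```python
import re
from typing import List

# Hypothetical stand-in for the body of `dummy_extract_questions`.
# Assumption: input is the plain text of one document, output is a list of
# question strings. Check the real script before adapting this.
_LEADING_MARKER = re.compile(r"^\s*(?:\d+[.)]|[-*•])\s*")


def extract_questions_by_question_mark(text: str) -> List[str]:
    """Return every line that ends in a question mark, minus list markers."""
    questions = []
    for line in text.splitlines():
        # Drop a leading "1.", "2)", "-", "*" or bullet, then trim whitespace.
        candidate = _LEADING_MARKER.sub("", line).strip()
        if candidate.endswith("?"):
            questions.append(candidate)
    return questions


if __name__ == "__main__":
    sample = (
        "1. How old are you?\n"
        "Instructions: tick one box\n"
        "- Do you feel nervous?\n"
    )
    print(extract_questions_by_question_mark(sample))
```

A heuristic like this misses questionnaire items phrased as statements (for example "I feel nervous"), so it is a starting point for experimenting with `python create_sample_submission.py train` and `python evaluate_train_results.py`, not a competitive model.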