Skip to content

Does capitalization matter in the dataset? #125

Closed Answered by erew123
Weroxig asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @Weroxig

I ran the finetune.py on my dataset, but looking into the metadata_train.csv and metadata_eval.csv, capitalisation is ignored. When prompting the model, does capitalization matter at all?
It doesn't matter in any way that I am aware of.

Also, can I just change the .csv files after running whisper to modify things in case whisper got it wrong? Is that how it works?
Yes you can change/edit these before moving to the training step.

Thanks

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by Weroxig
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants