-
Notifications
You must be signed in to change notification settings - Fork 8
/
Readme.txt
22 lines (14 loc) · 1.01 KB
/
Readme.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
1)Run pip install requirements.txt to install the dependencies:
(The GPU versions of pytorch and tensorflow is recommended for faster training and inference)
2) Place an audio wave file for fine-tuning the system in Data/SampleAudio directory.
1) Create an audio directory (named SampleAudio) in Data with the following structure:
>Data
>>SampleAudio
>>>speaker_name
>>>>fileid_subjectname_audiotitle.wav [PCM16 encoded]
Ensure there are no spaces in fileid, subjectname, or audiotitle
We have already created the Data/SampleAudio directory with an audio from a speaker to serve as an example.
2) run preprocess_audio.py (This will preprocess the audio from previous step to make it compatible for fine-tuning the DeepTalk model)
Example: python preprocess_audio.py Data/SampleAudio Data/ProcessedAudio
3) run train_DeepTalk.py (This will use the preprocessed audio to fine-tune the DeepTalk model)
4) A fine-tuned model directory bearing the <speaker_name> should now appear in the trained_models directory