-
Notifications
You must be signed in to change notification settings - Fork 10
/
Copy path__init__.py
104 lines (50 loc) · 1.09 KB
/
__init__.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
"""
data generation
train.csv
val.csv
db structure:
speaker_background_spectrograms/
per speaker part 1/
speaker_clean.pkl
speaker_chatter_i.pkl
per speaker part 2/
speaker_clean.pkl
apeaker_chatter_i.pkl
two_speakers_mix_spectrograms/
per speaker/
clean.pkl
mix_with_other_i.pkl
speaker_video_spectrograms
per_speaker part 1//
clean.pkl
per_speaker part 2/
clean.pkl
chatter audios/
part1/
part2/
part3/
clean audios/
videos/
frames/
pretrained_model/
facenet_model.h5
if save memory:
clean those after processing
model
train
validation results
-----------------------------
add chatter slicer
------------------------------
STEPS:
git clone https://github.com/davidsandberg/facenet.git
pip install face_recognition
sudo apt-get --assume-yes install ffmpeg
sudo apt-get install youtube-dl
install requirements
./prepare_directory
download avspeech dataset and put in data/
python3 data/chatter_download.py
python3 data/chatter_slicer.py
python data/data_download.py
"""