An Improved Deep Embedding Learning Method for Short Duration Speaker Verification - PyTorch Implementation

This is a PyTorch implementation of the model (modified cross-conv. pooling) presented by Zhifu Gao et al. in "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification".

I apologize that most of the code other than the model is old and messy, because I have only tried it on a private database. There is no problem with performance or operation, however, as long as the input matches the expected size: batch X 1 X feature dim. X frame.

The model in the original paper is very large. The cross-conv. pooling layer output has 512 x 512 = 262144 dimensions, which forces a small batch size, a long training time, and so on. I recommend using smaller dimensions, about 128 x 128.
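To make the size difference concrete, here is a rough back-of-the-envelope sketch. It assumes, purely for illustration, that the flattened pooling output feeds a single fully connected layer with a 512-dimensional output stored as 32-bit floats; `EMBEDDING_DIM`, `pooled_dim` and `linear_layer_cost` are hypothetical names and are not taken from model_gao.py.

```python
# Rough size estimate for the layer that follows cross-conv. pooling.
# ASSUMPTION: one fully connected layer with EMBEDDING_DIM outputs and
# float32 weights follows the pooling; this is illustrative only.

EMBEDDING_DIM = 512  # hypothetical embedding size


def pooled_dim(channels_a, channels_b):
    """Flattened size of a channels_a x channels_b pooling output."""
    return channels_a * channels_b


def linear_layer_cost(in_dim, out_dim, bytes_per_weight=4):
    """Return (number of weights, size in MiB) of a dense layer without bias."""
    weights = in_dim * out_dim
    return weights, weights * bytes_per_weight / 2**20


for channels in (512, 128):
    dim = pooled_dim(channels, channels)
    weights, mib = linear_layer_cost(dim, EMBEDDING_DIM)
    print(f"{channels} x {channels}: pooled dim = {dim}, "
          f"FC weights = {weights}, about {mib:.0f} MiB")
```

Under these assumptions, moving from 512 x 512 to 128 x 128 shrinks the pooled vector, and any dense layer attached to it, by a factor of 16.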

I hope this code helps researchers reach higher scores.

Data input

batch X 1 X feature dim. X frame.
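As a minimal sketch of this layout, the snippet below reshapes a batch of per-utterance feature matrices into batch X 1 X feature dim. X frame. It assumes the features start out as (batch, frame, feature dim.), which may not match how the data loaders in this repository arrange them; `to_model_input` and the sizes 8, 300 and 40 are hypothetical.

```python
import torch


def to_model_input(features: torch.Tensor) -> torch.Tensor:
    """Reshape (batch, frame, feature_dim) into (batch, 1, feature_dim, frame).

    ASSUMPTION: the incoming layout is (batch, frame, feature_dim).
    """
    if features.dim() != 3:
        raise ValueError(f"expected a 3-D tensor, got {features.dim()}-D")
    # Swap the frame and feature axes, then add a singleton channel axis.
    return features.transpose(1, 2).unsqueeze(1).contiguous()


# 8 utterances, 300 frames each, 40-dimensional features (made-up sizes).
batch = torch.randn(8, 300, 40)
x = to_model_input(batch)
print(tuple(x.shape))  # (8, 1, 40, 300)
```

The singleton second axis plays the role of the image channel, so the 2-D convolutions treat each utterance as a one-channel feature dim. X frame map.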

Credits

Original paper (Gao's paper):

@inproceedings{,
  author    = {Zhifu Gao and Yan Song and Ian McLoughlin and Wu Guo and Lirong Dai},
  title     = {An Improved Deep Embedding Learning Method for Short Duration Speaker Verification},
  booktitle = {Interspeech 2018},
  year      = {2018},
}

This code also uses parts of the following:

my git repository
- Baseline code - data loader and so on.
liorshk's git repository
- FaceNet PyTorch implementation
hbredin's git repository
- VoxCeleb database reader

Features

This repository contributes only the model implementation. The data loader and the rest of the code were recycled from this code

Authors

[email protected] (or [email protected])

qqueing/speaker_embedding-pytorch
