Speaker Verification task in Voxceleb1 dataset

This repository contains simple scripts for a training i-vector speaker recognition system on Voxceleb1[1] dataset using Kaldi. It was modified based on swshon's work[2]. Note that this experiment is not speaker verification indeed. The scoring is to compute similarity between two test utterances rather than that between an enrolled speaker and a test utterance.

Requirement

Kaldi Toolkit

How to use

Download and unzip audio files from http://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html
Create a directory named voxceleb1 with two subdirectories named train and test. Move dev data to train directory, test data to test directory.
Download List of trial pairs for Verification(http://www.robots.ox.ac.uk/~vgg/data/voxceleb/meta/veri_test.txt). Move it to voxceleb1 dir.
run cmd: ln -fsr "your path to kaldi-trunk/egs/sre08/v1/sid" sid
run cmd: ln -fsr "your path to kaldi-trunk/egs/sre08/v1/steps" steps
run cmd: ln -fsr "your path to kaldi-trunk/egs/sre08v/1/utils" steps
Modify dataset directories and parameters in run.sh file to fit in your machine.
Run run.sh file

Result

The 2048 component GMM-UBM and 600-dimensional i-vector extractor were trained using voxceleb1 training data for verification task. Training parameter is almost same compared to sre10 baseline on Kaldi egs.

GMM-2048 CDS eer : 15.6%
GMM-2048 LDA+CDS eer : 7.937%
GMM-2048 PLDA eer : 5.652%

Note

The Voxceleb1 dataset, a large-scale speaker identification dataset was published in 2017 with speaker embedding baseline[1] and reported i-vector shows 8.8% EER. The i-vector was extracted using 1024 component GMM-UBM, so the EER is fairly worse compared to the result above.

Reference

[1] A. Nagraniy, J. S. Chung, and A. Zisserman, “VoxCeleb: A large-scale speaker identification dataset,” in Interspeech, 2017, pp. 2616–2620.

[2] https://github.com/swshon/voxceleb-ivector

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
conf		conf
data		data
local		local
LICENSE		LICENSE
README.md		README.md
cmd.sh		cmd.sh
path.sh		path.sh
run.sh		run.sh
sid		sid
steps		steps
utils		utils

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speaker Verification task in Voxceleb1 dataset

Requirement

How to use

Result

Note

Reference

About

Releases

Packages

Languages

License

JerryPeng21cuhk/voxceleb-ivector

Folders and files

Latest commit

History

Repository files navigation

Speaker Verification task in Voxceleb1 dataset

Requirement

How to use

Result

Note

Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages