Better speaker clustering #430

rroohhh · 2024-01-05T00:07:42Z

No description provided.

This switches from agglomerative clustering to spectral clustering. Of the "standard" clustering methods, it achieves the best speaker identification for my test data. Furthermore this should closely match what the original paper on speaker identification using the ECAPA-TDNN model uses [1]. I can get better clustering combining something like t-SNE with a "standard" clustering method, however as t-SNE and others do not preserve distances and therefore do not seem like a general solution. [1] Dawalatabad, Nauman, et al. "ECAPA-TDNN embeddings for speaker diarization."

anuejn

LGTM

rroohhh requested review from phlmn, pajowu and anuejn January 5, 2024 00:07

rroohhh force-pushed the better_speaker_clustering branch from 9ef6022 to 79c38f8 Compare January 5, 2024 00:08

anuejn approved these changes Feb 25, 2024

View reviewed changes

rroohhh added this pull request to the merge queue Feb 26, 2024

Merged via the queue into main with commit cc8fecc Feb 26, 2024
2 checks passed

rroohhh deleted the better_speaker_clustering branch February 26, 2024 18:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better speaker clustering #430

Better speaker clustering #430

rroohhh commented Jan 5, 2024

anuejn left a comment

Better speaker clustering #430

Better speaker clustering #430

Conversation

rroohhh commented Jan 5, 2024

anuejn left a comment

Choose a reason for hiding this comment