-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Speaker diarization errors #33
Comments
Few things you can try: disable run the tests exactly as the disable cuda / directml features and check if it works with the CPU. if it works it's an issue in sherpa-onnx. |
Thank you for your response. Disabling Disabling
I did notice this on my Mac earlier but was ultimately unable to reproduce it. This also happened on Ubuntu. I'm unsure of where said NaNs could be coming from. I am using the default config options. |
Have you tried other speaker embedding models? I notice that you only posted info about NeMo models. |
I tried, on Ubuntu x86-64:
On Arm64 macOS, I managed to track down the NaN errors for NeMo to the |
Could you describe how to reproduce it with sherpa-onnx? Please also attach the test wave. |
We need to add an epsilon to the denominator to prevent division by 0 Could you try that? |
I can write some reproduction code. Is C OK?
I actually found that NaNs were showing up far earlier than that line. Have you been able to reproduce (without steps) yet? |
sorry, I need to sleep now. Have not tried to reproduce it yet. Yes, C code is fine for reproducing. |
No problem, thank you for your help. I was able to reproduce the problem on Ubuntu 24.04 x86-64 using the The test wave file: a_pii_removed.wav.zip
|
Please see It should fix this issue. |
Awesome thank you so much. Gave it a try today and it works 😁 I will leave this issue open until the fix has been merged into sherpa-rs as well. |
I am having a lot of trouble with speaker diarization across lots of different platforms and models.
Embeddings model:
nemo_en_titanet_large.onnx
Segmentation model:
sherpa-onnx-pyannote_segmentation-3.0.onnx
andmodels/sherpa-onnx-reverb-diarization-v1.onnx
GPU: RTX 3080
OS: Ubuntu 24.04 x86-64
Embeddings model:
nemo_en_titanet_large.onnx
Segmentation model:
sherpa-onnx-reverb-diarization-v1.onnx
GPU: RTX 3080
OS: Windows 11 x86-64
Embeddings model:
nemo_en_titanet_large.onnx
Segmentation model:
sherpa-onnx-pyannote_segmentation-3.0.onnx
OS: macOS M2 Max Sequoia 15.0.1 Arm64
This was the best information I could get regarding this foreign exception, which Rust wouldn't catch or display:
Sometimes the example does work. I think it's something related to the actual audio I'm trying to diarize. I have attached it below (the sine tones are from PII I have removed btw)
a_pii_removed.wav.zip
The text was updated successfully, but these errors were encountered: