Repositories list
28 repositories
FS-EEND (Public): The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors" [ICASSP 2024] and "LS-EEND: long-form streaming end-to-end neural diarization with online attractor extraction".
UMA-ASR (Public)
RealMAN (Public): A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024].
FN-SSL (Public): The official PyTorch implementation of FN-SSL & IPDnet for sound source localization [INTERSPEECH 2023 & TASLP 2024].
NBSS (Public): The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation.
ATST-SED (Public): This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
SAR-SSL (Public): A Python implementation of "Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer" [TASLP 2024].
ATST-RCT (Public)
audiossl (Public): A library built for easier audio self-supervised training and downstream task evaluation.
RVAE-EM (Public): Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP 2024].
FullSubNet (Public): PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement".
McNet (Public): The official repo of "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement" [ICASSP 2023].
RCT (Public)
Narrowband_DeepFiltering (Public)
RTF_InterFrameSpecSub (Public)
RS_noisePSD (Public)
DP_RTF_SSL (Public)
bss_ctf_lasso (Public)
dereverb_ctf_nonneg (Public)
BSS_CTF_EM (Public)
LSTM-noisePSD (Public)
ctf_mint (Public)
OnlineSSL_DPRTF_EG (Public)
SMIF_online_dereverb (Public)