Skip to content
Change the repository type filter

All

    Repositories list

    • CLI for the radioship_transcriber: create .txt transcripts for .mp3 files using neural networks
      Python
      MIT License
      0100Updated Jan 10, 2024Jan 10, 2024
    • Segment audiofiles, remove noise and music.
      Python
      BSD 2-Clause "Simplified" License
      0000Updated Jan 7, 2024Jan 7, 2024
    • .github

      Public
      0000Updated Dec 8, 2023Dec 8, 2023
    • Filtering stopwords
      Python
      0000Updated Sep 29, 2023Sep 29, 2023
    • Python
      0010Updated Sep 25, 2023Sep 25, 2023
    • Python
      0000Updated Sep 8, 2023Sep 8, 2023
    • Jupyter Notebook
      MIT License
      0000Updated Jul 25, 2023Jul 25, 2023
    • Experimenting with audio preprocessing
      Jupyter Notebook
      0000Updated May 8, 2023May 8, 2023
    • CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
      Python
      MIT License
      133000Updated May 5, 2023May 5, 2023
    • 💩 Profanity means swear words. The adjective is 'profane'. Profanities can also be called curse ("cuss") words, dirty words, bad words, foul language, obscenity, obscene language, or expletives. It can be called swearing, although this also has a normal meaning of making a "solemn promise".
      Python
      36000Updated Apr 25, 2023Apr 25, 2023
    • Python
      0000Updated Apr 22, 2023Apr 22, 2023
    • key phrase extraction, summarization and corpus statistics
      Jupyter Notebook
      0000Updated Mar 10, 2023Mar 10, 2023
    • kenlm

      Public
      KenLM: Faster and Smaller Language Model Queries
      C++
      Other
      514000Updated Mar 5, 2023Mar 5, 2023
    • JavaScript
      0022Updated Feb 21, 2023Feb 21, 2023
    • wordninja

      Public
      Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
      Python
      MIT License
      109000Updated Feb 19, 2023Feb 19, 2023
    • A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
      Python
      MIT License
      177000Updated Feb 15, 2023Feb 15, 2023
    • Start a data science project with modern tools
      Python
      BSD 3-Clause "New" or "Revised" License
      39000Updated Feb 6, 2023Feb 6, 2023
    • HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
      Python
      MIT License
      45000Updated Nov 2, 2022Nov 2, 2022
    • HTML
      0000Updated Sep 29, 2022Sep 29, 2022
    • JavaScript
      2000Updated May 18, 2022May 18, 2022
    • Jupyter Notebook
      MIT License
      32000Updated Feb 22, 2022Feb 22, 2022
    • Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2
      21000Updated May 20, 2019May 20, 2019