Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Voice-to-text conversion #617

Closed
ananyag309 opened this issue Oct 27, 2024 · 4 comments
Closed

Voice-to-text conversion #617

ananyag309 opened this issue Oct 27, 2024 · 4 comments
Labels
enhancement New feature or request

Comments

@ananyag309
Copy link
Contributor

Description
This project aims to build a speech recognition model that can convert spoken language (audio input) into written text. The model uses techniques from Natural Language Processing (NLP) and deep learning to process audio data and predict corresponding text. It is based on the principles of speech-to-text algorithms and Recurrent Neural Networks (RNNs).

Model Architecture
The model consists of RNN layers (such as LSTM or GRU) for processing the sequence data.
The final layer is a dense layer with a softmax activation for predicting the probability distribution over the vocabulary.
Connectionist Temporal Classification (CTC) loss function is used to handle the alignment between input audio sequences and output text sequences.

Copy link

Thanks for creating the issue in ML-Nexus!🎉
Before you start working on your PR, please make sure to:

  • ⭐ Star the repository if you haven't already.
  • Pull the latest changes to avoid any merge conflicts.
  • Attach before & after screenshots in your PR for clarity.
  • Include the issue number in your PR description for better tracking.
    Don't forget to follow @UppuluriKalyani – Project Admin – for more updates!
    Tag @Neilblaze,@SaiNivedh26 for assigning the issue to you.
    Happy open-source contributing!☺️

@UppuluriKalyani
Copy link
Owner

@ananyag309 arey ananaya you are raising already existed one's please check and raise

Copy link

Hello @ananyag309! Your issue #617 has been closed. Thank you for your contribution!

@ananyag309
Copy link
Contributor Author

@UppuluriKalyani
We have text to speech, there are none for speech to text.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants