Skip to content

A simple API hub for Automatic Speech Recognition (RealTime)

Notifications You must be signed in to change notification settings

uom-cse-realitix/speech_api_hub

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RealitiX Speech Recognition API Hub

This repository contains a python wrapper for Speech Recognition Engine used for generating speech transcriptions. These transcriptions are fed towards the fusion engine as the primary modality.

API Hub

An API Hub is merely a router which forwards the requests to the respective microservice. In this wrapper, free ASR services provided by IBM (Watson) and Google Cloud are used.

Setting up

  1. Obtain the API KEY and URL for Speech-to-text engine provided by IBM Watson.
  2. Fill in config.yaml using the credentials obtained in the step above.
  3. Install relevant dependencies.
sudo apt-get install portaudio19-dev python-all-dev python3-all-dev
pip install pyyaml
pip install --upgrade pyaudio
pip install --upgrade ibm-watson

Make sure you check out the Python-SDK provided by Watson Developer Cloud as well. This code is obtained from that particular repository.

Running

Run the code using python sr.py.

Contributions

  1. IBM Watson.
  2. Pyyaml.

Resources

  1. Google blog on Speech Recognition .

Let us play a game!

<iframe src="https://www.crazygames.com/gameframe/bullet-force-multiplayer" width="900" height="545" marginwidth="0" marginheight="0" hspace="0" vspace="0" frameborder="0" scrolling="no" webkitallowfullscreen="true" mozallowfullscreen="true" msallowfullscreen="true" allowfullscreen="true"></iframe>

About

A simple API hub for Automatic Speech Recognition (RealTime)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages