This repository contains a Python wrapper for the speech recognition engine used to generate speech transcriptions. These transcriptions are fed to the fusion engine as the primary modality.
An API Hub is simply a router that forwards requests to the respective microservice. This wrapper uses the free ASR services provided by IBM (Watson) and Google Cloud.
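The routing idea described above can be sketched as a small dispatch table. This is only an illustration, not the code in this repository: the `ApiHub` class, its `register` and `route` methods, and the `Transcriber` type are hypothetical names, and the backends are assumed to be plain callables that take audio bytes and return a transcript.

```python
from typing import Callable, Dict

# Hypothetical backend type: takes raw audio bytes, returns a transcript string.
Transcriber = Callable[[bytes], str]


class ApiHub:
    """Minimal router that forwards a request to the named ASR backend."""

    def __init__(self) -> None:
        self._services: Dict[str, Transcriber] = {}

    def register(self, name: str, service: Transcriber) -> None:
        # Map a service name (e.g. "watson") to the callable that handles it.
        self._services[name] = service

    def route(self, name: str, audio: bytes) -> str:
        # Look up the backend and forward the audio; the hub does no ASR itself.
        try:
            service = self._services[name]
        except KeyError:
            raise ValueError(f"unknown ASR service: {name!r}") from None
        return service(audio)
```

In a setup like this, the Watson and Google Cloud wrappers would each be registered under their own name, and callers would only ever talk to the hub.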
- Obtain the `API KEY` and `URL` for the Speech-to-text engine provided by IBM Watson.
- Fill in `config.yaml` using the credentials obtained in the step above.
- Install relevant dependencies.
sudo apt-get install portaudio19-dev python-all-dev python3-all-dev
pip install pyyaml
pip install --upgrade pyaudio
pip install --upgrade ibm-watson
Make sure you also check out the Python SDK provided by Watson Developer Cloud. The code in this repository is taken from that SDK's repository.
Run the code using `python sr.py`.
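The credential steps above can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the contents of `sr.py`: the key names `apikey` and `url` inside `config.yaml` are hypothetical (check the file shipped with the repository for the real ones), and the helper functions are illustrative. The Watson calls used (`IAMAuthenticator`, `SpeechToTextV1`, `set_service_url`) come from the `ibm-watson` Python SDK installed above.

```python
from typing import Any, Dict, Mapping

# Hypothetical key names; the actual config.yaml may use different ones.
REQUIRED_KEYS = ("apikey", "url")


def validate_credentials(config: Mapping[str, Any]) -> Dict[str, str]:
    """Check that both credentials are present and return them as strings."""
    if not isinstance(config, Mapping):
        raise ValueError("config.yaml must contain a mapping of keys to values")
    missing = [key for key in REQUIRED_KEYS if not config.get(key)]
    if missing:
        raise ValueError("config.yaml is missing: " + ", ".join(missing))
    return {key: str(config[key]).strip() for key in REQUIRED_KEYS}


def load_credentials(path: str = "config.yaml") -> Dict[str, str]:
    """Read the YAML file and validate it (requires pyyaml)."""
    import yaml  # imported here so validation can be used without pyyaml

    with open(path, "r", encoding="utf-8") as handle:
        data = yaml.safe_load(handle) or {}
    return validate_credentials(data)


def build_client(credentials: Mapping[str, str]):
    """Create a Watson Speech to Text client (requires ibm-watson)."""
    from ibm_cloud_sdk_core.authenticators import IAMAuthenticator
    from ibm_watson import SpeechToTextV1

    client = SpeechToTextV1(authenticator=IAMAuthenticator(credentials["apikey"]))
    client.set_service_url(credentials["url"])
    return client
```

Calling `build_client(load_credentials())` would then give a client object to send audio to. Validating the file up front means a missing key fails with a readable message instead of an authentication error from the service.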