A Freeswitch module that generates real-time transcriptions on a Freeswitch channel by using AWS streaming transcription API
The freeswitch module exposes the following API commands:
aws_transcribe <uuid> start <lang-code> [interim]
Attaches media bug to channel and performs streaming recognize request.
uuid
- unique identifier of Freeswitch channellang-code
- a valid AWS language code that is supported for streaming transcriptioninterim
- If the 'interim' keyword is present then both interim and final transcription results will be returned; otherwise only final transcriptions will be returned
aws_transcribe <uuid> stop
Stop transcription on the channel.
The plugin will first look for channel variables, then environment variables. If neither are found, then the default AWS profile on the server will be used.
The names of the channel variables and environment variables are:
variable | Description |
---|---|
AWS_ACCESS_KEY_ID | The Aws access key ID |
AWS_SECRET_ACCESS_KEY | The Aws secret access key |
AWS_REGION | The Aws region |
aws_transcribe::transcription
- returns an interim or final transcription. The event contains a JSON body describing the transcription result:
[
{
"is_final": true,
"alternatives": [{
"transcript": "Hello. Can you hear me?"
}]
}
]
When using drachtio-fsrmf, you can access this API command via the api method on the 'endpoint' object.
ep.api('aws_transcribe', `${ep.uuid} start en-US interim`);
You will need to build the AWS C++ SDK. You can use this ansible role, or refer to the specific steps here.