these are the NVIDIA Riva C++ and Python clients only (found under /opt/riva/python-clients)
see riva_quickstart_arm64 from NGC to start the core Riva server container first
Riva API reference docs: https://docs.nvidia.com/deeplearning/riva/user-guide/docs/

Start Riva Server

Before doing anything, you should download and run the Riva server container from riva_quickstart_arm64 using riva_start.sh

This will run locally on your Jetson Xavier or Orin device and is supported on JetPack 5. You can disable NLP/NMT in its config.sh and it will use around ~5GB of memory for ASR+TTS. It's then recommended to test the system with these examples under /opt/riva/python-clients

You can also see this helpful video and guide from JetsonHacks for setting up Riva: Speech AI on Jetson Tutorial

List Audio Devices

This will print out a list of audio input/output devices that are connected to your system:

./run.sh --workdir /opt/riva/python-clients $(./autotag riva-client:python) \
   python3 scripts/list_audio_devices.py

You can refer to them in the steps below by either their device number or name. Depending on the sample rate they support, you may also need to set --sample-rate-hz below to a valid frequency (e.g. 16000 44100 48000)

Streaming ASR

./run.sh --workdir /opt/riva/python-clients $(./autotag riva-client:python) \
   python3 scripts/asr/transcribe_mic.py --input-device=24 --sample-rate-hz=48000

You can find more ASR examples to run at https://github.com/nvidia-riva/python-clients#asr

Streaming TTS

./run.sh --workdir /opt/riva/python-clients $(./autotag riva-client:python) \
   python3 scripts/tts/talk.py --stream --output-device=24 --sample-rate-hz=48000 \
     --text "Hello, how are you today? My name is Riva."

You can set the --voice argument to one of the available voices (the default is English-US.Female-1)

Also, you can customize the rate, pitch, and pronunciation of individual words/phrases by including inline SSML in your text.

Loopback

To feed the live ASR transcript into the TTS and have it speak your words back to you:

./run.sh --workdir /opt/riva/python-clients $(./autotag riva-client:python) \
   python3 scripts/loopback.py --input-device=24 --output-device=24 --sample-rate-hz=48000

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs.md

docs.md

Start Riva Server

List Audio Devices

Streaming ASR

Streaming TTS

Loopback

Files

docs.md

Latest commit

History

docs.md

File metadata and controls

Start Riva Server

List Audio Devices

Streaming ASR

Streaming TTS

Loopback