But does it work with other GPUs? Something that only works with Nvidia is far from ideal.
As far as I can see, they only provide Python examples, so my first impression is that making this work in a C++ app will be painful. And does it work without any kind of internet access? Considering Facebook is involved, I have my doubts that this is fully offline. In order to be put in the pipeline, this package has to have a C++ or C API that allows it to receive the audio buffer size and sampling rate every time PipeWire calls the processing callback.
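A rough sketch of the kind of interface that requirement implies. Every name here (`AudioTranslator`, `PassthroughTranslator`, `process`) is hypothetical and taken from neither seamless-streaming nor PipeWire. The placeholder only copies input to output, to show where the buffer size and sampling rate would arrive on each callback:

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical interface: not an API of seamless-streaming or PipeWire.
class AudioTranslator {
public:
    virtual ~AudioTranslator() = default;

    // Meant to be called once per processing callback, with the number of
    // frames in the buffer and the current sampling rate.
    virtual void process(const float* input, float* output,
                         std::size_t n_frames, std::uint32_t sample_rate) = 0;
};

// Placeholder implementation: performs no translation, only copies the
// input buffer to the output buffer and remembers the last parameters.
class PassthroughTranslator final : public AudioTranslator {
public:
    void process(const float* input, float* output,
                 std::size_t n_frames, std::uint32_t sample_rate) override {
        last_n_frames = n_frames;
        last_sample_rate = sample_rate;
        std::copy(input, input + n_frames, output);
    }

    std::size_t last_n_frames = 0;
    std::uint32_t last_sample_rate = 0;
};
```

A real backend would replace the copy with a call into the model, which is exactly the part that the Python-only examples leave open.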
Hey folks!
I recently experimented with https://huggingface.co/facebook/seamless-streaming. You can run it locally or check it out on a Hugging Face space: https://huggingface.co/spaces/facebook/seamless-streaming
This enables real-time translation directly from audio to audio with minimal latency, since it doesn't go through separate steps for converting speech to text, translating the text, and then synthesizing speech.
I thought this project could be a good place to integrate this feature into the effect pipeline, as it's essentially an effect, right? 😀
Imagine...
Moreover, the system requirements aren't unrealistically high: an Nvidia GPU with 8GB of VRAM is sufficient to run the medium model.
If any maintainers are reading this: how could we approach creating a small proof of concept? My expertise lies more in DevOps and automation than in software engineering, but I'm willing to give it a try, of course.
Best regards,
qdrop