feat: support fish audio TTS #7982

leng-yue · 2024-09-04T12:01:36Z

Checklist:

Important

Please review the checklist below before submitting your pull request.

Please open an issue before creating a PR or link to an existing issue
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I ran dev/reformat(backend) and cd web && npx lint-staged(frontend) to appease the lint gods

Description

This PR add Fish Audio TTS support to dify Close #7984.

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update, included: Dify Document
Improvement, including but not limited to code refactoring, performance optimization, and UI/UX improvement
Dependency upgrade

Testing Instructions

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration

Test TTS with public English Voice, Normal Latency
Test TTS with private English Voice, Low Latency

leslie2046 · 2024-09-05T14:42:44Z

Cannot list my private models?
BTW ,i find that the API will return the private voice list if self=false, and not passing the language param.
@leng-yue

leng-yue · 2024-09-05T17:42:59Z

Cannot list my private models? BTW ,i find that the API will return the private voice list if self=false, and not passing the language param. @leng-yue

Let me take a look

leng-yue · 2024-09-05T21:33:24Z

Cannot list my private models? BTW ,i find that the API will return the private voice list if self=false, and not passing the language param. @leng-yue

I fixed the filtering behavior for language—it now filters based on the model language only, without the title language. I double-checked with self=false and without the language parameter on the API Playground and couldn't reproduce the issue.

leslie2046 · 2024-09-06T00:54:49Z

@leng-yue OK,thank you

leslie2046 · 2024-09-06T02:19:56Z

@leng-yue what's the difference between GPT-SoVITS and fish?which is better?

cubxxw · 2024-09-14T08:49:21Z

I hope that the client can configure TTS when calling the API of DIFY ChatFlow. Is it currently supported? Will you consider supporting it?

    "voice_preferences": {                  // 语音回复的偏好设置 (可选)
        "gender": "female",                 // 声音性别 (female, male)
        "voice_speed": "normal",            // 语速 (slow, normal, fast)
        "emotion": ["neutral"]              // 情绪选择 (neutral, happy, sad, etc.)
    },

momomobinx · 2024-12-24T07:24:05Z

这个好像不兼容本地部署的fish-speech 的api
https://speech.fish.audio/inference/#3-generate-vocals-from-semantic-tokens

python -m tools.api_server
--listen 0.0.0.0:8080
--llama-checkpoint-path "checkpoints/fish-speech-1.5"
--decoder-checkpoint-path "checkpoints/fish-speech-1.5/firefly-gan-vq-fsq-8x1024-21hz-generator.pth"
--decoder-config-name firefly_gan_vq

add fish audio support

74695aa

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. ⚙️ feat:model-runtime labels Sep 4, 2024

leng-yue changed the title ~~add fish audio support~~ feat: support fish audio TTS Sep 4, 2024

add unittest for fish audio

339ebc6

crazywoola approved these changes Sep 5, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Sep 5, 2024

crazywoola merged commit bd09922 into langgenius:main Sep 5, 2024
6 checks passed

ProseGuys pushed a commit to hustyichi/dify that referenced this pull request Sep 5, 2024

feat: support fish audio TTS (langgenius#7982)

69528d4

mehrajagdish pushed a commit to Sbazar-GmbH/dify that referenced this pull request Sep 6, 2024

feat: support fish audio TTS (langgenius#7982)

a463df1

cuiks pushed a commit to cuiks/dify that referenced this pull request Sep 26, 2024

feat: support fish audio TTS (langgenius#7982)

7443d6d

lau-td pushed a commit to heydevs-io/dify that referenced this pull request Oct 23, 2024

feat: support fish audio TTS (langgenius#7982)

9235f62

idonotknow pushed a commit to AceDataCloud/Dify that referenced this pull request Nov 16, 2024

feat: support fish audio TTS (langgenius#7982)

e202bc1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support fish audio TTS #7982

feat: support fish audio TTS #7982

leng-yue commented Sep 4, 2024 •

edited by crazywoola

Loading

leslie2046 commented Sep 5, 2024 •

edited

Loading

leng-yue commented Sep 5, 2024

leng-yue commented Sep 5, 2024

leslie2046 commented Sep 6, 2024

leslie2046 commented Sep 6, 2024

cubxxw commented Sep 14, 2024 •

edited

Loading

momomobinx commented Dec 24, 2024 •

edited

Loading

feat: support fish audio TTS #7982

feat: support fish audio TTS #7982

Conversation

leng-yue commented Sep 4, 2024 • edited by crazywoola Loading

Checklist:

Description

Type of Change

Testing Instructions

leslie2046 commented Sep 5, 2024 • edited Loading

leng-yue commented Sep 5, 2024

leng-yue commented Sep 5, 2024

leslie2046 commented Sep 6, 2024

leslie2046 commented Sep 6, 2024

cubxxw commented Sep 14, 2024 • edited Loading

momomobinx commented Dec 24, 2024 • edited Loading

leng-yue commented Sep 4, 2024 •

edited by crazywoola

Loading

leslie2046 commented Sep 5, 2024 •

edited

Loading

cubxxw commented Sep 14, 2024 •

edited

Loading

momomobinx commented Dec 24, 2024 •

edited

Loading