Distortion of audio #20

egorsmkv · 2024-01-02T06:24:43Z

I am training pflowtts for Ukrainian with phonemes and seeing the following:

Loss is going down:

What could be causing this, and how can I debug the distortion further?

syj901220 · 2024-01-02T06:43:35Z

Check the duration distribution of your data.
If your dataset contains a lot of short utterances, the 3s masking loss would be devastating.
A short prompt also leads to poor speaker similarity.
I tried a 1s prompt replication method like the one in Hierspeech++, but it didn't work well.
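
One way to check the duration distribution is sketched below. It is not part of pflowtts: the helper names, the `*.wav` directory layout, and the 3-second default threshold are assumptions for illustration, and the standard-library `wave` module only reads uncompressed PCM WAV files.

```python
import wave
from pathlib import Path


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(str(path), "rb") as wf:
        return wf.getnframes() / wf.getframerate()


def summarize_durations(durations, min_seconds=3.0):
    """Report how many clips are shorter than min_seconds, plus basic stats."""
    if not durations:
        return {"count": 0, "short": 0, "short_fraction": 0.0,
                "min": None, "max": None, "mean": None}
    short = sum(1 for d in durations if d < min_seconds)
    return {
        "count": len(durations),
        "short": short,
        "short_fraction": short / len(durations),
        "min": min(durations),
        "max": max(durations),
        "mean": sum(durations) / len(durations),
    }


def scan_dataset(root, min_seconds=3.0):
    """Summarize the durations of every .wav file found under root (hypothetical layout)."""
    durations = [wav_duration_seconds(p) for p in sorted(Path(root).rglob("*.wav"))]
    return summarize_durations(durations, min_seconds)
```

Calling `scan_dataset("path/to/wavs")` on the training set gives the share of clips below the threshold. A large `short_fraction` would suggest looking at the short utterances first.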

egorsmkv · 2024-01-02T07:06:46Z

@syj901220 thanks, I'll look into the data.
