F5TTS-TensorRT-LLM

截止2024/12/19, 更新了module.py，增加了缺失的rotate_every_two_3dim函数。

该项目主要受https://github.com/DakeQQ/F5-TTS-ONNX 启发，前处理和后处理由于耗时较低并没有进行优化，都用的onnx方案。而backbone部分主要是onnx版的trtllm实现，重写了网络，重写一些trtllm不支持的算子和操作。最终推理性能提高了4倍左右。

需要安装配置好tensorrt-llm环境

（推荐docker方式搭建环境，docker image:nvcr.io/nvidia/pytorch:23.10-py3）

进入容器后
# 卸载TensorRT  
pip uninstall -y tensorrt  
pip uninstall -y torch-tensorrt  
  
pip install mpi4py 
pip install polygraphy

# 重新安装TensorRT
tensorrt：10.6.0

目录中model和example分别对应Tensorrt-LLM源码中的tensorrt_llm/models和example。

在Tensorrt-LLM源码中的tensorrt_llm/models和example目录下分别新建f5tts/ 目录，然后将repo中的代码放入对应的目录。
example/f5tts目录如下。需要通过导出前处理以及后处理的onnx、basic_ref_zh.wav和vocab.txt可从源码repo中获取。

.  
├── F5_Decode.onnx  
├── F5_Preprocess.onnx  
├── __pycache__  
│   └── diffusion.cpython-310.pyc  
├── basic_ref_zh.wav  
├── ckpts  
│   └── model_1200000.pt  
├── convert_checkpoint.py  
├── sample_tts.py  
└── vocab.txt

tensorrt_llm/models/f5tts目录如下。

.  
├── model.py  
├── module.py

在tensorrt_llm/models/init.py导入f5tts

from .f5tts.model import F5TTS

模型并在MODEL_MAP中注册模型。

'F5TTS': F5TTS

1.convert_checkpoint

cd example/f5tts
python convert_checkpoint.py

2.build engine(支持Tensor 并行, --tp_size)

trtllm-build --checkpoint_dir ./tllm_checkpoint/ --remove_input_padding disable --bert_attention_plugin disable
如报参数dtype不一致错误，那是因为我们默认参数是fp16，而网络参数默认需要fp32，需要在tensorrt_llm/parameter.py中将参数默认_DEFAULT_DTYPE = trt.DataType.HALF

3.inference

python sample.py

Reference

F5-TTS-ONNX：https://github.com/DakeQQ/F5-TTS-ONNX
F5-TTS：https://github.com/SWivid/F5-TTS

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
Export_ONNX		Export_ONNX
example		example
model		model
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

F5TTS-TensorRT-LLM

截止2024/12/19, 更新了module.py，增加了缺失的rotate_every_two_3dim函数。

需要安装配置好tensorrt-llm环境

（推荐docker方式搭建环境，docker image:nvcr.io/nvidia/pytorch:23.10-py3）

1.convert_checkpoint

2.build engine(支持Tensor 并行, --tp_size)

3.inference

Reference

About

Releases

Packages

Languages

Bigfishering/f5-tts-trtllm

Folders and files

Latest commit

History

Repository files navigation

F5TTS-TensorRT-LLM

截止2024/12/19, 更新了module.py，增加了缺失的rotate_every_two_3dim函数。

需要安装配置好tensorrt-llm环境

（推荐docker方式搭建环境，docker image:nvcr.io/nvidia/pytorch:23.10-py3）

1.convert_checkpoint

2.build engine(支持Tensor 并行, --tp_size)

3.inference

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages