Yolov5s trt engines performance #22

Open
vilmara opened this issue Mar 14, 2021 · 0 comments

vilmara commented Mar 14, 2021

Hi @DanaHan, I am getting unexpected performance when running the YOLOv5s TensorRT (trt) engines with trtexec versus DeepStream (DS), and also when comparing batch size (BS) 1 against BS>1. Please see below:

TensorRT: 7.2.1
DeepStream: 5.1

With trtexec and BS=1:
$ LD_PRELOAD=build/libmyplugins.so /usr/src/tensorrt/bin/trtexec --loadEngine=yolov5s<precision>.engine
Performance:

  • TRT-FP32: 179.7 qps
  • TRT-FP16: 396.9 qps
  • TRT-INT8: 469.9 qps

With DeepStream and BS=1:

  • TRT-FP32: 109.1 FPS
  • TRT-FP16: 91.6 FPS
  • TRT-INT8: 88.9 FPS

TRT-INT8 (built with BS=8), trtexec vs DeepStream (DS):

  • BS=1: 469.9 qps with trtexec | 88.9 FPS with DS
  • BS=8: 753.3 qps with trtexec | 42.7 FPS with DS
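
To put a number on the gap in the TRT-INT8 results above, here is a rough Python sketch. It is only a sketch under stated assumptions: the helper names (`qps_to_images_per_sec`, `gap`) are made up for illustration, and the conversion assumes that one trtexec query may correspond to one full batch, so that images per second = qps × batch size. That relationship is not confirmed anywhere in this thread, so the sketch prints both readings.

```python
# Reported TRT-INT8 numbers from this issue (engine built with BS=8).
REPORTED = [
    # (batch_size, trtexec_qps, deepstream_fps)
    (1, 469.9, 88.9),
    (8, 753.3, 42.7),
]


def qps_to_images_per_sec(qps, batch_size, query_is_full_batch=True):
    """Convert trtexec qps to images per second.

    ASSUMPTION (not confirmed in this thread): if one query runs a full
    batch, images/s = qps * batch_size. If a query is a single image,
    qps is already images/s.
    """
    return qps * batch_size if query_is_full_batch else qps


def gap(trt_images_per_sec, ds_fps):
    """Ratio of trtexec throughput to the FPS reported by DeepStream."""
    return trt_images_per_sec / ds_fps


if __name__ == "__main__":
    for batch_size, qps, ds_fps in REPORTED:
        for full_batch in (True, False):
            ips = qps_to_images_per_sec(qps, batch_size, full_batch)
            reading = "query = full batch" if full_batch else "query = one image"
            print(
                f"BS={batch_size} ({reading}): trtexec {ips:.1f} img/s, "
                f"DS {ds_fps:.1f} FPS, ratio {gap(ips, ds_fps):.1f}x"
            )
```

Under either reading, the trtexec figure is several times higher than the DeepStream FPS, and the ratio grows at BS=8.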

What is the formula to convert qps to FPS? What parameters do I need to set in the DeepStream config files to fix the performance issues?
