Comments (5)
Here 1 << 20
means 1MB, which is the workspace size for tensorrt engine building phase.
This one will not impact the runtime GPU memory utilization.
from tensorrtx.
Okay! So, I am using RetinaFace for inference on a video. And it utilizes only 1000 MB during inference. As a result of which processes only at 25 fps. Which is very low for the standards of TensorRT. Can we somehow increase the runtime GPU memory utilization or the 20 fps processing speed??
from tensorrtx.
You can try smaller input size to get faster speed.
from tensorrtx.
Hey @wang-xinyu
Is there a way we can allot more GPU memory to the engine or force the engine to use higher memory?
from tensorrtx.
@aditya-dl I didn't see any TensorRT API can do this.
I think tensorrt is merging the layers and channels during building phase, then calling cudnn and cuda functions to do inference during runtime phase. It aims at saving the GPU memory, and get shorter latency at the same time.
If you want to use more GPU memory to get shorter latency, I think it can't.
If you want to use more GPU memory to get more FPS, you can try increasing the batch size to processing multi images a time.
from tensorrtx.
Related Issues (20)
- A bug about yolov5 decoder HOT 1
- yolov9 for tensorrtx HOT 2
- Yolov8 seg, only X model occured error. HOT 3
- 什么时候支持下,OBB和yolov9 HOT 1
- retinface seems to fail to serialize on Jetson Orin AGX 64 GB
- yolov9 小尺寸模型部署 HOT 2
- RetinaFace, calibration int8, tensorRT8.6.1 error [pluginV2Runner.cpp::execute::265] Error Code 2: Internal Error (Assertion status == kSTATUS_SCUESS failed. ) HOT 7
- 使用yolov5s推理没有检测到目标(空) HOT 5
- showing results empty HOT 5
- YOLOV9 wts转换错误 HOT 16
- yolov8 转wts文件(自定义数据集)失败 HOT 2
- What is warpaffine in relation to yolov9? HOT 1
- [TRT] 2: [utils.cpp::checkMemLimit::380] Error Code 2: Internal Error (Assertion upperBound != 0 failed. Unknown embedded device HOT 3
- yolov5s.engine 推理得到的 bbox 和 pytorch 不一样 HOT 2
- yolov5_seg.cpp 为啥只有预测框 没有 mask 输出
- YOLOv9 build error HOT 12
- yolov8-p2的tensorrtx模型转换 HOT 19
- 如何转换一份engine,在部署环境下,同时支持算力7.5和8.6的显卡 HOT 1
- 我们这个代码库,怎么支持Yolov5su.pt模型啊? HOT 2
- Bug in Yolov8 INT8 Quantization HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tensorrtx.