Comments (3)
Do you mind sharing the binary so that we can use it for debugging? Thanks.
from nvbit.
Hi,
using the onnx2trt could also reproduce it. onnx file could be found at: https://media.githubusercontent.com/media/onnx/models/master/vision/classification/resnet/model/resnet50-v2-7.onnx
Then using like instr_count.so:
LD_PRELOAD=./instr_count.so onnx2trt -b 1 -d 16 -w 20000000000 resnet50-v2-7.onnx -o 1.trt
from nvbit.
I am not able to reproduce it with my local binary.
- My binary does not have
trt_ampere_h1688cudnn_128x128_ldg8_relu_exp_small_nhwc_linkable_tn_v1
but only
trt_ampere_h1688cudnn_128x128_ldg8_relu_exp_small_nhwc_tn_v1
,trt_ampere_h1688cudnn_128x128_ldg8_relu_exp_medium_nhwc_tn_v1
andtrt_ampere_h1688cudnn_128x128_ldg8_relu_exp_large_nhwc_tn_v1
. - no crash happened when instrumenting my
trt_ampere_h1688cudnn_128x128_ldg8_relu_exp_*
kernels.
from nvbit.
Related Issues (20)
- TensorFlow stuck in infinite loop when executed with NVBit.
- How to close the WARNING in NVBit
- control flow graph can not be extended
- F2FP or Uf2FP?
- RNNT error, operation not permitted when stream is capturing HOT 2
- Texture Memory Reference HOT 1
- How to flush instruction cache? HOT 1
- What is the maximum number of bits/bytes allowed after nvbit_insert_call?
- Loading NVBit tool with dlopen() explicitly HOT 2
- Finding source line number (in the host code) for cudaMalloc and similar API calls
- Getting PC from NVBit HOT 2
- NVBit hangs when creating Cuda Contexts in parallel (multi-gpu)
- Nvbit misses kernels compared to Nsight products
- Using Nvbit for a specific region of code
- How to use APIs in our own application code
- CUDA 12.0 / Driver > 510 - Unsupported? HOT 1
- NVbit can not work with Lammps? HOT 1
- mem_trace address type
- Non-deterministic behavior
- CUDA 12 and more recent nvidia driver support HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nvbit.