Comments (3)
The error may be related to the NVCC and CUDA versions. My machine has an A100 GPU with CUDA driver version 545.23.08 and CUDA version 12.3. I encountered the same problem when building the tool with NVCC 12.1.1 and 11.8. However, when I switched to NVCC 11.4.0, the error vanished and the tool worked well:
LD_PRELOAD=../../tools/instr_count_bb/instr_count_bb.so ./vectoradd
------------- NVBit (NVidia Binary Instrumentation Tool v1.5.5) Loaded --------------
NVBit core environment variables (mostly for nvbit-devs):
NVDISASM = nvdisasm - override default nvdisasm found in PATH
NOBANNER = 0 - if set, does not print this banner
---------------------------------------------------------------------------------
START_GRID_NUM = 0 - Beginning of the kernel gird launch interval where to apply instrumentation
END_GRID_NUM = 4294967295 - End of the kernel grid launch interval where to apply instrumentation
COUNT_WARP_LEVEL = 1 - Count warp level or thread level instructions
EXCLUDE_PRED_OFF = 0 - Exclude predicated off instruction from count
ACTIVE_FROM_START = 1 - Start instruction counting from start or wait for cuProfilerStart and cuProfilerStop
MANGLED_NAMES = 1 - Print kernel names mangled or not
TOOL_VERBOSE = 0 - Enable verbosity inside the tool
----------------------------------------------------------------------------------------------------
kernel 0 - _Z6vecAddPdS_S_i - #thread-blocks 98, kernel instructions 50066, total instructions 50066
Final sum = 100000.000000; sum/n = 1.000000 (should be ~1)
Total app instructions: 50066
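For anyone comparing setups, here is a minimal sketch for checking which NVCC release a build would pick up before rebuilding the tool. The helper name `parse_nvcc_release` is hypothetical, and the `/usr/local/cuda-11.4` path in the comments is only an assumption about a typical install location; adjust both to your system.

```shell
# Hypothetical helper: reads `nvcc --version` output on stdin and prints
# the "major.minor" release, taken from the "release X.Y" field.
parse_nvcc_release() {
  sed -n 's/.*release \([0-9][0-9]*\.[0-9][0-9]*\).*/\1/p'
}

# Usage (commented out so the snippet does not require nvcc to be installed):
#   nvcc --version | parse_nvcc_release
#
# To build with a different toolkit, put its bin directory first in PATH.
# The path below is an assumed default install location, not taken from this thread:
#   export PATH=/usr/local/cuda-11.4/bin:$PATH
#   nvcc --version | parse_nvcc_release
```

If the printed release is not the one you expect, the tool was probably built with a different NVCC than intended, so rebuild it after fixing PATH.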
from nvbit.
I got it to work on CUDA 11.7 with NVIDIA driver 545.23.08.
Thanks for the reply. I made it work with CUDA 10.2.