Comments (2)
I've found that
ib_write_lat
doesn't support CUDA mode. Wonder whether there is any intrinsic issue that prevents supporting this? I think it should not be CUDA issue because NCCL library is using IB write with GPU. If there isn't a big obstacle, I can help draft a PR to fix this.
Can you share your PR link ?
I remove the error exit, and try to run on A100, it will be crash
and gdb showed that not host memory, so it could be CUDA memory issue
Thanks
from perftest.
Hi, sorry for misleading. I meant I don’t know the key issue to support write latency for CUDA either.
from perftest.
Related Issues (20)
- BRCM N2100GD devid 0x1751 cannot support inline mode.
- Question about BW and PPS? HOT 1
- UD is not supported in perftest? HOT 2
- what does "num_of_calculated_iters *=num_of_qps" mean in WRITE_BW(infinity) ? HOT 3
- Failed to disconnect RDMA CM connection with ib_write_lat test
- ib_write_bw doesnt show BW peak HOT 1
- Multiple machines cannot generate json files HOT 1
- ib_write_bw works fine without -R. But only very small bandwidth when using -R option HOT 1
- ib_write_bw poor GPU performance buy CPU works fine HOT 1
- Compilation issue when HAVE_AES_XTS is not defined HOT 1
- Integer overflow issue HOT 1
- timeout param not work, request retrans from 2ms
- ib_read_bw is far less than 200Gb/s
- Error opening XRC Domain HOT 1
- Test "ib_write_bw" is failing with "Completion with error at client" at client and "ethernet_read_keys: Couldn't read remote address" at server HOT 1
- how to use muti threads HOT 1
- What is the different between bandwidth testing and lantency testing in RDMA?
- couldn't allocate MR while test GDR with cuda. HOT 6
- How the immediate data is transfered? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from perftest.