Giter VIP home page Giter VIP logo

Comments (6)

rashikakheria avatar rashikakheria commented on July 28, 2024

Thanks Dmitry for the patch again and we apologize for overriding the previous changes. I have reviewed the patch and have one critical feedback.

The major blocker I see in testing psm3 provider is availability/access to appropriate hardware. Does your team have any continuous integration system that our PRs can use? This will help us prevent such issues in future.

from aws-ofi-nccl.

dmaryin avatar dmaryin commented on July 28, 2024

Hello Rashika (@rashikakheria),

May I ask whether your test\CI system have RoCE or IB (InfiniBand) network card?
If so, PSM3 can be tested there out of the box, it does not require Intel NICs.
PSM3 is fully opensource and shipped with libfabric.

Accessing to our team's test system is a tough topic, I would like to discuss easier approaches first.

BRs,
Denis

from aws-ofi-nccl.

rashikakheria avatar rashikakheria commented on July 28, 2024

May I ask whether your test\CI system have RoCE or IB (InfiniBand) network card?

No, we don't have access to systems with IB or RoCE network card.

from aws-ofi-nccl.

rashikakheria avatar rashikakheria commented on July 28, 2024

PR merged.

from aws-ofi-nccl.

dmaryin avatar dmaryin commented on July 28, 2024

May I ask whether your test\CI system have RoCE or IB (InfiniBand) network card?

No, we don't have access to systems with IB or RoCE network card.

Hello Rashika (@rashikakheria),
Sorry for late reply. Unfortunately accessing to our team internal CI infrastructure from outside is a topic required thorough discussion (primarily with infosec representatives). We initiated discussion, but currently the answer is that we are not allowed to do this. I will update if something changes, but chances are low.

from aws-ofi-nccl.

rashikakheria avatar rashikakheria commented on July 28, 2024

Hello Rashika (@rashikakheria), Sorry for late reply. Unfortunately accessing to our team internal CI infrastructure from outside is a topic required thorough discussion (primarily with infosec representatives). We initiated discussion, but currently the answer is that we are not allowed to do this. I will update if something changes, but chances are low.

Thanks for getting back! One idea is that you could possibly pull in the changes from aws-ofi-nccl repository for every PR and test it on your end. You could comment on the PRs if your CI system break (until the time this repository has direct access to your testing).

from aws-ofi-nccl.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.