
Comments (4)

pranavsharma commented on June 3, 2024

Resolved.

from onnxruntime_backend.

zeruniverse commented on June 3, 2024

@pranavsharma Thanks for implementing this, but can arena shrinkage be smarter? Currently the user has to specify where the model is located (e.g. cpu:0), but Triton knows the model's location better than the user does. I suggest that the user just set this value to 0 or 1 and let Triton translate it to cpu:0, gpu:0, gpu:1, etc.
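The translation suggested here could look roughly like the sketch below. It is only an illustration of the idea, not anything Triton or the ORT backend actually does: `arena_shrink_target` is a hypothetical helper, and the only real names it borrows are Triton's `KIND_CPU` and `KIND_GPU` instance kinds and the `cpu:0` / `gpu:N` strings quoted above.

```python
def arena_shrink_target(instance_kind: str, device_id: int = 0) -> str:
    """Hypothetical helper: derive the device string from an instance's placement.

    Assumes CPU instances always map to "cpu:0" and GPU instances map to
    "gpu:<device_id>", matching the examples given in this thread.
    """
    kind = instance_kind.upper()
    if kind == "KIND_CPU":
        return "cpu:0"
    if kind == "KIND_GPU":
        if device_id < 0:
            raise ValueError(f"invalid GPU device id: {device_id}")
        return f"gpu:{device_id}"
    raise ValueError(f"unsupported instance kind: {instance_kind}")


# The user would only switch the feature on or off; the device string is derived.
print(arena_shrink_target("KIND_CPU"))     # cpu:0
print(arena_shrink_target("KIND_GPU", 1))  # gpu:1
```

With something like this, the user-facing value could stay a plain 0 or 1 while the server fills in the device part.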


pranavsharma commented on June 3, 2024

@zeruniverse We would like to mirror the configs from the ORT framework here as much as possible. This has a few advantages.

  1. It allows us to point to existing documentation that is being continually enriched.
  2. Users can search for the config in https://github.com/Microsoft/onnxruntime/issues or get perf recommendations, and when a config is mentioned they won't have to work out how to translate the ORT framework string into an ORT backend config.
  3. It avoids the unnecessary translation of configs from ORT backend to ORT framework invocations. This allows for easy reproducibility of ORT framework related issues. You can paste the exact config and file an issue.

When you configure Triton to run with the ORT backend on CPU, you have to set the instance kind to CPU anyway. I realize you have to repeat it in this string, but that minor inconvenience is a tradeoff against the advantages listed above.


zeruniverse commented on June 3, 2024

@pranavsharma Thanks for your reply! Yes, I understand that for CPU the user can just specify cpu:0. The problem arises when there are multiple GPUs (say 2) and each GPU has one instance of model X. In this case, instance 1 should have the value gpu:0 and instance 2 should have gpu:1. I don't know whether the user would be able to configure this, since I assume parameter settings apply to all instances.
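To make the two-GPU case concrete, here is a small sketch of the values each instance would need. The `Instance` class, the instance names `X_0` and `X_1`, and `per_instance_targets` are all hypothetical and exist only for this illustration; nothing here reflects how Triton stores or applies parameters.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Instance:
    """Hypothetical description of one model instance and the GPU it runs on."""
    name: str
    gpu_id: int


def per_instance_targets(instances):
    """Return the device string each instance would need, keyed by instance name."""
    return {inst.name: f"gpu:{inst.gpu_id}" for inst in instances}


# Two GPUs, one instance of model X on each.
instances = [Instance("X_0", 0), Instance("X_1", 1)]
targets = per_instance_targets(instances)

# A single shared value only works if every instance needs the same string.
shared_value_possible = len(set(targets.values())) == 1

print(targets)
print(shared_value_possible)
```

Here the two instances need different strings, so a single parameter value applied to all instances cannot name the right device for both, which is the concern raised above.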

