
Comments (6)

wukong1992 commented on August 19, 2024

@cheyang Done. I run kube-scheduler outside a container, so the IP address in the scheduler-policy-config.json file should be the clusterIP, not '127.0.0.1'.
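
For readers hitting the same problem, here is a minimal sketch of what the relevant part of scheduler-policy-config.json might look like after the change. The field names follow the Kubernetes scheduler Policy extender format. `<CLUSTER_IP>` and `<PORT>` are placeholders, and the `/gpushare-scheduler` path and verb names are assumptions based on the project's sample config, so check them against your own file.

```json
{
  "kind": "Policy",
  "apiVersion": "v1",
  "extenders": [
    {
      "urlPrefix": "http://<CLUSTER_IP>:<PORT>/gpushare-scheduler",
      "filterVerb": "filter",
      "bindVerb": "bind",
      "enableHttps": false,
      "nodeCacheCapable": true,
      "managedResources": [
        {
          "name": "aliyun.com/gpu-mem",
          "ignoredByScheduler": false
        }
      ],
      "ignorable": false
    }
  ]
}
```

The only line that differs from a '127.0.0.1' setup is `urlPrefix`: a kube-scheduler running directly on the host has to reach the extender through an address that is routable from the host.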

from gpushare-scheduler-extender.

cheyang commented on August 19, 2024

I think "aliyun.com/gpu-mem":"10G" should be changed to "aliyun.com/gpu-mem":"10", because the value should be a plain number without a unit suffix. Please take a look at https://github.com/AliyunContainerService/gpushare-scheduler-extender/blob/master/docs/userguide.md


cheyang commented on August 19, 2024

Btw, can you show me the result of kubectl describe po gpushare?


wukong1992 commented on August 19, 2024

@cheyang I set "aliyun.com/gpu-mem":"2", and the result is: "Error: failed to start container "test": Error response from daemon: OCI runtime create failed: container_linux.go:344: starting container process caused "process_linux.go:424: container init caused "process_linux.go:407: running prestart hook 0 caused \"error running hook: exit status 1, stdout: , stderr: exec command: [/usr/bin/nvidia-container-cli --load-kmods configure --ldconfig=@/sbin/ldconfig --device=no-gpu-has-2MiB-to-run --utility --pid=14123 /home/docker/overlay2/b20cfd4438c42624b7b077786963b307545df9e2e205687291a3f5436908b306/merged]\\nnvidia-container-cli: device error: unknown device id: no-gpu-has-2MiB-to-run\\n\""": unknown"


wukong1992 commented on August 19, 2024

@cheyang My Kubernetes version is v1.10.6.


cheyang commented on August 19, 2024

Please run kubectl-inspect-gpushare first to check which GPU memory unit you are using. Judging by the "no-gpu-has-2MiB-to-run" device ID in your error, it looks like MiB.

You can set "aliyun.com/gpu-mem":"1024" to try.
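
As a rough illustration, a pod manifest carrying that request might look like the sketch below. The pod name `gpushare` and container name `test` are taken from earlier in this thread, `<your-image>` is a placeholder, and the rest is an assumed minimal layout rather than the reporter's actual manifest.

```json
{
  "apiVersion": "v1",
  "kind": "Pod",
  "metadata": {
    "name": "gpushare"
  },
  "spec": {
    "containers": [
      {
        "name": "test",
        "image": "<your-image>",
        "resources": {
          "limits": {
            "aliyun.com/gpu-mem": "1024"
          }
        }
      }
    ]
  }
}
```

With MiB as the unit, "1024" requests 1 GiB of GPU memory, whereas the earlier value of "2" asked for only 2 MiB.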

