Giter VIP home page Giter VIP logo

rocm_pytorch_informations's Issues

arch linux 5.7.10 kernel panic

Hi!

Nice tutorial, official docs are really bad

but after i installed some of this packages rocm-dkms rocm-libs hipcub rccl miopen-hip i've gotten kernel panic

deinstallation didn't help, only lts kernel was able to boot my system

[   25.053990] kauditd_printk_skb: 12 callbacks suppressed
[   25.053992] audit: type=1131 audit(1595939464.843:24): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[   44.490842] audit: type=1131 audit(1595939484.280:25): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-hostnamed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[   44.642536] audit: type=1334 audit(1595939484.433:26): prog-id=10 op=UNLOAD
[   44.643392] audit: type=1334 audit(1595939484.433:27): prog-id=9 op=UNLOAD
[  319.355039] audit: type=1130 audit(1595939759.143:28): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=getty@tty2 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[  321.613645] audit: type=1100 audit(1595939761.403:29): pid=438 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:authentication grantors=pam_securetty,pam_tally2,pam_shells,pam_unix,pam_permit acct="root" exe="/usr/bin/login" hostname=reache addr=? terminal=tty2 res=success'
[  321.619593] audit: type=1101 audit(1595939761.410:30): pid=438 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:accounting grantors=pam_tally2,pam_access,pam_unix,pam_permit,pam_time acct="root" exe="/usr/bin/login" hostname=reache addr=? terminal=tty2 res=success'
[  321.620133] audit: type=1103 audit(1595939761.410:31): pid=438 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:setcred grantors=pam_securetty,pam_tally2,pam_shells,pam_unix,pam_permit acct="root" exe="/usr/bin/login" hostname=reache addr=? terminal=tty2 res=success'
[  321.620209] audit: type=1006 audit(1595939761.410:32): pid=438 uid=0 old-auid=4294967295 auid=0 tty=tty2 old-ses=4294967295 ses=1 res=1
[  321.620250] audit: type=1300 audit(1595939761.410:32): arch=c000003e syscall=1 success=yes exit=1 a0=3 a1=7ffff4b85ae0 a2=1 a3=7ffff4b857f7 items=0 ppid=1 pid=438 auid=0 uid=0 gid=0 euid=0 suid=0 fsuid=0 egid=0 sgid=0 fsgid=0 tty=tty2 ses=1 comm="login" exe="/usr/bin/login" key=(null)
[  321.620325] audit: type=1327 audit(1595939761.410:32): proctitle=2F62696E2F6C6F67696E002D70002D2D0020202020
[  321.633869] audit: type=1130 audit(1595939761.423:33): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=user-runtime-dir@0 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[  321.638862] audit: type=1101 audit(1595939761.427:34): pid=441 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:accounting grantors=pam_tally2,pam_access,pam_unix,pam_permit,pam_time acct="root" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[  321.638938] audit: type=1103 audit(1595939761.427:35): pid=441 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:setcred grantors=? acct="root" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=failed'

can you help or give right direction at least?

Some rocm queries

It seems AMD is catching up with Nvidia. I am thinking about considering AMD GPUs instead of Nvidia because of the scarcity of 30 series.
Need Some infos.

  1. How is the fp16 performance for ROCm pytorch?
  2. What are the known issues with the rocm_pytorch?
  3. Is there any performance benchmark?

ImportError: libhip_hcc.so.3: cannot open shared object file: No such file or directory

I installed everything as described in Readme

torch seems to be successfully installed, but when I try to import it I get

ImportError: libhip_hcc.so.3: cannot open shared object file: No such file or directory

Do you have any ideas what might be causing that?

Something might didn't work out b4 since when I type

/opt/rocm/opencl/bin/x86_64/clinfo

I get

bash: /opt/rocm/opencl/bin/x86_64/clinfo: No such file or directory

Installed ROCm4.0, but how to use it?

Hey,
I just installed ROCm4.0 from the official pytorch website. How can I now use my radeon gpu for training?
I tried your command:

python3 -c 'import torch;print("DeviceName:",str(torch.cuda.get_device_name(torch.cuda.current_device())))'

which gives the error

Traceback (most recent call last):
File "", line 1, in
File "/home/bryan/.local/lib/python3.6/site-packages/torch/cuda/init.py", line 388, in current_device
_lazy_init()
File "/home/bryan/.local/lib/python3.6/site-packages/torch/cuda/init.py", line 170, in _lazy_init
torch._C._cuda_init()
RuntimeError: No HIP GPUs are available

Any clues what I am doing wrong or could you give me any hint, how to use my gpu after installing ROCm4.0?

Thanks in advance

Download Pytorch sub-modules

I am trying to build pytorch and I ran into an error saying that files in pytorch repository weren't found. With a bit of googling, I found out that this was because I didn't get all the sub-modules which wasn't in the directions.

git submodule update --init

Running that right after the pytorch github repository was downloaded seems to fix it. I was too lazy to create a pull request but will tomorrow adding this direction to the appropriate position.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.