Giter VIP home page Giter VIP logo

amdacceleratorcloudguides's Introduction

Welcome to AMD Accelerator Cloud (AAC) Reference Documentation

Getting Started

Contact your AMD Sponsor to sign up for access to AMD Accelerator Cloud resources.

How to Login to the Web Interface Login

Go to https://aac.amd.com

How to SSH to the Plano Slurm Cluster

  1. From the laptop or system used to generate SSH keys for AAC User Account Registration, enter the following at the Terminal or PowerShell prompt to SSH to the AAC Plano Slurm cluster:
    ssh <your_userid>@aac1.amd.com
    
    The SSH keys should be accessible under $HOME/.ssh directory or via PuTTy or Mobaterm tools used to generate the SSH keys

Contacting AAC Support Team

For questions or support requests, please email them to [email protected]


July 13-14th AAC Maintenance Update Notice for AAC Users


The AAC Web Interface has moved to https://aac.amd.com

1. How to Fix ssh <USERID>@aac1.amd.com failed with Host key verification failed message

The Slurm login node was changed during maintenance, so the host key fingerprint is different. SSH users may see a failure to login with a WARNING message such as one shown below. Please update $HOME/.ssh/known_hosts by removing the existing entries for dell-r08-01 and retry ssh <USERID>@aac1.amd.com and accept the new fingerprints:

ECDSA key fingerprint is SHA256:u1u0/uh0GLcs19KNHrmZIA6EDLMvJACK5y2fMkVg1fg.
ECDSA key fingerprint is MD5:76:6a:a4:34:56:c0:04:fa:7f:84:e6:85:0b:f1:65:e5.

Solution:

  1. First remove existing fingerprint of old Slurm login host: ssh-keygen -R aac1.amd.com
  2. Login to the AAC Plano Slurm cluster: ssh <USERID>@aac1.amd.com
  3. Enter "yes" to accept new host key fingerprint at the prompt to continue to login.

2. How to Fix The selected queue is no longer available error

lhkcojnlehhnkkck

Solution:

The Slurm partition/queue names were changed during the maintenance to remove duplicate queue names and standardize on one set. The new partitition names can be used to allocate single node or a multi-node cluster using the Slurm commands.

1CN128C8G2H_2IB_MI210_RHEL9
1CN128C8G2H_2IB_MI210_RHEL8
1CN128C8G2H_2IB_MI210_SLES15
1CN128C8G2H_2IB_MI210_Ubuntu22
1CN96C8G1H_4IB_MI250_Ubuntu22

amdacceleratorcloudguides's People

Contributors

amddcgpuce avatar sree-harsha-assk avatar naimishared avatar arpitkhard avatar ozziemoreno avatar antentus avatar gurumohan123 avatar jagadish-amd avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.