Comments (12)
Tried the lustre_full example--same result--
Fri Jan 24 09:52:30 PST 2020 : creating vmss: compute
az vmss create: 'os=StandardSSD_LRS' is not a valid value for '--storage-sku'. See 'az vmss create --help'.
from azurehpc.
Looking into this error, stay tuned.
from azurehpc.
The beegfs_local_ssd example deploys for me correctly.
I wonder if you have an out-of-date azurehpc repo?
or
your az cli is too old?
my az cli version is azure-cli 2.0.80
What is the version number of your az cli (az --version)?
If it's old please update it to the latest version.
Also did you pull the latest azurehpc repo? (git pull)
If the az cli and/or azurehpc repo is out-of-date, please update to latest version and try deploying again.
from azurehpc.
from azurehpc.
Hi Tim, yes I think the version you have for az cli is too old. You can verify by
az vm create -h | more
search for the "--storage-sku" command option my description says
"The SKU of the storage account with which to persist VM. Use a
singular sku that would be applied across all disks, or specify
individual disks. Usage: [--storage-sku SKU | --storage-sku
ID=SKU ID=SKU ID=SKU...], where each ID is "os" or a 0-indexed
lun. Allowed values: Standard_LRS, Premium_LRS,
StandardSSD_LRS, UltraSSD_LRS."
I suspect your older version does not support the newer syntax "os=StandardSSD_LRS "etc
Thanks,
Cormac.
from azurehpc.
I think the new version mostly worked to fix the issue. Sorry for the trouble. The cluster creation failed, but I'm not sure if this is even a bug. Any suggestions would be appreciated--thank you, Tim
Mon Jan 27 10:59:40 PST 2020 : creating vm: beegfsm
Mon Jan 27 10:59:50 PST 2020 : creating vmss: beegfssm
Mon Jan 27 11:00:03 PST 2020 : creating vmss: compute
Mon Jan 27 11:00:16 PST 2020 : creating vm: headnode
Mon Jan 27 11:00:29 PST 2020 : waiting for beegfsm to be created
Mon Jan 27 11:01:01 PST 2020 : waiting for beegfssm to be created
Mon Jan 27 11:04:37 PST 2020 : waiting for compute to be created
Mon Jan 27 11:04:39 PST 2020 : waiting for headnode to be created
Mon Jan 27 11:04:40 PST 2020 : getting public ip for headnode
Mon Jan 27 11:04:42 PST 2020 : building hostlists
Mon Jan 27 11:04:45 PST 2020 : building install scripts
rsync azhpc_install_config to headnode60f349.westeurope.cloudapp.azure.com
rsync: connection unexpectedly closed (0 bytes received so far) [sender]
rsync error: unexplained error (code 255) at io.c(605) [sender=3.0.9]
Mon Jan 27 11:06:53 PST 2020 : running the install scripts
Step 0 : install_node_setup.sh (jumpbox_script)
Error: (255) Errors while running azhpc_install_config/install/00_install_node_setup.sh
rsync: connection unexpectedly closed (0 bytes received so far) [Receiver]
rsync error: unexplained error (code 255) at io.c(605) [Receiver=3.0.9]
Mon Jan 27 11:11:08 PST 2020 error : There were errors while running scripts, exiting
from azurehpc.
It looks like you may have some network issues? The error occurred when azhpc tried to copy some scripts/env from your local WS to the headnode you created in Azure.
Can you ssh to the headnode?
ssh -i <YOUR_KEY> [email protected]
from azurehpc.
I could, but then I deleted it all in order to try again. Perhaps being behind a firewall is biting me here. ssh can go through, as it is set up with a proxy, but perhaps the scripts need to have a proxy set somewhere else. Or maybe they need an http proxy. Thanks for looking the error message over. I will try again this afternoon. If there is a place to set a proxy, please let me know. Thank you--Tim
from azurehpc.
Failure in same place. I think it is proxy related. I've succeeded in copying the azhpc_install_config scripts over. Just run them in order? Sorry to clutter the issue database.
from azurehpc.
I would suggest you to use Azure Cloud Shell as it comes with a full working AZ CLI environment. Just clone azhpc from there, you can use "vi" or "code" as editors.
from azurehpc.
Thanks for the follow up. I will try this and use that as a backup plan if this fails.
from azurehpc.
We deviating from the original issue raised. If you still are having problems, please open another issue.
from azurehpc.
Related Issues (20)
- [bug] Pipeline image creation failed - BuildCluster Gen#1, BuildCluster Gen#2
- [bug] Slum_autoscale pipeline failed with headnode connnection refused. HOT 1
- [bug] cc_anf pipeline failed with provisioning failed (InternalServerError) HOT 1
- Unable to create a cluster out of an HPC Image derived from a VHD - package epel-release is not installed epel-release-7-11.noarch HOT 3
- Support OpenPBS 20 HOT 1
- xfs nobarrier is deprecated since kernel 4.13
- [bug] NFS mount fails due issues in nfs.conf HOT 1
- support cyclecloud8 in cc_install.sh HOT 3
- cyclecloud8 config fails on "authorization.check_datastore_permissions"' HOT 1
- Using existing resources: RG, Vnet, Jumpbox etc HOT 4
- [slurm version in AutoScale script] HOT 3
- start_gpu_data_collector.sh script failure when tried to excute HOT 1
- [feature] Add the link of this video in the documentation
- gpu_monitoring: Script returns error on Ubuntu 20.04 LTS [bug]
- [feature] specify subscription through config.json?
- This repo is missing important files
- [bug] "Error with `azhpc-scp` command in `apps/wrf/readme.md` : -r flag unrecognized"
- [bug]: Unable to locate a modulefile for 'spack/spack' in `build-wrf.sh`and `build_wps.sh` HOT 1
- [bug] `azhpc-build` script fails but resources are created, leading to unintended charges
- doesn't connect cluster
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from azurehpc.