Giter VIP home page Giter VIP logo

cyclecloud-nodehealth's People

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

cyclecloud-nodehealth's Issues

hcheck logging to cyclecloud portal is not working correctly

According to nhc logging /var/log/nhc.log, there is an error but hcheck/send_log is not sending it to the cyclecloud portal.

tail /var/log/nhc.log
Running check: "check_hw_physmem 928858388kB 928858388kB 3%"
Running check: "check_hw_swap 0kB 0kB 3%"
Running check: "check_hw_ib 40 mlx5_an0:1"
ERROR: nhc: Health check failed: check_hw_ib: No IB port mlx5_an0:1 is ACTIVE (LinkUp 40 Gb/sec).

In NHC DETACHED_MODE, hcheck creates /var/log/healthcheck.json, but it reports no errors (exit-code is 0)?
{
"metadata": {
"azEnvironment": "AzurePublicCloud",
"isHostCompatibilityLayerVm": "false",
"location": "southcentralus",
"name": "xxxxx",
"offer": "ubuntu-hpc",
"osType": "Linux",
"platformFaultDomain": "0",
"platformUpdateDomain": "1",
"provider": "Microsoft.Compute",
"publisher": "microsoft-dsvm",
"resourceGroupName": "slm27u2004hc-xxxx",
"resourceId": "/subscriptions/xxxxx",
"subscriptionId": "xxxx",
"version": "20.04.2022121201",
"vmId": "xxxxx",
"vmScaleSetName": "xxxxx",
"vmSize": "Standard_ND96asr_v4",
"zone": ""
},
"testresults": {
"/opt/cycle/nodehealth/nhc-runner.sh": {
"exit-code": 0,
"test-time": 155.6628,
"extra-info": "None",
"message": "Node Health Check detached parent completed successfully (0s).\n"
}
}
}

I think the problem is

hpcadmin@slm27u2004hc-hpc-pg0-1:/opt/cycle/nodehealth$ sudo ./nhc-runner.sh

  • getopts L x
  • OPTIND=0
  • nhc NHC_CHECK_ALL=0 LOGFILE=- MARK_OFFLINE=0
    Node Health Check detached parent completed successfully (0s).
    + exit 0

Also the hcheck tool does not seem to work when nhc is running in DETACHED_MODE=0 (not detached mode), I get this error message "There was an error in launching the script: Script ./nhc-runner.sh timed out.". ?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.