Comments (4)
from nvoc_by_fullzero_community_release.
@papampi that log seems correct. Where you see "SMI Query output with errors @[GPU2]:" it is the full query output, and it does not contain the only one related to the current gpu. It is correctly initiating a countdown to reboot for every GPU it has failed to query. There are some cases where the query fails for only a single GPU but returns valid results for the others and cases like this one where a major bus failure does prevent querying other devices since nvidia-smi is not giving other results apart from the error message. In this case, watchdog reacted before 60 seconds from the first failure, otherwise tempcontrol would have rebooted the rig by itself.
Therefore, I think it's working as expected. If you see from your logs another example of what happens when a single gpu query fails, you should also see outputs for other non-failing gpus between "SMI Query output with errors" as well.
@leenoox happy to see you back
from nvoc_by_fullzero_community_release.
@leenoox
Cant wait to have you back ...
from nvoc_by_fullzero_community_release.
@LuKePicci
When you get time can you see if we can make watchdog to behave like tempcontrol on problematic GPU to check it once more before rebooting the rig?
from nvoc_by_fullzero_community_release.
Related Issues (20)
- XMR-Stak don't start on 3main launching HOT 12
- NICE_CNV8 & NICE_CNHEAVY not displayed by WTM_SWITCHER HOT 10
- chose cuda version default HOT 1
- nvOC commands autocomplete HOT 3
- Algo and hashrate missing in minerinfo HOT 4
- Invalid default PHI2 miner HOT 1
- Can't add grin using gminer HOT 7
- Could you please add support for nicehash grin and beam using gminer 1.29? HOT 20
- Add support for multiple miner instances to segregate GPUs on single rigs HOT 7
- Please add support for cuckaroo31 using gminer 1.31 to mine grin31 HOT 11
- Could you please add bminer 14.3.1 support for cuckaroo29 and cuckatoo31 (pool and NH) HOT 13
- Aeternity option in gminer should be aeterity not cuckoo. Also please add f2pool as default AE pool HOT 1
- Disabled GPU not working correctly on PhoenixMiner and Claymore HOT 1
- Add support for NBminer 21.0 for grin etc algos HOT 3
- NICE_* address argument missing in miner command line HOT 4
- Common defaults should exploit COIN_MINER_OPTS where possible
- Add support RainbowMiner HOT 6
- nvoc 3.2 - not able to start ethminer HOT 9
- gtx 1660 super HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nvoc_by_fullzero_community_release.