Comments (6)
Restarting of the OSD was successful?
If the OSD process exits, the pod will restart. So if the OSD did not restart after that error, the ceph-osd process must not have exited.
from rook.
@travisn Yes after restarting osd pod it was showing up
before that it was showing out in ceph status but pod was running but it was stuck.
In osd pod we can see below logs :
"
handle_connect_message_2 accept replacing existing(lossy) channel (new one lossy = 1)
no message from osd.x
osd not healthy; waiting to boot
osd is healthy faluse - only 0/12 up peers(less than 33%)
set_numa_affinity unable to identify public interface"
In between there were logs
"feature acting upacting
transitioning to stray"
Lastly it was showing
: /var/lib/ceph/osd/osd-x/block close
fbmap shutdown
but osd pod wasn't restarted
@Rakshith-R can you please help on above to find root cause ?
from rook.
@akash123-eng Was there any active client IO in the cluster? If the OSD's device was closed, the OSD may not notice until it tries to commit the IO. At that point, then it should fail and restart.
from rook.
@travisn yes there was active client io in the cluster
Other osd were working fine
from rook.
@travisn yes there was active client io in the cluster Other osd were working fine
Ok, then not sure. This just happened once, or has it happened multiple times?
from rook.
@travisn yes it happened once for now. but wanted to get behind its root cause
so we should fix it
from rook.
Related Issues (20)
- Ceph monitoring doesn't work when ceph status goes to error state
- Integrate Rook with ceph-csi-operator HOT 4
- Allow configuration of the ceph osd full settings from the CephCluster CR HOT 1
- Is it compatible with rook-ceph-default when the name of cephblockpool is not replicapool ? HOT 1
- rook-ceph-exporters left unschedulable for deleted nodes HOT 15
- exporter Pods for external Ceph cluster fail to start due to missing Secret HOT 2
- Exporter pods for external Ceph cluster are stuck Initializing HOT 2
- After OSD Remove Cluster has unknonw pgs HOT 4
- build issue with go module checksum mismatch in the master branch HOT 5
- Remove holder pod capabilities in Rook v1.16
- [HELP] Why my osd has 300GB RAW USE even there is no pg on the osd HOT 4
- Pool update and pool creation does not honor subFailureDomain HOT 3
- HEALTH_WARN HOT 8
- Compression causes csi-cephfs-plugin to stop functioning. HOT 7
- Failed to set up rook ceph on K3S HOT 2
- pgs degraded and undersized HOT 12
- After rook-ceph clean up, there is leftover process "[ceph-msgr] HOT 4
- External script's mon connection info can break when external Ceph cluster has non-default ports HOT 3
- Secret rook-ceph-crash-collector-keyring Not Found HOT 1
- Prometheus is failing rule evaluations after upgrading to 1.14.6 HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rook.