When a Ceph OSD daemon crashes, you may see the following warning in your Proxmox environment:
HEALTH_WARN: 1 daemons have recently crashed
osd.3 crashed on host sm01d at 2025-01-26T17:43:41.644866Z
Follow these steps to resolve the issue:
1. Check the Crash Logs
First, identify the crash details by listing recent crashes:
ceph crash ls
To get more information about a specific crash:
ceph crash info <CRASH_ID>
Replace <CRASH_ID> with the actual crash ID shown in the output of the previous command.
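When several crashes are listed, looking them up one at a time gets tedious. The following is a minimal sketch, not a definitive tool, that prints the details of every crash that has not been archived yet. The helper name crash_ids is hypothetical, and the parsing assumes the plain-text listing has a header row starting with "ID" followed by one crash per line; verify that against your own output before relying on it.

```shell
#!/bin/sh
# Extract crash IDs from `ceph crash ls` style output read on stdin.
# Assumption: a header row whose first column is "ID", then one crash per line.
crash_ids() {
    awk 'NF > 0 && $1 != "ID" { print $1 }'
}

# Only talk to the cluster when the ceph CLI is available on this host.
if command -v ceph >/dev/null 2>&1; then
    # `ceph crash ls-new` lists only crashes that have not been archived.
    ceph crash ls-new | crash_ids | while read -r id; do
        echo "=== $id ==="
        ceph crash info "$id"
    done
fi
```

Keeping the parsing in its own function makes it easy to try against a saved copy of the listing without touching the cluster.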
2. Acknowledge the Crash Report
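After reviewing the details, archive the crash report to clear the warning. An archived report is no longer counted in the health check, but it still appears in the output of ceph crash ls: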
ceph crash archive <CRASH_ID>
To archive all crash reports at once:
ceph crash archive-all
3. Restart the OSD Daemon
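Before restarting anything, it can help to confirm whether the OSD is actually marked down, since a crashed daemon is often restarted automatically by systemd. The sketch below reads the status from ceph osd tree. The helper name osd_status is hypothetical, and it assumes the default plain-text layout in which the status ("up" or "down") is the field immediately after the OSD name.

```shell
#!/bin/sh
# Print the STATUS field for one OSD, reading `ceph osd tree` output on stdin.
# Assumption: the status is the field right after the OSD name (e.g. "osd.3").
osd_status() {
    awk -v name="$1" '{
        for (i = 1; i < NF; i++)
            if ($i == name) { print $(i + 1); exit }
    }'
}

# Only query the cluster when the ceph CLI is available on this host.
if command -v ceph >/dev/null 2>&1; then
    state=$(ceph osd tree | osd_status osd.3)
    echo "osd.3 is ${state:-unknown}"
fi
```

If the OSD is already reported as up, the restart can usually be skipped.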
If osd.3 is still down, restart it manually:
systemctl restart ceph-osd@3
Check its status:
systemctl status ceph-osd@3
4. Verify Ceph Health
After restarting, check the Ceph cluster’s health:
ceph health detail
ceph -s
If the error persists, review system logs for further investigation:
journalctl -u ceph-osd@3 --no-pager --lines=100
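A hundred log lines can still be a lot to read. As a rough sketch, the same output can be narrowed to lines that mention a failure. The helper name error_lines and its keyword list are assumptions chosen for illustration, not an official list of Ceph log markers, so adjust them to what your logs actually contain.

```shell
#!/bin/sh
# Keep only lines that mention common failure keywords (case-insensitive).
# The keyword list is an assumption; extend it as needed.
error_lines() {
    # `|| true` keeps the exit status at 0 when nothing matches.
    grep -iE 'error|fail|abort|assert' || true
}

# Only read the journal when journalctl is available on this host.
if command -v journalctl >/dev/null 2>&1; then
    journalctl -u ceph-osd@3 --no-pager --lines=100 | error_lines
fi
```

If nothing is printed, widen the window with a larger --lines value or read the unfiltered log.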