How to acknowledge and clear Ceph osd crashed errors in Proxmox VE

When a Ceph OSD daemon crashes, you may see the following warning in your Proxmox environment:

HEALTH_WARN: 1 daemons have recently crashed
osd.3 crashed on host sm01d at 2025-01-26T17:43:41.644866Z

Follow these steps to resolve the issue:

1. Check the Crash Logs

First, identify the crash details by listing recent crashes:

ceph crash ls

To get more information about a specific crash:

ceph crash info <CRASH_ID>

Replace <CRASH_ID> with the actual crash ID displayed in the previous command.

2. Acknowledge the Crash Report

After reviewing the details, archive the crash report to clear the warning:

ceph crash archive <CRASH_ID>

To archive all crash reports at once:

ceph crash archive-all

3. Restart the OSD Daemon

If osd.3 is still down, restart it manually:

systemctl restart ceph-osd@3

Check its status:

systemctl status ceph-osd@3

4. Verify Ceph Health

After restarting, check the Ceph cluster’s health:

ceph health detail
ceph -s

If the error persists, review system logs for further investigation:

journalctl -u ceph-osd@3 --no-pager --lines=100