Hi, Am 8/21/25 um 10:07 schrieb Miles Goodhew:
It seems odd that the metrics mention NVM/e - I'm guessing that it's just a cross-product test and tries all tools on all devices. SMART test failure is more of an issue. It's a pity the error message is so nondescript. Some things I can think of from simplest to most complicated are: * Are smartmontools installed on the drive host? * Does the monitoring UID have sudo access?
This is not under our control as this is a containerized installation. AFAICS smartmontools are available inside the container.
* Does a manual "sudo smartctl -a /dev/sdc" give the same or similar result?
This works outside the container on the host.
* Is the drive managed by a hardware RAID controller or concentrator (Like Dell PERC or a USB adapter or something)
Yes, but as smartctl gives us values the controller seems not to be the issue.
Regards -- Robert Sander Linux Consultant Heinlein Consulting GmbH Schwedter Str. 8/9b, 10119 Berlin https://www.heinlein-support.de Tel: +49 30 405051 - 0 Fax: +49 30 405051 - 19 Amtsgericht Berlin-Charlottenburg - HRB 220009 B Geschäftsführer: Peer Heinlein - Sitz: Berlin _______________________________________________ ceph-users mailing list -- ceph-users@ceph.io To unsubscribe send an email to ceph-users-le...@ceph.io