Hi Robert, I'm not an expert on the low-level details and "modern" Ceph, so I hope I don't lead you on any wild goose chases, but I might at least give some leads. It seems odd that the metrics mention NVM/e - I'm guessing that it's just a cross-product test and tries all tools on all devices. SMART test failure is more of an issue. It's a pity the error message is so nondescript. Some things I can think of from simplest to most complicated are: * Are smartmontools installed on the drive host? * Does the monitoring UID have sudo access? * Does a manual "sudo smartctl -a /dev/sdc" give the same or similar result? * Is the drive managed by a hardware RAID controller or concentrator (Like Dell PERC or a USB adapter or something) * (This is a stretch) Is there an OSD for the drive that's given the "NVME" class?
Hope that gives you something. M0les. On Thu, 21 Aug 2025, at 17:15, Robert Sander wrote: > Hi, > > On a new cluster with version 19.2.3 the device health metrics only show a > smartctl error: > > { > "20250821-000313": { > "dev": "/dev/sdc", > "error": "smartctl failed", > "nvme_smart_health_information_add_log_error": "nvme returned an > error: sudo: exit status: 1", > "nvme_smart_health_information_add_log_error_code": -22, > "nvme_vendor": "ata", > "smartctl_error_code": -22, > "smartctl_output": "smartctl returned an error (1): stderr:\nsudo: > exit status: 1\nstdout:\n" > } > } > > The device in question (like all the other in the cluster) is a Samsung > MZ7L37T6 SATA SSD. > > What is happening here? > > Regards > -- > Robert Sander > Linux Consultant > > Heinlein Consulting GmbH > Schwedter Str. 8/9b, 10119 Berlin > > https://www.heinlein-support.de > > Tel: +49 30 405051 - 0 > Fax: +49 30 405051 - 19 > > Amtsgericht Berlin-Charlottenburg - HRB 220009 B > Geschäftsführer: Peer Heinlein - Sitz: Berlin > _______________________________________________ > ceph-users mailing list -- ceph-users@ceph.io > To unsubscribe send an email to ceph-users-le...@ceph.io > _______________________________________________ ceph-users mailing list -- ceph-users@ceph.io To unsubscribe send an email to ceph-users-le...@ceph.io