Public bug reported:

I've a couple of dedicated Hetzner servers with two Samsung NVME drives
each.  After upgrading them to Ubuntu 22.04 LTS every reboot causes
smartd to send me an emails like this:

    From: root
    To: root
    Subject: SMART error (ErrorCount) detected on host: ...
    Date: Fri, 18 Apr 2025 12:02:10 +0200 (CEST)

    This message was generated by the smartd daemon running on:

       host name:  ...
       DNS domain: ...

    The following warning/error was logged by the smartd daemon:

    Device: /dev/nvme1, number of Error Log entries increased from 0 to
1

    Device info:
    SAMSUNG MZVLB512HBJQ-00000, S/N:S4GENX0N71...., FW:EXF7201Q, 512 GB

    For details see host's SYSLOG.

    You can also use the smartctl utility for further investigation.
    Another message will be sent in 24 hours if the problem persists.

The SMART error log is

    # smartctl -l error /dev/nvme0n1
    smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.15.0-138-generic] (local 
build)
    Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

    === START OF SMART DATA SECTION ===
    Error Information (NVMe Log 0x01, 16 of 64 entries)
    Num   ErrCount  SQId   CmdId  Status  PELoc          LBA  NSID    VS
      0          1     0  0x1008  0x4004      -            0     0     -

and the same for /dev/nvme1n1, while on the second server (which has
seen more reboots) the CmdIds differ  between the two drives (0x1000 and
0x1005), the error counts are higher (4), but the status is the same
0x4004.

Internet search about the status code led me to
https://superuser.com/questions/1812742/unable-to-identify-smart-errors-
issues-of-my-nvme-disk#1814225, which described a decoding procedure
that decodes the 0x4004 to "invalid field in command".

My current hypothesis is that these errors are caused by smartd itself
issuing invalid commands to the drives on startup, or something like
that.  This never happened on Ubuntu 20.04 LTS.

ProblemType: Bug
DistroRelease: Ubuntu 22.04
Package: smartmontools 7.2-1ubuntu0.1
ProcVersionSignature: Ubuntu 5.15.0-138.148-generic 5.15.178
Uname: Linux 5.15.0-138-generic x86_64
ApportVersion: 2.20.11-0ubuntu82.6
Architecture: amd64
CasperMD5CheckResult: unknown
Date: Fri Apr 18 13:48:23 2025
SourcePackage: smartmontools
UpgradeStatus: Upgraded to jammy on 2025-03-10 (39 days ago)

** Affects: smartmontools (Ubuntu)
     Importance: Undecided
         Status: New


** Tags: amd64 apport-bug jammy

** Bug watch added: Debian Bug tracker #900244
   https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=900244

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2107644

Title:
  smartd causes ErrorCount to increase on every boot

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/smartmontools/+bug/2107644/+subscriptions


-- 
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to