Bug#900244: NVM error information log entry count increase not an error

2021-09-30 Thread Benjamin Poirier
I also see this issue with the following drive, nvme1 on my system:

Model Number:   Samsung SSD 970 EVO Plus 1TB
Firmware Version:   2B2QEXM7
PCI Vendor/Subsystem ID:0x144d

This has been going on for months and until recently the output of
`smartctl -l error /dev/nvme1` showed

=== START OF SMART DATA SECTION ===
Error Information (NVMe Log 0x01, 16 of 64 entries)
No Errors Logged

However now it shows

=== START OF SMART DATA SECTION ===
Error Information (NVMe Log 0x01, 16 of 64 entries)
Num   ErrCount  SQId   CmdId  Status  PELoc  LBA  NSIDVS
  0   3401 0  0x0006  0x4004  -0 0 -

`nvme smart-log /dev/nvme1` shows
.
 Entry[ 0]
.
error_count : 3401
sqid: 0
cmdid   : 0x6
status_field: 0x4004(INVALID_FIELD: A reserved coded value or an 
unsupported value in a defined field)
parm_err_loc: 0x
lba : 0
nsid: 0
vs  : 0
trtype  : The transport type is not indicated or the error is not 
transport related.
cs  : 0
trtype_spec_info: 0
.

The other entries are not populated, they all have
status_field: 0(SUCCESS: The command completed successfully)

The same system also has another drive, /dev/nvme0:
Model Number:   Samsung SSD 980 PRO 1TB
Firmware Version:   2B2QGXA7
PCI Vendor/Subsystem ID:0x144d

That drive does not show this problem, `smartctl -A /dev/nvme0` remains
at
Error Information Log Entries:  0

In my case this is a dual boot system; nvme1 is used for Windows 10 and
rarely mounted in Linux. However in the past I was also using nvme1 for
my linux root partition and the error count was also increasing. Like
others have said, the error count on nvme1 seems to increase after
reboots though not systematically.

I have edited /etc/smartmontools/run.d/10mail to filter out the messages
about nvme1 error log entries.



Bug#900244: NVM error information log entry count increase not an error

2021-04-15 Thread alain
Package: smartmontools
Followup-For: Bug #900244
X-Debbugs-Cc: compte.perso.de-al...@bbox.fr

$ apt policy smartmontools
smartmontools:
  Installé : 7.2-1
  Candidat : 7.2-1

$ sudo smartctl -i -l error /dev/nvme0n1
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.10.0-6-amd64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:   CT1000P1SSD8
Serial Number:  2006E289B7B7
Firmware Version:   P3CR013
PCI Vendor/Subsystem ID:0xc0a9
IEEE OUI Identifier:0x00a075
Controller ID:  1
NVMe Version:   1.3
Number of Namespaces:   1
Namespace 1 Size/Capacity:  1 000 204 886 016 [1,00 TB]
Namespace 1 Formatted LBA Size: 512
Namespace 1 IEEE EUI-64:00a075 20e289b7b7
Local Time is:  Thu Apr 15 12:04:03 2021 CEST

=== START OF SMART DATA SECTION ===
Error Information (NVMe Log 0x01, 16 of 256 entries)
No Errors Logged

-- Journal begins at Sun 2021-04-04 09:01:07 CEST, ends at Thu 2021-04-15
12:06:08 CEST. --
avril 15 11:08:24 sid pulseaudio[1966]: GetManagedObjects() failed:
org.freedesktop.DBus.Error.NoReply: Did not receive a reply. Possible causes
include: the remote application did not send a>
avril 15 11:08:02 sid gdm-launch-environment][1022]: pam_systemd(gdm-launch-
environment:session): Failed to release session: Appel système interrompu
avril 15 11:08:01 sid hp-systray[2534]: hp-systray(qt5)[2534]: error: Unable to
find hp-upgrade --notify on PATH.
avril 15 11:07:59 sid pulseaudio[1090]: GetManagedObjects() failed:
org.freedesktop.DBus.Error.TimedOut: Failed to activate service 'org.bluez':
timed out (service_start_timeout=25000ms)
avril 15 11:07:59 sid pipewire[1965]: Failed to receive portal pid:
org.freedesktop.DBus.Error.NameHasNoOwner: Could not get PID of name
'org.freedesktop.portal.Desktop': no such name
avril 15 11:07:58 sid gdm-password][1940]: gkr-pam: unable to locate daemon
control file
avril 15 11:07:33 sid pipewire[1089]: Failed to receive portal pid:
org.freedesktop.DBus.Error.NameHasNoOwner: Could not get PID of name
'org.freedesktop.portal.Desktop': no such name
avril 15 11:07:33 sid systemd[913]: radeon-profile-daemon.service: Failed at
step EXEC spawning /usr/bin/radeon-profile-daemon: No such file or directory

this bug seems not to exist anymore

-- System Information:
Debian Release: 11.0
  APT prefers testing-security
  APT policy: (500, 'testing-security'), (500, 'unstable'), (100, 'testing'),
(100, 'stable'), (1, 'experimental')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 5.10.0-6-amd64 (SMP w/16 CPU threads)
Kernel taint flags: TAINT_OOT_MODULE, TAINT_UNSIGNED_MODULE
Locale: LANG=fr_FR.UTF-8, LC_CTYPE=fr_FR.UTF-8 (charmap=UTF-8), LANGUAGE not
set
Shell: /bin/sh linked to /usr/bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages smartmontools depends on:
ii  debianutils  4.11.2
ii  libc62.31-11
ii  libcap-ng0   0.7.9-2.2+b1
ii  libgcc-s110.2.1-6
ii  libselinux1  3.1-3
ii  libstdc++6   10.2.1-6
ii  libsystemd0  247.3-5
ii  lsb-base 11.1.0

smartmontools recommends no packages.

Versions of packages smartmontools suggests:
ii  curl   7.74.0-1.2
ii  gpg2.2.27-1
pn  gsmartcontrol  
ii  mailutils [mailx]  1:3.11.1-5
pn  smart-notifier 
ii  wget   1.21-1+b1


Bug#900244: NVM error information log entry count increase not an error

2020-10-03 Thread Günter Frenz
Hi,

I don't know if this is the same bug as described above: since some time
I get the same message via mail always on system-boot but my error
log seems to be empty:

corinnis:~# smartctl -i -l error /dev/nvme0n1
smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.8.0-2-amd64] (local build)
Copyright (C) 2002-19, Bruce Allen, Christian Franke,
www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:   Samsung SSD 970 EVO 500GB
Serial Number:  S466NX0M112847J
Firmware Version:   2B2QEXE7
PCI Vendor/Subsystem ID:0x144d
IEEE OUI Identifier:0x002538
Total NVM Capacity: 500.107.862.016 [500 GB]
Unallocated NVM Capacity:   0
Controller ID:  4
Number of Namespaces:   1
Namespace 1 Size/Capacity:  500.107.862.016 [500 GB]
Namespace 1 Utilization:174.323.851.264 [174 GB]
Namespace 1 Formatted LBA Size: 512
Namespace 1 IEEE EUI-64:002538 5191b033dd
Local Time is:  Sat Oct  3 10:37:34 2020 CEST

=== START OF SMART DATA SECTION ===
Error Information (NVMe Log 0x01, max 64 entries)
No Errors Logged

best regards

Günter


-- 
---
Günter Frenz
Börschgasse 16a, D-51143 Köln
(h) gu...@guefz.de, gu...@freenet.de
(w) f...@gso-koeln.de
---




pgpvStth1L55W.pgp
Description: Digitale Signatur von OpenPGP


Bug#900244: NVM error information log entry count increase not an error

2019-02-18 Thread Robert Scheer
Hello, I can provide such output. I have the same problem on 2 (of 32)
identical servers running Debian stretch. The smartmontools software
on all of them is version 6.6-1 from Debian buster, rebuilt on stretch
with CONFIGURE_ARGS=--with-nvme-devicescan

Here are a fews consecutive outputs of "smartctl -i -l error,100 /dev/nvme0"
and the system information of one of those servers.

I'd be happy to provide more info if you need it.

Regards,
Robert Scheer

-- System Information:
Debian Release: 9.8
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'stable')
Architecture: amd64 (x86_64)

Kernel: Linux 4.9.0-8-amd64 (SMP w/16 CPU cores)
Locale: LANG=C, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), LANGUAGE=C (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
smartctl 6.6 2017-11-05 r4594 [x86_64-linux-4.9.0-8-amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:   INTEL SSDPEKKA512G7
Serial Number:  BTPY71930KCA512F
Firmware Version:   PSF119D
PCI Vendor/Subsystem ID:0x8086
IEEE OUI Identifier:0x5cd2e4
Controller ID:  1
Number of Namespaces:   1
Namespace 1 Size/Capacity:  512,110,190,592 [512 GB]
Namespace 1 Formatted LBA Size: 512
Local Time is:  Fri Feb 15 19:01:01 2019 CET

=== START OF SMART DATA SECTION ===
Error Information (NVMe Log 0x01, max 64 entries)
Num   ErrCount  SQId   CmdId  Status  PELoc  LBA  NSIDVS
  0 19 7  0x002e  0x0004  0x0180 1 -
  1 18 1  0x009b  0x0004  0x0180 1 -
  2 17 1  0x009b  0x0004  0x0180 1 -
  3 16 7  0x002e  0x0004  0x0180 1 -
  4 15 5  0x00f5  0x0004  0x0180 1 -
  5 14 6  0x00e4  0x0004  0x0180 1 -
  6 13 6  0x00e4  0x0004  0x0180 1 -
  7 12 8  0x003e  0x0004  0x0180 1 -
  8 11 8  0x003e  0x0004  0x0180 1 -
  9 10 7  0x00f7  0x0004  0x0180 1 -
 10  9 1  0x00dd  0x0004  0x0180 1 -
 11  8 1  0x00dd  0x0004  0x0180 1 -
 12  7 1  0x00dd  0x0004  0x0180 1 -
 13  6 4  0x0046  0x0004  0x0180 1 -
 14  5 5  0x0046  0x0004  0x0180 1 -
 15  4 6  0x00e4  0x0004  0x0180 1 -
 16  3 6  0x00e4  0x0004  0x0180 1 -
 17  2 6  0x00e4  0x0004  0x0180 1 -
 18  1 6  0x00e4  0x0004  0x0180 1 -

smartctl 6.6 2017-11-05 r4594 [x86_64-linux-4.9.0-8-amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:   INTEL SSDPEKKA512G7
Serial Number:  BTPY71930KCA512F
Firmware Version:   PSF119D
PCI Vendor/Subsystem ID:0x8086
IEEE OUI Identifier:0x5cd2e4
Controller ID:  1
Number of Namespaces:   1
Namespace 1 Size/Capacity:  512,110,190,592 [512 GB]
Namespace 1 Formatted LBA Size: 512
Local Time is:  Sat Feb 16 01:01:01 2019 CET

=== START OF SMART DATA SECTION ===
Error Information (NVMe Log 0x01, max 64 entries)
Num   ErrCount  SQId   CmdId  Status  PELoc  LBA  NSIDVS
  0 20 5  0x00cf  0x0004  0x0180 1 -
  1 19 7  0x002e  0x0004  0x0180 1 -
  2 18 1  0x009b  0x0004  0x0180 1 -
  3 17 1  0x009b  0x0004  0x0180 1 -
  4 16 7  0x002e  0x0004  0x0180 1 -
  5 15 5  0x00f5  0x0004  0x0180 1 -
  6 14 6  0x00e4  0x0004  0x0180 1 -
  7 13 6  0x00e4  0x0004  0x0180 1 -
  8 12 8  0x003e  0x0004  0x0180 1 -
  9 11 8  0x003e  0x0004  0x0180 1 -
 10 10 7  0x00f7  0x0004  0x0180 1 -
 11  9 1  0x00dd  0x0004  0x0180 1 -
 12  8 1  0x00dd  0x0004  0x0180 1 -
 13  7 1  0x00dd  0x0004  0x0180 1 -
 14  6 4  0x0046  0x0004  0x0180 1 -
 15  5 5  0x0046  0x0004  0x0180 1 -
 16  4 6  0x00e4  0x0004  0x0180 1 -
 17  3 6  0x00e4  0x0004  

Bug#900244: NVM error information log entry count increase not an error

2018-05-27 Thread martin f krafft
Package: smartmontools
Version: 6.5+svn4324-1
Severity: normal
File: /usr/sbin/smartd

As of late, I get

  Device: /dev/nvme0, number of Error Log entries increased from 373 to 374

mailed to me every day. However, this is not an error log, but an
error "information" log, and used by NVM apparently also for simple
state logging, i.e. a new entry being appended to the log does not
actually constitute an error.

I'll try to figure out how to filter this stuff using smartd.conf,
but filing the issue here as I think the default handling of NVMs
should be fixed accordingly.

Thanks,
martin

-- System Information:
Debian Release: buster/sid
  APT prefers unstable
  APT policy: (500, 'unstable'), (1, 'experimental')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 4.16.0-1-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_NZ, LC_CTYPE=en_NZ.UTF-8 (charmap=UTF-8), LANGUAGE=en_NZ:en 
(charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)

Versions of packages smartmontools depends on:
ii  debianutils  4.8.4
ii  init-system-helpers  1.51
ii  libc62.27-3
ii  libcap-ng0   0.7.9-1
ii  libgcc1  1:8.1.0-2
ii  libselinux1  2.7-2+b2
ii  libstdc++6   8.1.0-2
ii  lsb-base 9.20170808

Versions of packages smartmontools recommends:
ii  bsd-mailx [mailx]  8.1.2-0.20160123cvs-4

Versions of packages smartmontools suggests:
pn  gsmartcontrol   
pn  smart-notifier  

-- Configuration Files:
/etc/smartd.conf changed [not included]

-- debconf-show failed


-- 
 .''`.   martin f. krafft  @martinkrafft
: :'  :  proud Debian developer
`. `'`   http://people.debian.org/~madduck
  `-  Debian - when you have better things to do than fixing systems


digital_signature_gpg.asc
Description: Digital GPG signature (see http://martin-krafft.net/gpg/sig-policy/999bbcc4/current)