Redit tells me:
Critical Warning: 0x04
Turns out, that according to Page 122 of the NVMe Document , Byte 00,
bit 4 (0x04) of the Critical Warning means:
If set to ‘1’, then the volatile memory backup device has failed. This
field is only valid if the controller has a volatile memory backup solution.
~~~~~~~~~~~~~~~~~~~~~~~
webgen@webgen-01:~$ sudo smartctl -a /dev/nvme0
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-6.5.0-21-generic] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Number: SAMSUNG MZVLB512HBJQ-000L2
Serial Number: S4DYNF0M840942
Firmware Version: 3L1QEXF7
PCI Vendor/Subsystem ID: 0x144d
IEEE OUI Identifier: 0x002538
Total NVM Capacity: 512,110,190,592 [512 GB]
Unallocated NVM Capacity: 0
Controller ID: 4
NVMe Version: 1.3
Number of Namespaces: 1
Namespace 1 Size/Capacity: 512,110,190,592 [512 GB]
Namespace 1 Utilization: 384,306,511,872 [384 GB]
Namespace 1 Formatted LBA Size: 512
Namespace 1 IEEE EUI-64: 002538 8891b87f0e
Local Time is: Sat Mar 9 07:51:53 2024 AEST
Firmware Updates (0x16): 3 Slots, no Reset required
Optional Admin Commands (0x0017): Security Format Frmw_DL Self_Test
Optional NVM Commands (0x005f): Comp Wr_Unc DS_Mngmt Wr_Zero
Sav/Sel_Feat Timestmp
Log Page Attributes (0x03): S/H_per_NS Cmd_Eff_Lg
Maximum Data Transfer Size: 512 Pages
Warning Comp. Temp. Threshold: 84 Celsius
Critical Comp. Temp. Threshold: 85 Celsius
Supported Power States
St Op Max Active Idle RL RT WL WT Ent_Lat Ex_Lat
0 + 8.00W - - 0 0 0 0 0 0
1 + 6.30W - - 1 1 1 1 0 0
2 + 3.50W - - 2 2 2 2 0 0
3 - 0.0760W - - 3 3 3 3 210 1200
4 - 0.0050W - - 4 4 4 4 2000 8000
Supported LBA Sizes (NSID 0x1)
Id Fmt Data Metadt Rel_Perf
0 + 512 0 0
=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
- NVM subsystem reliability has been degraded
SMART/Health Information (NVMe Log 0x02)
Critical Warning: 0x04
Temperature: 52 Celsius
Available Spare: 100%
Available Spare Threshold: 10%
Percentage Used: 113%
Data Units Read: 4,391,625,218 [2.24 PB]
Data Units Written: 4,225,854,408 [2.16 PB]
Host Read Commands: 57,320,267,236
Host Write Commands: 6,553,944,614
Controller Busy Time: 136,513
Power Cycles: 1,584
Power On Hours: 7,564
Unsafe Shutdowns: 1,080
<------------------------------------------- this doesn't look good!
Media and Data Integrity Errors: 0
Error Information Log Entries: 1,762
Warning Comp. Temperature Time: 0
Critical Comp. Temperature Time: 0
Temperature Sensor 1: 52 Celsius
Temperature Sensor 2: 52 Celsius
Error Information (NVMe Log 0x01, 16 of 64 entries)
Num ErrCount SQId CmdId Status PELoc LBA NSID VS
0 1762 0 0x0004 0x4004 - 0 0 -
Thanks
P
_______________________________________________
luv-main mailing list -- [email protected]
To unsubscribe send an email to [email protected]