Just to give my 2ยข, I'm going to guess it is something to do with software 
around suspend/resume rather than the drive, and this error about the drive is 
a bit of a red herring, potentially to do more with reading the hibernation 
status from the drive than an issue with the drive itself... Given the age, and 
that there are no other noticeable problems with the drive. Sounds like 
configuration to me, and some updated configuration file may have combined 
poorly with preexisting config values.

Are you on a systemd distro? If so, can you check out journalctl from around 
when it happens? I've run into misconfiguration stuff with resuming from 
encrypted swap and journalctl usually gave helpful feedback.

https://wiki.archlinux.org/title/Power_management/Suspend_and_hibernate#System_freezes_for_60_seconds_and_then_wakes_back_up_or_hangs_after_waking_up

This, for example, sounds strikingly similar to the symptoms you describe, 
though doesn't mention any drive issues. But journalctl will let you know 
quickly if that's what's going on.

Sounds frustrating, always hate issues with suspend/resume, felt like the the 
that most should just work, and like magic when it does!

- Sara (she/her <https://pronoun.is/she/>)

----- Original message -----
De: Brian May via luv-main <[email protected]>
Para: Rodney Brown <[email protected]>, [email protected]
Assunto: Re: suspend/resume buffer IO/Error on root filesystem
Data: Segunda, 16 de setembro de 2024 15:18

Rodney Brown via luv-main <[email protected]> writes:

> While I'm not sure what to do with it, does the smartctl  output show 
> anything?  The error reports may help.
>
> smartctl controls the Self-Monitoring, Analysis and Reporting 
> Technology  (SMART)  system  built into most ATA/SATA and SCSI/SAS hard 
> drives and solid-state drives.

Output from smartctl looks OK to me ....


=== cut ===
$ smartctl -a /dev/nvme0
smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.8.12] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:                       WD PC SN810 SDCQNRZ-2T00-1201
Serial Number:                      23138Q440804
Firmware Version:                   61540101
PCI Vendor/Subsystem ID:            0x15b7
IEEE OUI Identifier:                0x001b44
Total NVM Capacity:                 2,048,408,248,320 [2.04 TB]
Unallocated NVM Capacity:           0
Controller ID:                      8224
NVMe Version:                       1.4
Number of Namespaces:               1
Namespace 1 Size/Capacity:          2,048,408,248,320 [2.04 TB]
Namespace 1 Formatted LBA Size:     512
Namespace 1 IEEE EUI-64:            001b44 4a48a7ec37
Local Time is:                      Mon Sep 16 15:16:07 2024 AEST
Firmware Updates (0x14):            2 Slots, no Reset required
Optional Admin Commands (0x0017):   Security Format Frmw_DL Self_Test
Optional NVM Commands (0x00df):     Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat 
Timestmp Verify
Log Page Attributes (0x1e):         Cmd_Eff_Lg Ext_Get_Lg Telmtry_Lg Pers_Ev_Lg
Maximum Data Transfer Size:         128 Pages
Warning  Comp. Temp. Threshold:     84 Celsius
Critical Comp. Temp. Threshold:     88 Celsius
Namespace 1 Features (0x02):        NA_Fields

Supported Power States
St Op     Max   Active     Idle   RL RT WL WT  Ent_Lat  Ex_Lat
0 +     8.00W    8.00W       -    0  0  0  0        0       0
1 +     3.50W    3.50W       -    0  0  0  0        0       0
2 +     2.60W    2.60W       -    0  0  0  0        0       0
3 -   0.0250W       -        -    3  3  3  3     5000   10000
4 -   0.0035W       -        -    4  4  4  4     3900   45700

Supported LBA Sizes (NSID 0x1)
Id Fmt  Data  Metadt  Rel_Perf
0 +     512       0         2
1 -    4096       0         1

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

SMART/Health Information (NVMe Log 0x02)
Critical Warning:                   0x00
Temperature:                        24 Celsius
Available Spare:                    100%
Available Spare Threshold:          10%
Percentage Used:                    0%
Data Units Read:                    14,283,787 [7.31 TB]
Data Units Written:                 6,089,159 [3.11 TB]
Host Read Commands:                 65,743,172
Host Write Commands:                121,552,028
Controller Busy Time:               499
Power Cycles:                       290
Power On Hours:                     133
Unsafe Shutdowns:                   101
Media and Data Integrity Errors:    0
Error Information Log Entries:      0
Warning  Comp. Temperature Time:    0
Critical Comp. Temperature Time:    0

Error Information (NVMe Log 0x01, 16 of 256 entries)
No Errors Logged

Self-test Log (NVMe Log 0x06)
Self-test status: No self-test in progress
No Self-tests Logged
=== cut ===

-- 
Brian May @ Linux Penguins
_______________________________________________
luv-main mailing list -- [email protected]
To unsubscribe send an email to [email protected]
_______________________________________________
luv-main mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to