Bug#910069: linux-image-4.18.0-1-amd64: NVME quirk needed for Intel 760p + Thinkpad T480

2019-02-24 Thread Chris Chiappa
Doh, I had checked for firmware when I first ran into the problem and
never checked again.  Indeed, having upgraded the firmware, the
command line workaround no longer seems necessary, feel free to close
the bug.  Thanks!

On Sun, Feb 10, 2019 at 04:36:09PM +, Ben Hutchings wrote:
> Control: tag -1 moreinfo
> 
> On Tue, 02 Oct 2018 05:44:23 -0400 Chris Chiappa  
> wrote:
> [...]
> > # nvme id-ctrl /dev/nvme0 
> > NVME Identify Controller:
> > vid   : 0x8086
> > ssvid : 0x8086
> > sn: BTHH82260487512D
> > mn: INTEL SSDPEKKW512G8 
> > fr: 001C
> [...]
> 
> According to the release notes for the Intel SSD Firmware Update Tool
> 
> there is a firmware update (version 004C) and the description suggests
> it may address this problem.
> 
> Please upgrade the firmware (but make sure you have a backup of the
> drive) and check whether that fixes the problem.
> 
> Ben.
> 
> -- 
> Ben Hutchings
> The world is coming to an end.Please log off.
> 



Bug#910069: linux-image-4.18.0-1-amd64: NVME quirk needed for Intel 760p + Thinkpad T480

2019-02-10 Thread Ben Hutchings
Control: tag -1 moreinfo

On Tue, 02 Oct 2018 05:44:23 -0400 Chris Chiappa  
wrote:
[...]
> # nvme id-ctrl /dev/nvme0 
> NVME Identify Controller:
> vid   : 0x8086
> ssvid : 0x8086
> sn: BTHH82260487512D
> mn: INTEL SSDPEKKW512G8 
> fr: 001C
[...]

According to the release notes for the Intel SSD Firmware Update Tool

there is a firmware update (version 004C) and the description suggests
it may address this problem.

Please upgrade the firmware (but make sure you have a backup of the
drive) and check whether that fixes the problem.

Ben.

-- 
Ben Hutchings
The world is coming to an end.  Please log off.



signature.asc
Description: This is a digitally signed message part


Bug#910069: linux-image-4.18.0-1-amd64: NVME quirk needed for Intel 760p + Thinkpad T480

2018-10-02 Thread Chris Chiappa
Package: src:linux
Version: 4.18.8-1
Severity: normal

T480 with a SATA SSD works fine out of box.  Replacing the SATA SSD
with an Intel 760p, the latest Buster installer gets the system
installed fine, but it is extremely unstable after rebooting into the
new installation.  There's a "dmesg" output I was able to get from one
such hang.  When the drive hangs, on reboot the BIOS is not even able
to see it.

Googling turns up nothing about this particular combination, but there
are other (presumably more popular) NVME SSDs that exhibit similar
problems:

https://askubuntu.com/questions/905710/ext4-fs-error-after-ubuntu-17-04-upgrade/906105#906105
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1678184
https://lkml.org/lkml/2017/5/24/941

I've confirmed that with the kernel flag
nvme_core.default_ps_max_latency_us=5500 the machine is stable.

It seems like there are some SSDs that always need
NVME_QUIRK_NO_DEEPEST_PS and there are some for which it's only a
specific combination of SSD and machine.  I don't have any other NVME
capable machines here so don't know if I can determine that.  Happy to
do more debugging if there's anything I can provide.  As far as I can
tell, there is no firmware update for this drive available.

# nvme list
Node SN   Model
Namespace Usage  Format   FW Rev  
   
- --  
/dev/nvme0n1 BTHH82260487512D INTEL SSDPEKKW512G8  
1 512.11  GB / 512.11  GB512   B +  0 B   001C

# nvme id-ctrl /dev/nvme0 
NVME Identify Controller:
vid   : 0x8086
ssvid : 0x8086
sn: BTHH82260487512D
mn: INTEL SSDPEKKW512G8 
fr: 001C
rab   : 6
ieee  : 5cd2e4
cmic  : 0
mdts  : 6
cntlid: 1
ver   : 10300
rtd3r : 7a120
rtd3e : 1e8480
oaes  : 0x200
ctratt: 0
rrls  : 0
oacs  : 0x17
acl   : 4
aerl  : 7
frmw  : 0x14
lpa   : 0xf
elpe  : 255
npss  : 4
avscc : 0
apsta : 0x1
wctemp: 348
cctemp: 353
mtfa  : 50
hmpre : 0
hmmin : 0
tnvmcap   : 0
unvmcap   : 0
rpmbs : 0
edstt : 5
dsto  : 1
fwug  : 0
kas   : 0
hctma : 0x1
mntmt : 303
mxtmt : 348
sanicap   : 0x3
hmminds   : 0
hmmaxd: 0
nsetidmax : 0
sqes  : 0x66
cqes  : 0x44
maxcmd: 0
nn: 1
oncs  : 0x5f
fuses : 0
fna   : 0x4
vwc   : 0x1
awun  : 0
awupf : 0
nvscc : 0
acwu  : 0
sgls  : 0
subnqn: 
ioccsz: 0
iorcsz: 0
icdoff: 0
ctrattr   : 0
msdbd : 0
ps0 : mp:9.00W operational enlat:0 exlat:0 rrt:0 rrl:0
  rwt:0 rwl:0 idle_power:- active_power:-
ps1 : mp:4.60W operational enlat:0 exlat:0 rrt:1 rrl:1
  rwt:1 rwl:1 idle_power:- active_power:-
ps2 : mp:3.80W operational enlat:0 exlat:0 rrt:2 rrl:2
  rwt:2 rwl:2 idle_power:- active_power:-
ps3 : mp:0.0450W non-operational enlat:2000 exlat:2000 rrt:3 rrl:3
  rwt:3 rwl:3 idle_power:- active_power:-
ps4 : mp:0.0040W non-operational enlat:6000 exlat:8000 rrt:4 rrl:4
  rwt:4 rwl:4 idle_power:- active_power:-

# smartctl -a /dev/nvme0
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.18.0-1-amd64] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:   INTEL SSDPEKKW512G8
Serial Number:  BTHH82260487512D
Firmware Version:   001C
PCI Vendor/Subsystem ID:0x8086
IEEE OUI Identifier:0x5cd2e4
Controller ID:  1
Number of Namespaces:   1
Namespace 1 Size/Capacity:  512,110,190,592 [512 GB]
Namespace 1 Formatted LBA Size: 512
Local Time is:  Tue Oct  2 05:41:32 2018 EDT
Firmware Updates (0x14):2 Slots, no Reset required
Optional Admin Commands (0x0017):   Security Format Frmw_DL *Other*
Optional NVM Commands (0x005f): Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat 
*Other*
Maximum Data Transfer Size: 64 Pages
Warning  Comp. Temp. Threshold: 75 Celsius
Critical Comp. Temp. Threshold: 80 Celsius

Supported Power States
St Op Max   Active Idle   RL RT WL WT  Ent_Lat  Ex_Lat
 0 + 9.00W   --0  0  0  00   0
 1 + 4.60W   --1  1  1  10   0
 2 + 3.80W   --2  2  2  20   0
 3 -   0.0450W   --3  3  3  3 20002000
 4 -   0.0040W   --4  4  4  4 60008000

Supported LBA Sizes (NSID 0x1)
Id Fmt  Data  Metadt  Rel_Perf
 0 + 512   0 0

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

SMART/Health Information (NVMe