Re: [Regression 4.13-rc1] Resume does not work on Lenovo X60t

2017-08-29 Thread Christoph Hellwig
On Tue, Aug 29, 2017 at 11:40:53AM +0200, Paul Menzel wrote:
> It looks like I was using blk-mq, as it was the default up to commit
> cbe7dfa26eee (Revert "scsi: default to scsi-mq"). So with Linux 4.13-rc7 and
> disabling blk-mq for SCSI, the system is functional again after resume.

Thanks for the confirmation Paul - I'll keep you on Cc when
implementing blќ runtime pm to make sure this issue doesn't reappear.


Re: [Regression 4.13-rc1] Resume does not work on Lenovo X60t

2017-08-29 Thread Paul Menzel

Dear Christoph,


On 08/23/17 14:15, Paul Menzel wrote:


On 08/23/17 13:48, Christoph Hellwig wrote:

Are you running with blk-mq enabled?  Also this never
occurred with 4.12, right?  Were you also running with or
without blk-mq for scsi there?


To my knowledge, I am using the defaults from Debian 9. I’ll check in 
one week, as I am away from the system.


It looks like I was using blk-mq, as it was the default up to commit 
cbe7dfa26eee (Revert "scsi: default to scsi-mq"). So with Linux 4.13-rc7 
and disabling blk-mq for SCSI, the system is functional again after resume.



Kind regards,

Paul


Re: [Regression 4.13-rc1] Resume does not work on Lenovo X60t

2017-08-23 Thread Paul Menzel

Dear Christoph,


On 08/23/17 13:48, Christoph Hellwig wrote:

Are you running with blk-mq enabled?  Also this never
occured with 4.12, right?  Were you also running with or
without blk-mq for scsi there?


To my knowledge, I am using the defaults from Debian 9. I’ll check in 
one week, as I am away from the system.



Kind regards,

Paul


Re: [Regression 4.13-rc1] Resume does not work on Lenovo X60t

2017-08-23 Thread Christoph Hellwig
Are you running with blk-mq enabled?  Also this never
occured with 4.12, right?  Were you also running with or
without blk-mq for scsi there?


Re: [Regression 4.13-rc1] Resume does not work on Lenovo X60t

2017-08-21 Thread Paul Menzel

Dear Christoph,


On 2017-08-21 20:41, Christoph Hellwig wrote:


with 4.13-rc6 we're not using blk-mq by default any more, do you
still see the issue with that one?


Yes, I do see it this commit 6470812e2226 (Merge
git://git.kernel.org/pub/scm/linux/kernel/git/davem/sparc).

```
00.831: [  575.945132] BUG: unable to handle kernel NULL pointer 
dereference at 00f4

00.830: [  575.948009] IP: blk_set_runtime_active+0x27/0x60
00.830: [  575.948009] *pde = 
00.831: [  575.948009]
00.831: [  575.948009] Oops: 0002 [#1] SMP
00.831: [  575.948009] Modules linked in: joydev wacom_w8001 serport 
cpufreq_powersave cpufreq_conservative cpufreq_userspace binfmt_misc 
iTCO_wdt iTCO_vendor_support arc4 coretemp snd_hda_codec_analog 
snd_hda_codec_generic iwl3945 snd_hda_intel pcmcia iwlegacy 
snd_hda_codec kvm mac80211 snd_hda_core irqbypass yenta_socket snd_pcsp 
lpc_ich snd_hwdep thinkpad_acpi pcmcia_rsrc mfd_core serio_raw snd_pcm 
sg pcmcia_core nvram cfg80211 snd_timer rng_core snd rfkill battery 
soundcore shpchp evdev ac acpi_cpufreq parport_pc ppdev lp parport 
ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 fscrypto ecb cbc 
algif_skcipher af_alg dm_crypt dm_mod sr_mod cdrom sd_mod ata_generic 
ahci libahci sdhci_pci firewire_ohci ata_piix sdhci firewire_core libata 
e1000e i2c_i801 psmouse mmc_core crc_itu_t ptp scsi_mod i915 pps_core
00.831: [  575.948009]  ehci_pci video button uhci_hcd i2c_algo_bit 
ehci_hcd drm_kms_helper thermal usbcore syscopyarea sysfillrect 
sysimgblt fb_sys_fops drm
00.831: [  575.948009] CPU: 0 PID: 1126 Comm: kworker/u4:36 Not tainted 
4.13.0-rc6+ #110
00.831: [  575.948009] Hardware name: LENOVO 636338U/636338U, BIOS 
CBET4000 TIMELESS 01/01/1970

00.831: [  575.948009] Workqueue: events_unbound async_run_entry_fn
00.831: [  575.948009] task: f2ed8bc0 task.stack: f2ecc000
00.831: [  575.948009] EIP: blk_set_runtime_active+0x27/0x60
00.831: [  575.948009] EFLAGS: 00010046 CPU: 0
00.831: [  575.948009] EAX:  EBX: f5f3f820 ECX: f5f3f918 EDX: 
00010d7b
00.831: [  575.948009] ESI: f8ac3cc0 EDI: 0010 EBP: 0010 ESP: 
f2ecdea4

00.831: [  575.948009]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
00.831: [  575.948009] CR0: 80050033 CR2: 00f4 CR3: 0e3a9000 CR4: 
06d0

00.831: [  575.948009] Call Trace:
00.831: [  575.948009]  ? scsi_bus_resume_common+0x6e/0x110 [scsi_mod]
00.831: [  575.948009]  ? dpm_run_callback+0x4f/0x150
00.831: [  575.948009]  ? wait_for_completion+0x29/0x140
00.831: [  575.948009]  ? scsi_bus_thaw+0x10/0x10 [scsi_mod]
00.831: [  575.948009]  ? device_resume+0x8e/0x180
00.831: [  575.948009]  ? async_resume+0x1b/0x40
00.831: [  575.948009]  ? async_run_entry_fn+0x3f/0x1a0
00.831: [  575.948009]  ? process_one_work+0x136/0x310
00.831: [  575.948009]  ? worker_thread+0x39/0x3b0
00.831: [  575.948009]  ? kthread+0xd7/0x110
00.831: [  575.948009]  ? process_one_work+0x310/0x310
00.831: [  575.948009]  ? kthread_create_on_node+0x30/0x30
00.831: [  575.948009]  ? ret_from_fork+0x19/0x24
00.831: [  575.948009] Code: 8d 74 26 00 3e 8d 74 26 00 53 89 c3 8b 80 
fc 00 00 00 e8 2d 48 32 00 31 c0 8b 15 20 9e 24 ce 89 83 54 01 00 00 8b 
83 50 01 00 00 <89> 90 f4 00 00 00 ba 09 00 00 00 8b 83 50 01 00 00 e8 
f3 f2 16
00.831: [  575.948009] EIP: blk_set_runtime_active+0x27/0x60 SS:ESP: 
0068:f2ecdea4

00.831: [  575.948009] CR2: 00f4
00.831: [  575.948009] ---[ end trace b3f1ac10115418ab ]---
00.831: [  576.195662] pciehp :00:1c.0:pcie004: Timeout on hotplug 
command 0x1038 (issued 574920 msec ago)
00.831: [  576.204847] pciehp :00:1c.0:pcie004: Device :01:00.0 
already exists at :01:00, cannot hot-add
00.832: [  576.214460] pciehp :00:1c.0:pcie004: Cannot add device at 
:01:00
00.834: [  576.223117] atkbd serio0: Spurious ACK on isa0060/serio0. 
Some program might be trying to access hardware directly.

00.834: [  576.233968] ata1.00: configured for UDMA/33
00.927: [  576.328159] pciehp :00:1c.0:pcie004: Device :01:00.0 
already exists at :01:00, cannot hot-add
00.929: [  576.340348] pciehp :00:1c.0:pcie004: Cannot add device at 
:01:00
01.002: [  576.420139] usb 5-6: reset high-speed USB device number 2 
using ehci-pci
01.372: [  576.796072] firewire_core :05:00.1: rediscovered device 
fw0
03.010: [  578.440083] ata3: SATA link up 1.5 Gbps (SStatus 113 SControl 
300)

05.274: [  580.710027] ata3.00: ATA Identify Device Log not supported
05.276: [  580.718136] ata3.00: Security Log not supported
05.279: [  580.725856] ata3.00: ATA Identify Device Log not supported
05.282: [  580.733887] ata3.00: Security Log not supported
05.284: [  580.740838] ata3.00: configured for UDMA/100
```


Kind regards,

Paul


Re: [Regression 4.13-rc1] Resume does not work on Lenovo X60t

2017-08-21 Thread Christoph Hellwig
Hi Paul,

with 4.13-rc6 we're not using blk-mq by default any more, do you
still see the issue with that one?


Re: [Regression 4.13-rc1] Resume does not work on Lenovo X60t

2017-08-18 Thread Paul Menzel

Dear Christoph,


On 08/06/17 20:06, Paul Menzel wrote:


On 2017-08-05 11:30, Christoph Hellwig wrote:

On Thu, Aug 03, 2017 at 07:42:15PM +0200, Paul Menzel wrote:


Since the merge windows opened for Linux 4.13, I am unable to resume 
from ACPI S3 suspend on a Lenovo X60t. The graphics comes back, but I am 
unable to enter anything, and the system seems to be hung. Magic SysRq keys 
still work though, but powering the system of doesn’t work. The power 
button also does not work.


Please find the stack trace with Linux 4.13-rc3 captured over the serial
console below.


Is this really -rc3?  rc3 has a commit to disable block runtime pm
for blk-mq, which is now the default for scsi.  So with -rc1 we've
seen similar reports, but rc3 would be odd and suggest we have further
problems.


Yes, this was 4.13-rc3. Rebuilding the Linux kernel from commit 0fdd951c 
(Merge tag 'media/v4.13-2' of 
git://git.kernel.org/pub/scm/linux/kernel/git/mchehab/linux-media) shows 
the same behavior.


Just an update, that this is still present in Linux 4.13-rc5+, that 
means commit 04d49f3638d0 (Merge tag 'drm-fixes-for-v4.13-rc6' of 
git://people.freedesktop.org/~airlied/linux).



Kind regards,

Paul


Re: [Regression 4.13-rc1] Resume does not work on Lenovo X60t

2017-08-06 Thread Paul Menzel

Dear Christoph,


On 2017-08-05 11:30, Christoph Hellwig wrote:

On Thu, Aug 03, 2017 at 07:42:15PM +0200, Paul Menzel wrote:


Since the merge windows opened for Linux 4.13, I am unable to resume 
from
ACPI S3 suspend on a Lenovo X60t. The graphics comes back, but I am 
unable
to enter anything, and the system seems to be hung. Magic SysRq keys 
still
work though, but powering the system of doesn’t work. The power button 
also

does not work.

Please find the stack trace with Linux 4.13-rc3 captured over the 
serial

console below.


Is this really -rc3?  rc3 has a commit to disable block runtime pm
for blk-mq, which is now the default for scsi.  So with -rc1 we've
seen similar reports, but rc3 would be odd and suggest we have further
problems.


Yes, this was 4.13-rc3. Rebuilding the Linux kernel from commit 0fdd951c 
(Merge tag 'media/v4.13-2' of 
git://git.kernel.org/pub/scm/linux/kernel/git/mchehab/linux-media) shows 
the same behavior.



Kind regards,

Paul



Re: [Regression 4.13-rc1] Resume does not work on Lenovo X60t

2017-08-05 Thread Christoph Hellwig
On Thu, Aug 03, 2017 at 07:42:15PM +0200, Paul Menzel wrote:
> Dear Linux folks,
> 
> 
> Since the merge windows opened for Linux 4.13, I am unable to resume from
> ACPI S3 suspend on a Lenovo X60t. The graphics comes back, but I am unable
> to enter anything, and the system seems to be hung. Magic SysRq keys still
> work though, but powering the system of doesn’t work. The power button also
> does not work.
> 
> Please find the stack trace with Linux 4.13-rc3 captured over the serial
> console below.

Is this really -rc3?  rc3 has a commit to disable block runtime pm
for blk-mq, which is now the default for scsi.  So with -rc1 we've
seen similar reports, but rc3 would be odd and suggest we have further
problems.