Bug#639331: linux-image-2.6.36-rc6-686-bigmem: Closing laptop lid hangs the system on Dell studio 1555

2011-09-12 Thread Thibault Manlay
On 12/09/11 21:36, Jonathan Nieder wrote:
 That's the theory, but if Thibault has to use the power button to
 reboot to recover, then something's going wrong, no?
 
 Anyway, it seems it's time to take this upstream.  Thibault, could you
 send a summary of findings so far (i.e., what the state is like when
 closing the lid, problem only shows up with CONFIG_LOCKUP_DETECTOR=y,
 the nolapic kernel parameter works around it, whatever logs it was
 possible to get of what happens until the lid closes, and an acpidump)
 to the linux-a...@vger.kernel.org list (no subscription needed, since
 the convention is to always reply-to-all), cc-ing this bug log?
 
 Sorry for the trouble, and good luck.
 

Hello,

I tried to get a kernel dump with kdump/kexec to have something I could
work on, but no success. The kernel hangs and doesn't reboot on the
crash kernel.
Disabling SMP solves the problem too (it is required for kdump, but SMP
can be enabled if you tell kdump to use one cpu), I feel a little bit
discouraged.

Since I can't find any pertinent similar problem (they are either too
old or with no answer, or both) I begin to believe this is a hardware
problem on my laptop. I'll try to patch the BIOS someday.

Fun fact : closing lid hangs the system, but after I have done a single
suspend-to-ram, I can close the lid as much as I want without getting
the system to hang...


Thanks,

-- 
Thibault Manlay



signature.asc
Description: OpenPGP digital signature


Bug#639331: linux-image-2.6.36-rc6-686-bigmem: Closing laptop lid hangs the system on Dell studio 1555

2011-08-31 Thread Thibault Manlay
On 29/08/11 04:44, Jonathan Nieder wrote:
 Nice.  Have you found which item in the .config caused it?

Found it.

Issue comes from CONFIG_LOCKUP_DETECTOR=y (issue disappears when
unsetting it).

But why would this option/feature locks-up the system?


-- 
Thibault Manlay



signature.asc
Description: OpenPGP digital signature


Bug#639331: linux-image-2.6.36-rc6-686-bigmem: Closing laptop lid hangs the system on Dell studio 1555

2011-08-29 Thread Thibault Manlay
On 29/08/11 04:44, Jonathan Nieder wrote:
 Ah, that must have been painful. I should have included confirming that
 v2.6.36-rc6 is broken in the list of steps; sorry. :/

Np

 Nice.  Have you found which item in the .config caused it?

No, not yet.
I should report back this week or next week.


-- 
Thibault Manlay



signature.asc
Description: OpenPGP digital signature


Bug#639331: linux-image-2.6.36-rc6-686-bigmem: Closing laptop lid hangs the system on Dell studio 1555

2011-08-26 Thread Thibault Manlay
On 26/08/11 17:41, Jonathan Nieder wrote:

 Thanks!

Ok, finished bisecting v2.6.35 to v2.6.36-rc6 and found nothing. After
all maybe it is not a bug in the kernel code.

This may come from the kernel configuration then.

Well, I narrowed the search space : issue comes between
linux-image-2.6.36-rc5-686-bigmem (2.6.36~rc5-1~experimental.1) (which
works) and linux-image-2.6.36-rc6-686-bigmem
(2.6.36~rc6-1~experimental.1) (which doesn't work).

I compiled rc6 from source with Debian package kernel configuration and
got it crashing.

I join a diff file between 2.6.36-rc5 and rc6, but I did not find
anything relevant.

Using the rc5 kernel configuration from Debian experimental package for
use with source kernel (checkout on v2.6.36-rc6) produces no crash. I
suppose something's wrong in kernel configuration.

So to sum up:
 - 2.6.36-rc6 (git) + minimal configuration is ok;
 - 2.6.36-rc6 (git) + Debian 2.6.36-rc5-686-bigmem config is ok;
 - 2.6.36-rc6 (git) + Debian 2.6.36-rc6-686-bigmem config is crashing
when closing lid.

I'll look around in that direction..


Thanks anyway :)

-- 
Thibault Manlay
--- /boot/config-2.6.36-rc5-686-bigmem	2010-09-25 18:16:20.0 +0200
+++ /boot/config-2.6.36-rc6-686-bigmem	2010-10-04 13:52:52.0 +0200
@@ -1,7 +1,7 @@
 #
 # Automatically generated make config: don't edit
-# Linux kernel version: 2.6.36-rc5
-# Sat Sep 25 13:43:12 2010
+# Linux kernel version: 2.6.36-rc6
+# Mon Oct  4 09:18:53 2010
 #
 # CONFIG_64BIT is not set
 CONFIG_X86_32=y
@@ -142,7 +142,7 @@
 CONFIG_UID16=y
 CONFIG_SYSCTL_SYSCALL=y
 CONFIG_KALLSYMS=y
-CONFIG_KALLSYMS_ALL=y
+# CONFIG_KALLSYMS_ALL is not set
 # CONFIG_KALLSYMS_EXTRA_PASS is not set
 CONFIG_HOTPLUG=y
 CONFIG_PRINTK=y
@@ -747,14 +747,14 @@
 #
 # Xtables targets
 #
-# CONFIG_NETFILTER_XT_TARGET_CHECKSUM is not set
+CONFIG_NETFILTER_XT_TARGET_CHECKSUM=m
 CONFIG_NETFILTER_XT_TARGET_CLASSIFY=m
 CONFIG_NETFILTER_XT_TARGET_CONNMARK=m
 CONFIG_NETFILTER_XT_TARGET_CONNSECMARK=m
 CONFIG_NETFILTER_XT_TARGET_CT=m
 CONFIG_NETFILTER_XT_TARGET_DSCP=m
 CONFIG_NETFILTER_XT_TARGET_HL=m
-# CONFIG_NETFILTER_XT_TARGET_IDLETIMER is not set
+CONFIG_NETFILTER_XT_TARGET_IDLETIMER=m
 CONFIG_NETFILTER_XT_TARGET_LED=m
 CONFIG_NETFILTER_XT_TARGET_MARK=m
 CONFIG_NETFILTER_XT_TARGET_NFLOG=m
@@ -777,7 +777,7 @@
 CONFIG_NETFILTER_XT_MATCH_CONNLIMIT=m
 CONFIG_NETFILTER_XT_MATCH_CONNMARK=m
 CONFIG_NETFILTER_XT_MATCH_CONNTRACK=m
-# CONFIG_NETFILTER_XT_MATCH_CPU is not set
+CONFIG_NETFILTER_XT_MATCH_CPU=m
 CONFIG_NETFILTER_XT_MATCH_DCCP=m
 CONFIG_NETFILTER_XT_MATCH_DSCP=m
 CONFIG_NETFILTER_XT_MATCH_ESP=m
@@ -785,7 +785,7 @@
 CONFIG_NETFILTER_XT_MATCH_HELPER=m
 CONFIG_NETFILTER_XT_MATCH_HL=m
 CONFIG_NETFILTER_XT_MATCH_IPRANGE=m
-# CONFIG_NETFILTER_XT_MATCH_IPVS is not set
+CONFIG_NETFILTER_XT_MATCH_IPVS=m
 CONFIG_NETFILTER_XT_MATCH_LENGTH=m
 CONFIG_NETFILTER_XT_MATCH_LIMIT=m
 CONFIG_NETFILTER_XT_MATCH_MAC=m
@@ -1120,7 +1120,7 @@
 # CAN USB interfaces
 #
 CONFIG_CAN_EMS_USB=m
-# CONFIG_CAN_ESD_USB2 is not set
+CONFIG_CAN_ESD_USB2=m
 # CONFIG_CAN_DEBUG_DEVICES is not set
 CONFIG_IRDA=m
 
@@ -1198,7 +1198,7 @@
 CONFIG_BT_HCIUART=m
 CONFIG_BT_HCIUART_H4=y
 CONFIG_BT_HCIUART_BCSP=y
-# CONFIG_BT_HCIUART_ATH3K is not set
+CONFIG_BT_HCIUART_ATH3K=y
 CONFIG_BT_HCIUART_LL=y
 CONFIG_BT_HCIBCM203X=m
 CONFIG_BT_HCIBPA10X=m
@@ -1504,7 +1504,7 @@
 # CONFIG_BLK_DEV_HD is not set
 CONFIG_MISC_DEVICES=y
 CONFIG_AD525X_DPOT=m
-# CONFIG_AD525X_DPOT_I2C is not set
+CONFIG_AD525X_DPOT_I2C=m
 # CONFIG_AD525X_DPOT_SPI is not set
 CONFIG_IBM_ASM=m
 CONFIG_PHANTOM=m
@@ -1929,7 +1929,7 @@
 CONFIG_FIREWIRE_SBP2=m
 CONFIG_FIREWIRE_NET=m
 # CONFIG_IEEE1394 is not set
-# CONFIG_FIREWIRE_NOSY is not set
+CONFIG_FIREWIRE_NOSY=m
 CONFIG_I2O=m
 CONFIG_I2O_LCT_NOTIFY_ON_CHANGES=y
 CONFIG_I2O_EXT_ADAPTEC=y
@@ -2120,7 +2120,7 @@
 CONFIG_CHELSIO_T4_DEPENDS=y
 CONFIG_CHELSIO_T4=m
 CONFIG_CHELSIO_T4VF_DEPENDS=y
-# CONFIG_CHELSIO_T4VF is not set
+CONFIG_CHELSIO_T4VF=m
 CONFIG_ENIC=m
 CONFIG_IXGBE=m
 CONFIG_IXGBE_DCA=y
@@ -2251,7 +2251,7 @@
 CONFIG_NORTEL_HERMES=m
 CONFIG_PCMCIA_HERMES=m
 CONFIG_PCMCIA_SPECTRUM=m
-# CONFIG_ORINOCO_USB is not set
+CONFIG_ORINOCO_USB=m
 CONFIG_P54_COMMON=m
 CONFIG_P54_USB=m
 CONFIG_P54_PCI=m
@@ -2285,8 +2285,8 @@
 CONFIG_WL1251_SPI=m
 CONFIG_WL1251_SDIO=m
 CONFIG_WL1271=m
-# CONFIG_WL1271_SPI is not set
-# CONFIG_WL1271_SDIO is not set
+CONFIG_WL1271_SPI=m
+CONFIG_WL1271_SDIO=m
 CONFIG_ZD1211RW=m
 # CONFIG_ZD1211RW_DEBUG is not set
 
@@ -2738,7 +2738,7 @@
 #
 CONFIG_SERIAL_MAX3100=m
 # CONFIG_SERIAL_MAX3107 is not set
-# CONFIG_SERIAL_MFD_HSU is not set
+CONFIG_SERIAL_MFD_HSU=m
 CONFIG_SERIAL_CORE=y
 CONFIG_SERIAL_CORE_CONSOLE=y
 CONFIG_SERIAL_JSM=m
@@ -2798,7 +2798,7 @@
 CONFIG_TCG_INFINEON=m
 CONFIG_TELCLOCK=m
 CONFIG_DEVPORT=y
-# CONFIG_RAMOOPS is not set
+CONFIG_RAMOOPS=m
 CONFIG_I2C=m
 CONFIG_I2C_BOARDINFO=y
 CONFIG_I2C_COMPAT=y
@@ -2897,7 +2897,7 @@
 # PPS clients support
 #
 # CONFIG_PPS_CLIENT_KTIMER is not set
-# CONFIG_PPS_CLIENT_LDISC is not set

Bug#639331: linux-image-2.6.36-rc6-686-bigmem: Closing laptop lid hangs the system on Dell studio 1555

2011-08-25 Thread Thibault Manlay
Package: linux-2.6
Version: 2.6.36~rc6-1~experimental.1
Severity: important
Tags: upstream


Hello,

I have a problem occurring for some time now, my system hangs each time I close 
the lid of my laptop (Dell studio 1555) whether I'm on a tty or in a X session, 
and the only solution is to restart manually the system (power button). This 
also happens in single boot mode.
I found a workaround to prevent the system hanging, which is to boot with the 
'nolapic' kernel option. I assume this issue would be APIC-based. The option 
disables others CPU cores which is a bit problematic.

I'm running Debian testing.

This problem was not present in the 2.6.32 version and appeared in the 2.6.38 
version. I had not much time to determine at which version this problem began 
to be present but now I have time I had a closer look.

I installed some versions of 'old' Debian kernel packages I found on 
http://snapshot.debian.org/ and I determined that the issue appeared between 
linux-image-2.6.35-trunk-686-bigmem (2.6.35-1~experimental.3) and 
linux-image-2.6.36-rc6-686-bigmem (2.6.36~rc6-1~experimental.1). Everything 
works fine in 2.6.35-trunk and issue in 2.6.36-rc6.

I checked the configurations difference but found nothing pertinent.

Hope this will help.


Thanks in advance,

-- 
Thibault Manlay


-- Configuration diff:
--- /boot/config-2.6.35-trunk-686-bigmem2010-09-06 21:19:30.0 
+0200
+++ /boot/config-2.6.36-rc6-686-bigmem  2010-10-04 13:52:52.0 +0200
@@ -1,7 +1,7 @@
 #
 # Automatically generated make config: don't edit
-# Linux kernel version: 2.6.35
-# Mon Sep  6 16:49:05 2010
+# Linux kernel version: 2.6.36-rc6
+# Mon Oct  4 09:18:53 2010
 #
 # CONFIG_64BIT is not set
 CONFIG_X86_32=y
@@ -10,7 +10,6 @@
 CONFIG_INSTRUCTION_DECODER=y
 CONFIG_OUTPUT_FORMAT=elf32-i386
 CONFIG_ARCH_DEFCONFIG=arch/x86/configs/i386_defconfig
-CONFIG_GENERIC_TIME=y
 CONFIG_GENERIC_CMOS_UPDATE=y
 CONFIG_CLOCKSOURCE_WATCHDOG=y
 CONFIG_GENERIC_CLOCKEVENTS=y
@@ -93,14 +92,13 @@
 CONFIG_TASK_IO_ACCOUNTING=y
 CONFIG_AUDIT=y
 CONFIG_AUDITSYSCALL=y
+CONFIG_AUDIT_WATCH=y
 CONFIG_AUDIT_TREE=y
 
 #
 # RCU Subsystem
 #
 CONFIG_TREE_RCU=y
-# CONFIG_TREE_PREEMPT_RCU is not set
-# CONFIG_TINY_RCU is not set
 # CONFIG_RCU_TRACE is not set
 CONFIG_RCU_FANOUT=32
 # CONFIG_RCU_FANOUT_EXACT is not set
@@ -144,7 +142,7 @@
 CONFIG_UID16=y
 CONFIG_SYSCTL_SYSCALL=y
 CONFIG_KALLSYMS=y
-CONFIG_KALLSYMS_ALL=y
+# CONFIG_KALLSYMS_ALL is not set
 # CONFIG_KALLSYMS_EXTRA_PASS is not set
 CONFIG_HOTPLUG=y
 CONFIG_PRINTK=y
@@ -173,7 +171,6 @@
 # CONFIG_COMPAT_BRK is not set
 # CONFIG_SLAB is not set
 CONFIG_SLUB=y
-# CONFIG_SLOB is not set
 CONFIG_PROFILING=y
 CONFIG_TRACEPOINTS=y
 CONFIG_OPROFILE=m
@@ -195,13 +192,12 @@
 CONFIG_HAVE_HW_BREAKPOINT=y
 CONFIG_HAVE_MIXED_BREAKPOINTS_REGS=y
 CONFIG_HAVE_USER_RETURN_NOTIFIER=y
+CONFIG_HAVE_PERF_EVENTS_NMI=y
 
 #
 # GCOV-based kernel profiling
 #
 # CONFIG_GCOV_KERNEL is not set
-CONFIG_SLOW_WORK=y
-# CONFIG_SLOW_WORK_DEBUG is not set
 CONFIG_HAVE_GENERIC_DMA_COHERENT=y
 CONFIG_SLABINFO=y
 CONFIG_RT_MUTEXES=y
@@ -278,6 +274,7 @@
 CONFIG_SCHED_OMIT_FRAME_POINTER=y
 CONFIG_PARAVIRT_GUEST=y
 CONFIG_XEN=y
+CONFIG_XEN_PVHVM=y
 CONFIG_XEN_MAX_DOMAIN_MEMORY=32
 CONFIG_XEN_SAVE_RESTORE=y
 # CONFIG_XEN_DEBUG_FS is not set
@@ -313,10 +310,8 @@
 # CONFIG_MCYRIXIII is not set
 # CONFIG_MVIAC3_2 is not set
 # CONFIG_MVIAC7 is not set
-# CONFIG_MPSC is not set
 # CONFIG_MCORE2 is not set
 # CONFIG_MATOM is not set
-# CONFIG_GENERIC_CPU is not set
 CONFIG_X86_GENERIC=y
 CONFIG_X86_CPU=y
 CONFIG_X86_INTERNODE_CACHE_SHIFT=6
@@ -386,7 +381,6 @@
 CONFIG_ILLEGAL_POINTER_VALUE=0
 CONFIG_SELECT_MEMORY_MODEL=y
 CONFIG_FLATMEM_MANUAL=y
-# CONFIG_DISCONTIGMEM_MANUAL is not set
 # CONFIG_SPARSEMEM_MANUAL is not set
 CONFIG_FLATMEM=y
 CONFIG_FLAT_NODE_MEM_MAP=y
@@ -461,6 +455,7 @@
 # CONFIG_ACPI_PROCFS_POWER is not set
 CONFIG_ACPI_POWER_METER=m
 CONFIG_ACPI_SYSFS_POWER=y
+# CONFIG_ACPI_EC_DEBUGFS is not set
 # CONFIG_ACPI_PROC_EVENT is not set
 CONFIG_ACPI_AC=m
 CONFIG_ACPI_BATTERY=m
@@ -483,6 +478,7 @@
 CONFIG_ACPI_APEI=y
 CONFIG_ACPI_APEI_GHES=m
 # CONFIG_ACPI_APEI_EINJ is not set
+# CONFIG_ACPI_APEI_ERST_DEBUG is not set
 CONFIG_SFI=y
 CONFIG_X86_APM_BOOT=y
 CONFIG_APM=m
@@ -501,7 +497,6 @@
 CONFIG_CPU_FREQ_STAT=m
 # CONFIG_CPU_FREQ_STAT_DETAILS is not set
 # CONFIG_CPU_FREQ_DEFAULT_GOV_PERFORMANCE is not set
-# CONFIG_CPU_FREQ_DEFAULT_GOV_POWERSAVE is not set
 # CONFIG_CPU_FREQ_DEFAULT_GOV_USERSPACE is not set
 CONFIG_CPU_FREQ_DEFAULT_GOV_ONDEMAND=y
 # CONFIG_CPU_FREQ_DEFAULT_GOV_CONSERVATIVE is not set
@@ -539,7 +534,7 @@
 CONFIG_CPU_IDLE=y
 CONFIG_CPU_IDLE_GOV_LADDER=y
 CONFIG_CPU_IDLE_GOV_MENU=y
-CONFIG_INTEL_IDLE=m
+# CONFIG_INTEL_IDLE is not set
 
 #
 # Bus options (PCI etc.)
@@ -548,7 +543,6 @@
 # CONFIG_PCI_GOBIOS is not set
 # CONFIG_PCI_GOMMCONFIG is not set
 # CONFIG_PCI_GODIRECT is not set
-# CONFIG_PCI_GOOLPC is not set
 CONFIG_PCI_GOANY=y
 CONFIG_PCI_BIOS=y
 CONFIG_PCI_DIRECT=y
@@ -680,13 +674,7

Bug#639331: linux-image-2.6.36-rc6-686-bigmem: Closing laptop lid hangs the system on Dell studio 1555

2011-08-25 Thread Thibault Manlay
On 26/08/11 03:04, Jonathan Nieder wrote:
 Does the latest -pae or -amd64 kernel from sid produce the same
 symptoms?  Are you able to get logs from right before the hang (for

Same issue on sid kernels linux-image-3.0.0-1-amd64 (3.0.0-2) and
linux-image-3.0.0-1-686-pae (3.0.0-2).

I tried to get logs with netconsole ; I remotely receive everything as
expected during the boot, nevertheless nothing come when I reproduce the
issue, for both of the kernels above.


 example with netconsole)?  If you have time to bisect to find the
 exact change that introduced this, that would be best of all --- it
 works like this:
 

I will do this sometime tomorrow but I think this may take a while :-)


I have a friend with the same hardware as me, although he is running
stable I think it would be pertinent trying a recent kernel on his
laptop even if I'm pretty sure of the result.

Btw, would a debug of APIC/LAPIC informations at boot be useful here?
(kernel parameters apic=debug show_lapic=all)


Thanks,

-- 
Thibault Manlay



signature.asc
Description: OpenPGP digital signature


Bug#628641: linux-image-2.6.32-5-686: crashes when closing laptop lid (dell inspiron 300m)

2011-06-09 Thread Thibault Manlay
Hello,


I just noticed a similar issue, maybe related to this bug, on my Dell
Studio 1555.

I'm using debian/testing.

When I close and then open the lid:
- On X: the screen just shows a piece of garbled pixels whose some of
them are blinking;
- On a TTY: the console seem to go back in history (the farthest it can
I suppose).

In both cases the system does not respond and the only thing to do is to
manually turn off the laptop.

This happens on the kernel 2.6.38-5 (linux-image-2.6.38-2-686-bigmem)
and it reproducible everytime.

This happens also on the kernel 2.6.39-2 (linux-image-2.6.39-2-686-pae)
from unstable

I also have a kernel that I kept from squeeze
(linux-image-2.6.32-5-686-bigmem 2.6.32-34squeeze1) on which I cannot
reproduce the bug. It looks like it is a regression somewhere between
versions 2.6.32 and 2.6.38.

I have found a workaround to this issue: using nolapic in the kernel
boot options make the issue disappear. I don't know if it breaks
something else (suspend/hibernate) but I will check this later.

This is very curious because I have used 2.6.38 for a while now, and the
issue appeared today. I hadn't performed any upgrade on my laptop for
about a month.


-- 
Thibault Manlay




signature.asc
Description: OpenPGP digital signature