Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)

2010-10-10 Thread Ben Hutchings
On Wed, 2010-10-06 at 09:53 -0700, Vagrant Cascadian wrote:
 On Mon, Sep 27, 2010 at 09:29:28AM -0700, Vagrant Cascadian wrote:
  On Tue, Sep 21, 2010 at 01:37:42PM -0700, Vagrant Cascadian wrote:
   On Mon, Sep 20, 2010 at 02:04:23AM +0100, Ben Hutchings wrote:
Please add 'memory_corruption_check=1' to the kernel parameters and
report whether that fixes the problem and whether it results in any new
log messages.
   
   added this now, will reboot and see if that fixes it... presuming it just
   doesn't hide itself for another several weeks.
  
  ok, didn't have to wait for weeks, apparently.
 
 still getting crashes every week or two, but not sure if they're related to 
 the
 initial problem...
 
 so i've been running with memory_corruption_check=1:

Unlesss you also see messages reporting 'Corrupted low memory at ...'
then what you're seeing is not the problem that memory_corruption_check
deals with.

[...]
 anything else that could possibly be of use in troubleshooting this?

You could test a newer kernel package from experimental.

Ben.

-- 
Ben Hutchings
Once a job is fouled up, anything done to improve it makes it worse.


signature.asc
Description: This is a digitally signed message part


Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)

2010-10-06 Thread Vagrant Cascadian
On Mon, Sep 27, 2010 at 09:29:28AM -0700, Vagrant Cascadian wrote:
 On Tue, Sep 21, 2010 at 01:37:42PM -0700, Vagrant Cascadian wrote:
  On Mon, Sep 20, 2010 at 02:04:23AM +0100, Ben Hutchings wrote:
   Please add 'memory_corruption_check=1' to the kernel parameters and
   report whether that fixes the problem and whether it results in any new
   log messages.
  
  added this now, will reboot and see if that fixes it... presuming it just
  doesn't hide itself for another several weeks.
 
 ok, didn't have to wait for weeks, apparently.

still getting crashes every week or two, but not sure if they're related to the
initial problem...

so i've been running with memory_corruption_check=1:

cat /proc/cmdline
BOOT_IMAGE=/vmlinuz-2.6.32-5-686 root=/dev/mapper/mneme-rwt ro quiet 
memory_corruption_check=1


Oct  5 19:14:30 mneme kernel: [ 8525.133061] BUG: unable to handle kernel NULL 
pointer dereference at (null)
Oct  5 19:14:30 mneme kernel: [ 8525.133072] IP: [c113a802] strlen+0x8/0x11
Oct  5 19:14:30 mneme kernel: [ 8525.133083] *pde = 
Oct  5 19:14:30 mneme kernel: [ 8525.133088] Oops:  [#1] SMP
Oct  5 19:14:30 mneme kernel: [ 8525.133094] last sysfs file: 
/sys/devices/pci:00/:00:02.1/resource
Oct  5 19:14:30 mneme kernel: [ 8525.133099] Modules linked in: usbhid hid tun 
ip6table_filter ip6_tables iptable_filter ip_tables x_tables sco bridge stp bnep
rfcomm l2cap crc16 bluetooth acpi_cpufreq parport_pc ppdev lp cpufreq_stats 
cpufreq_powersave parport cpufreq_conservative cpufreq_userspace kvm_intel kvm 
uinpu
t fuse dm_snapshot firewire_sbp2 loop snd_hda_codec_idt snd_hda_intel 
snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_midi 
snd_rawmidi snd_seq
_midi_event i915 snd_seq arc4 drm_kms_helper ecb snd_timer b43 snd_seq_device 
drm snd yenta_socket rsrc_nonstatic i2c_i801 i2c_algo_bit soundcore mac80211 
rng_c
ore snd_page_alloc dell_laptop joydev i2c_core cfg80211 rfkill video battery 
processor button output dcdbas psmouse ac evdev serio_raw ext3 jbd mbcache 
sha256_g
eneric aes_i586 aes_generic cbc dm_crypt dm_mod sd_mod crc_t10dif ata_generic 
ata_piix sdhci_pci sdhci thermal ssb firewire_ohci pcmcia libata mmc_core tg3 
uhci
_hcd firewire_core crc_itu_t pcmcia_core scsi_mod led_class ehci_h
Oct  5 19:14:30 mneme kernel: cd thermal_sys libphy usbcore nls_base [last 
unloaded: scsi_wait_scan]
Oct  5 19:14:30 mneme kernel: [ 8525.133242]
Oct  5 19:14:30 mneme kernel: [ 8525.133248] Pid: 10491, comm: Xorg Not tainted 
(2.6.32-5-686 #1) Latitude D420
Oct  5 19:14:30 mneme kernel: [ 8525.133254] EIP: 0060:[c113a802] EFLAGS: 
00213246 CPU: 0
Oct  5 19:14:30 mneme kernel: [ 8525.133259] EIP is at strlen+0x8/0x11
Oct  5 19:14:30 mneme kernel: [ 8525.133263] EAX:  EBX: 0fd4 ECX: 
 EDX: 0005
Oct  5 19:14:30 mneme kernel: [ 8525.133268] ESI:  EDI:  EBP: 
f6f533f0 ESP: d5871f4c
Oct  5 19:14:30 mneme kernel: [ 8525.133272]  DS: 007b ES: 007b FS: 00d8 GS: 
00e0 SS: 0068
Oct  5 19:14:30 mneme kernel: [ 8525.133278] Process Xorg (pid: 10491, 
ti=d587 task=f335d540 task.ti=d587)
Oct  5 19:14:30 mneme kernel: [ 8525.133282] Stack:
Oct  5 19:14:30 mneme kernel: [ 8525.133285]  f0790e00 c10f2aee c10bd768 
d5871f90  c128167c f0790e00 e0537d88
Oct  5 19:14:30 mneme kernel: [ 8525.133295] 0 e0537e00 c10bd9fb d5871f90 
c10bd768 fff7 f0790e00  8000
Oct  5 19:14:30 mneme kernel: [ 8525.133306] 0 c10bda8b 0a0882a8 0a088288 
7d28 ffea 000c 0a087fd0 b76afff4
Oct  5 19:14:30 mneme kernel: [ 8525.133318] Call Trace:
Oct  5 19:14:30 mneme kernel: [ 8525.133328]  [c10f2aee] ? 
sysfs_readdir+0xe0/0x13a
Oct  5 19:14:30 mneme kernel: [ 8525.16]  [c10bd768] ? filldir64+0x0/0xc5
Oct  5 19:14:30 mneme kernel: [ 8525.133342]  [c10bd9fb] ? 
vfs_readdir+0x62/0x8c
Oct  5 19:14:30 mneme kernel: [ 8525.133349]  [c10bd768] ? filldir64+0x0/0xc5
Oct  5 19:14:30 mneme kernel: [ 8525.133355]  [c10bda8b] ? 
sys_getdents64+0x66/0xa5
Oct  5 19:14:30 mneme kernel: [ 8525.133363]  [c10030fb] ? 
sysenter_do_call+0x12/0x28
Oct  5 19:14:30 mneme kernel: [ 8525.133367] Code: eb 04 19 c0 0c 01 5e 5f c3 
56 89 c6 89 d0 88 c4 ac 38 e0 74 09 84 c0 75 f7 be 01 00 00 00 89 f0 48 5e c3 
57 8
3 c9 ff 89 c7 31 c0 f2 ae f7 d1 49 89 c8 5f c3 57 31 ff 85 c9 74 0e 89 c7 89 
d0 f2
Oct  5 19:14:30 mneme kernel: [ 8525.133427] EIP: [c113a802] strlen+0x8/0x11 
SS:ESP 0068:d5871f4c
Oct  5 19:14:30 mneme kernel: [ 8525.133434] CR2: 
Oct  5 19:14:30 mneme kernel: [ 8525.133439] ---[ end trace e4f8adeee260d138 
]---

and again today:

Oct  6 08:20:00 mneme kernel: [ 5643.886857] BUG: unable to handle kernel NULL 
pointer dereference at 0010
Oct  6 08:20:00 mneme kernel: [ 5643.887055] IP: [c108cf4b] 
file_ra_state_init+0x3/0x18
Oct  6 08:20:00 mneme kernel: [ 5643.890573] *pde =  
Oct  6 08:20:00 mneme kernel: [ 5643.890573] Oops:  [#1] SMP 
Oct  6 08:20:00state_init+0x3/0x18
Oct  6 08:20:00 mneme kernel: [ 5643.890573] 

Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)

2010-09-27 Thread Vagrant Cascadian
On Tue, Sep 21, 2010 at 01:37:42PM -0700, Vagrant Cascadian wrote:
 On Mon, Sep 20, 2010 at 02:04:23AM +0100, Ben Hutchings wrote:
 
  Please add 'memory_corruption_check=1' to the kernel parameters and
  report whether that fixes the problem and whether it results in any new
  log messages.
 
 added this now, will reboot and see if that fixes it... presuming it just
 doesn't hide itself for another several weeks.

ok, didn't have to wait for weeks, apparently.

the error message was different this time...  also different is that it failed
on starting X.org, rather than starting my window manager (tritium)
sucessfully, but failing to start an x-terminal-emulator.

is it a new issue, or the same issue triggered differently due to
memory_corruption_check=1 ?

this froze the system with periodic disk activity shortly after resuming from
disk on with linux-image-2.6.32-5-686 2.6.32-23:


Sep 26 19:51:52 mneme kernel: [92514.768255] [ cut here 
]
Sep 26 19:51:52 mneme kernel: [92514.768268] WARNING: at 
/build/buildd-linux-2.6_2.6.32-23-i386-x1D1UQ/linux-2.6-2.6.32/debian/build/source_i386_none/fs/sysfs/file.c:355
 sysfs_open_file+0x91/0x259()
Sep 26 19:51:52 mneme kernel: [92514.768275] Hardware name: Latitude D420   

Sep 26 19:51:52 mneme kernel: [92514.768278] missing sysfs attribute operations 
for kobject: NULL
Sep 26 19:51:52 mneme kernel: [92514.768283] Modules linked in: mct_u232 
usbserial ext2 hfs hfsplus vfat fat isofs nls_utf8 udf usb_storage usbhid hid 
tun ip6table_filter ip6_tables iptable_filter ip_tables x_tables sco bridge stp 
bnep parport_pc ppdev lp parport l2cap crc16 bluetooth acpi_cpufreq 
cpufreq_stats cpufreq_powersave cpufreq_conservative cpufreq_userspace 
kvm_intel kvm uinput fuse dm_snapshot firewire_sbp2 loop snd_hda_codec_idt 
snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss arc4 snd_mixer_oss ecb 
snd_pcm snd_seq_midi snd_rawmidi snd_seq_midi_event b43 i915 snd_seq 
drm_kms_helper snd_timer joydev snd_seq_device drm mac80211 yenta_socket 
rsrc_nonstatic i2c_i801 snd i2c_algo_bit soundcore snd_page_alloc psmouse 
i2c_core cfg80211 video battery processor button ac dell_laptop serio_raw evdev 
rng_core dcdbas output rfkill ext3 jbd mbcache sha256_generic aes_i586 
aes_generic cbc dm_crypt dm_mod sd_mod crc_t10dif ata_generic ssb sdhci_pci 
firewire_ohci pcmcia thermal tg3 ata_piix uhci_hcd sdhci firewire
Sep 26 19:51:52 mneme kernel: _core crc_itu_t mmc_core led_class thermal_sys 
libphy libata scsi_mod ehci_hcd pcmcia_core usbcore nls_base [last unloaded: 
scsi_wait_scan]
Sep 26 19:51:52 mneme kernel: [92514.768448] Pid: 12333, comm: Xorg Not tainted 
2.6.32-5-686 #1
Sep 26 19:51:52 mneme kernel: [92514.768452] Call Trace:
Sep 26 19:51:52 mneme kernel: [92514.768461]  [c103014d] ? 
warn_slowpath_common+0x5e/0x8a
Sep 26 19:51:52 mneme kernel: [92514.768468]  [c10301ab] ? 
warn_slowpath_fmt+0x26/0x2a
Sep 26 19:51:52 mneme kernel: [92514.768475]  [c10f2050] ? 
sysfs_open_file+0x91/0x259
Sep 26 19:51:52 mneme kernel: [92514.768482]  [c10b1347] ? 
__dentry_open+0x156/0x246
Sep 26 19:51:52 mneme kernel: [92514.768489]  [c10b14c8] ? 
nameidata_to_filp+0x29/0x3c
Sep 26 19:51:52 mneme kernel: [92514.768495]  [c10f1fbf] ? 
sysfs_open_file+0x0/0x259
Sep 26 19:51:52 mneme kernel: [92514.768502]  [c10bb470] ? 
do_filp_open+0x43f/0x802
Sep 26 19:51:52 mneme kernel: [92514.768509]  [c113ae03] ? 
copy_to_user+0x29/0xf8
Sep 26 19:51:52 mneme kernel: [92514.768515]  [c10b9edc] ? 
vfs_readlink+0x2f/0x40
Sep 26 19:51:52 mneme kernel: [92514.768521]  [c10b9f7a] ? 
generic_readlink+0x48/0x6f
Sep 26 19:51:52 mneme kernel: [92514.768528]  [c10c3181] ? alloc_fd+0x52/0xb7
Sep 26 19:51:52 mneme kernel: [92514.768534]  [c10b10ff] ? 
do_sys_open+0x4c/0xdf
Sep 26 19:51:52 mneme kernel: [92514.768540]  [c10b11d6] ? sys_open+0x1e/0x23
Sep 26 19:51:52 mneme kernel: [92514.768547]  [c10030fb] ? 
sysenter_do_call+0x12/0x28
Sep 26 19:51:52 mneme kernel: [92514.768552] ---[ end trace 9cabd45097cf0d9d 
]---
Sep 26 19:51:52 mneme kernel: [92514.768704] [ cut here 
]
Sep 26 19:51:52 mneme kernel: [92514.768712] WARNING: at 
/build/buildd-linux-2.6_2.6.32-23-i386-x1D1UQ/linux-2.6-2.6.32/debian/build/source_i386_none/fs/sysfs/file.c:355
 sysfs_open_file+0x91/0x259()
Sep 26 19:51:52 mneme kernel: [92514.768718] Hardware name: Latitude D420   

Sep 26 19:51:52 mneme kernel: [92514.768722] missing sysfs attribute operations 
for kobject: NULL
Sep 26 19:51:52 mneme kernel: [92514.768725] Modules linked in: mct_u232 
usbserial ext2 hfs hfsplus vfat fat isofs nls_utf8 udf usb_storage usbhid hid 
tun ip6table_filter ip6_tables iptable_filter ip_tables x_tables sco bridge stp 
bnep parport_pc ppdev lp parport l2cap crc16 bluetooth acpi_cpufreq 
cpufreq_stats cpufreq_powersave cpufreq_conservative cpufreq_userspace 
kvm_intel kvm uinput fuse dm_snapshot firewire_sbp2 loop snd_hda_codec_idt 
snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss arc4 

Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)

2010-09-21 Thread Vagrant Cascadian
On Mon, Sep 20, 2010 at 02:04:23AM +0100, Ben Hutchings wrote:
 On Mon, 2010-09-06 at 17:05 -0700, Vagrant Cascadian wrote:
  On Tue, Sep 07, 2010 at 12:10:09AM +0100, Ben Hutchings wrote:
   On Mon, 2010-09-06 at 12:27 -0700, Vagrant Cascadian wrote:
...snip...
  i also didn't mention, largely because i'm pretty unsure if it's really the
  case, but the issue *seems* to occur more reliably when hibernating (to 
  disk)
  while on battery, but i'm really unsure of that.

that didn't appear to really make any difference... i tried all combinations of
hibernate with AC, with battery, and resume with AC, with battery, and was
unable to reproduce it reliably. in fact, i hadn't seen it since my last
comment on the bug report nearly two weeks ago... until today. gah.

so it's not exactly easy to reproduce. :(


  i'll see if i can more reliably trigger the same problem on a clean squeeze
  install on the same hardware in another partition... where i'm not as 
  worried
  about crashing.

i was also unable to reproduce it on a clean squeeze install either, so it's
probably something particular with my existing configuration or useage
patterns.

the test install was just on another partition, and my typical environment is
using lvm on an encrypted volume, if that seems likely to make any difference.
my typical environment is also upgraded from a lenny install.

 
 Please add 'memory_corruption_check=1' to the kernel parameters and
 report whether that fixes the problem and whether it results in any new
 log messages.

added this now, will reboot and see if that fixes it... presuming it just
doesn't hide itself for another several weeks.


live well,
  vagrant



-- 
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20100921203742.gb3...@talon.fglan



Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)

2010-09-19 Thread Ben Hutchings
On Mon, 2010-09-06 at 17:05 -0700, Vagrant Cascadian wrote:
 On Tue, Sep 07, 2010 at 12:10:09AM +0100, Ben Hutchings wrote:
  On Mon, 2010-09-06 at 12:27 -0700, Vagrant Cascadian wrote:
   Package: linux-2.6
   Version: 2.6.32-21
   
   i'm experiencing very similar issues, although also with earlier versions 
   of
   linux-image-2.6.32-5-686.
   
   i first started having this problem after upgrading to squeeze from lenny,
   although i was running the exact same linux-image-2.6.32-* directly on 
   lenny
   for quite some time without problems. 
   
   i had been using uswsusp, but after experiencing this problem several 
   times, i
   purged uswsusp and switched to using the in-kernel resume with 
   pm-hibernate.
  [...]
  
  And did that make any difference?
 
 heh. sorry for being unclear :)
 
 unfortunately, no, it's still Oops'ing after resume.
 
 i also didn't mention, largely because i'm pretty unsure if it's really the
 case, but the issue *seems* to occur more reliably when hibernating (to disk)
 while on battery, but i'm really unsure of that.
 
 i'll see if i can more reliably trigger the same problem on a clean squeeze
 install on the same hardware in another partition... where i'm not as worried
 about crashing.

Please add 'memory_corruption_check=1' to the kernel parameters and
report whether that fixes the problem and whether it results in any new
log messages.

Ben.

-- 
Ben Hutchings
Once a job is fouled up, anything done to improve it makes it worse.


signature.asc
Description: This is a digitally signed message part


Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)

2010-09-06 Thread Vagrant Cascadian
Package: linux-2.6
Version: 2.6.32-21

i'm experiencing very similar issues, although also with earlier versions of
linux-image-2.6.32-5-686.

i first started having this problem after upgrading to squeeze from lenny,
although i was running the exact same linux-image-2.6.32-* directly on lenny
for quite some time without problems. 

i had been using uswsusp, but after experiencing this problem several times, i
purged uswsusp and switched to using the in-kernel resume with pm-hibernate.

an example OOPS below, if it's at all useful:

Sep  5 19:00:35 mneme kernel: [ 7750.522234] BUG: unable to handle kernel NULL 
pointer dereference at 0010
Sep  5 19:00:35 mneme kernel: [ 7750.522243] IP: [c108cdc7] 
file_ra_state_init+0x3/0x18
Sep  5 19:00:35 mneme kernel: [ 7750.522256] *pde =  
Sep  5 19:00:35 mneme kernel: [ 7750.522261] Oops:  [#3] SMP 
Sep  5 19:00:35 mneme kernel: [ 7750.522266] last sysfs file: 
/sys/devices/system/cpu/cpu1/cache/index2/shared_cpu_map
Sep  5 19:00:35 mneme kernel: [ 7750.522272] Modules linked in: mct_u232 
usbserial tun ip6table_filter ip6_tables iptable_filter ip_tables x_tables 
parport_pc ppdev lp parport sco bridge stp bnep rfcomm l2cap crc16 bluetooth 
acpi_cpufreq cpufreq_stats cpufreq_powersave cpufreq_conservative 
cpufreq_userspace kvm_intel kvm uinput fuse firewire_sbp2 loop 
snd_hda_codec_idt snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss 
snd_mixer_oss snd_pcm snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq i915 
snd_timer drm_kms_helper joydev snd_seq_device arc4 ecb drm b43 snd 
yenta_socket soundcore i2c_i801 i2c_algo_bit psmouse snd_page_alloc 
rsrc_nonstatic mac80211 dell_laptop i2c_core rng_core serio_raw evdev dcdbas 
video cfg80211 output button processor battery ac rfkill ext3 jbd mbcache 
sha256_generic aes_i586 aes_generic cbc dm_crypt dm_mod sd_mod crc_t10dif 
ata_generic ssb sdhci_pci sdhci ata_piix mmc_core firewire_ohci tg3 libata 
pcmcia thermal uhci_hcd led_class firewire_core crc_itu_t libphy scsi_mod 
ehci_hcd therma
Sep  5 19:00:35 mneme kernel: l_sys pcmcia_core usbcore nls_base [last 
unloaded: scsi_wait_scan]
Sep  5 19:00:35 mneme kernel: [ 7750.522412] 
Sep  5 19:00:35 mneme kernel: [ 7750.522418] Pid: 10665, comm: x-terminal-emul 
Tainted: G  D(2.6.32-5-686 #1) Latitude D420   
Sep  5 19:00:35 mneme kernel: [ 7750.522424] EIP: 0060:[c108cdc7] EFLAGS: 
00210202 CPU: 1
Sep  5 19:00:35 mneme kernel: [ 7750.522429] EIP is at 
file_ra_state_init+0x3/0x18
Sep  5 19:00:35 mneme kernel: [ 7750.522434] EAX: ef7f45c8 EBX:  ECX: 
d5699800 EDX: 
Sep  5 19:00:35 mneme kernel: [ 7750.522438] ESI: ef7f4580 EDI:  EBP: 
f6f73b18 ESP: c49a3ea0
Sep  5 19:00:35 mneme kernel: [ 7750.522443]  DS: 007b ES: 007b FS: 00d8 GS: 
00e0 SS: 0068
Sep  5 19:00:35 mneme kernel: [ 7750.522448] Process x-terminal-emul (pid: 
10665, ti=c49a2000 task=efaa1100 task.ti=c49a2000)
Sep  5 19:00:35 mneme kernel: [ 7750.522452] Stack:
Sep  5 19:00:35 mneme kernel: [ 7750.522455]  c10b114b f6f28080 f68bb220 
c49a3f00 ef7f4580 c49a3f00 c49a3f00 0003
Sep  5 19:00:35 mneme kernel: [ 7750.522465] 0 c10b12ac ef7f4580 c10b4c40 
f6cdfe80  c49a3f00 c10bb254 
Sep  5 19:00:35 mneme kernel: [ 7750.522476] 0 0002 efa7a000 ff9c 
c1c7cbc0 b7112000  fffa0844 ef992898
Sep  5 19:00:35 mneme kernel: [ 7750.522488] Call Trace:
Sep  5 19:00:35 mneme kernel: [ 7750.522495]  [c10b114b] ? 
__dentry_open+0x176/0x246
Sep  5 19:00:35 mneme kernel: [ 7750.522502]  [c10b12ac] ? 
nameidata_to_filp+0x29/0x3c
Sep  5 19:00:35 mneme kernel: [ 7750.522509]  [c10b4c40] ? 
chrdev_open+0x0/0x116
Sep  5 19:00:35 mneme kernel: [ 7750.522516]  [c10bb254] ? 
do_filp_open+0x43f/0x802
Sep  5 19:00:35 mneme kernel: [ 7750.522524]  [c10c2f65] ? alloc_fd+0x52/0xb7
Sep  5 19:00:35 mneme kernel: [ 7750.522530]  [c10b0ee3] ? 
do_sys_open+0x4c/0xdf
Sep  5 19:00:35 mneme kernel: [ 7750.522536]  [c10b0fba] ? sys_open+0x1e/0x23
Sep  5 19:00:35 mneme kernel: [ 7750.522543]  [c10030fb] ? 
sysenter_do_call+0x12/0x28
Sep  5 19:00:35 mneme kernel: [ 7750.522547] Code: c3 53 89 d3 ff 74 24 0c ff 
74 24 0c e8 0e 93 fa ff 5a 59 85 c0 75 0e 85 db 74 0a c7 05 34 fc 4a c1 00 00 
00 00 5b c3 90 8b 52 40 8b 52 10 c7 40 14 ff ff ff ff c7 40 18 ff ff ff ff 89 
50 0c c3 
Sep  5 19:00:35 mneme kernel: [ 7750.522607] EIP: [c108cdc7] 
file_ra_state_init+0x3/0x18 SS:ESP 0068:c49a3ea0
Sep  5 19:00:35 mneme kernel: [ 7750.522616] CR2: 0010
Sep  5 19:00:35 mneme kernel: [ 7750.522620] ---[ end trace f383376f90ced1d2 
]---

live well,
  vagrant

-- Package-specific info:
** Version:
Linux version 2.6.32-5-686 (Debian 2.6.32-21) (b...@decadent.org.uk) (gcc 
version 4.3.5 (Debian 4.3.5-2) ) #1 SMP Wed Aug 25 14:28:12 UTC 2010

** Command line:
BOOT_IMAGE=/vmlinuz-2.6.32-5-686 root=/dev/mapper/mneme-rwt ro quiet

** Not tainted

** Kernel log:
[ 5400.080272] CPU: Physical Processor ID: 0
[ 5400.080272] CPU: Processor Core ID: 1
[ 5400.080272] CPU1: 

Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)

2010-09-06 Thread Ben Hutchings
On Mon, 2010-09-06 at 12:27 -0700, Vagrant Cascadian wrote:
 Package: linux-2.6
 Version: 2.6.32-21
 
 i'm experiencing very similar issues, although also with earlier versions of
 linux-image-2.6.32-5-686.
 
 i first started having this problem after upgrading to squeeze from lenny,
 although i was running the exact same linux-image-2.6.32-* directly on lenny
 for quite some time without problems. 
 
 i had been using uswsusp, but after experiencing this problem several times, i
 purged uswsusp and switched to using the in-kernel resume with pm-hibernate.
[...]

And did that make any difference?

Ben.

-- 
Ben Hutchings
Once a job is fouled up, anything done to improve it makes it worse.


signature.asc
Description: This is a digitally signed message part


Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)

2010-09-06 Thread Vagrant Cascadian
On Tue, Sep 07, 2010 at 12:10:09AM +0100, Ben Hutchings wrote:
 On Mon, 2010-09-06 at 12:27 -0700, Vagrant Cascadian wrote:
  Package: linux-2.6
  Version: 2.6.32-21
  
  i'm experiencing very similar issues, although also with earlier versions of
  linux-image-2.6.32-5-686.
  
  i first started having this problem after upgrading to squeeze from lenny,
  although i was running the exact same linux-image-2.6.32-* directly on lenny
  for quite some time without problems. 
  
  i had been using uswsusp, but after experiencing this problem several 
  times, i
  purged uswsusp and switched to using the in-kernel resume with pm-hibernate.
 [...]
 
 And did that make any difference?

heh. sorry for being unclear :)

unfortunately, no, it's still Oops'ing after resume.

i also didn't mention, largely because i'm pretty unsure if it's really the
case, but the issue *seems* to occur more reliably when hibernating (to disk)
while on battery, but i'm really unsure of that.

i'll see if i can more reliably trigger the same problem on a clean squeeze
install on the same hardware in another partition... where i'm not as worried
about crashing.

live well,
  vagrant



-- 
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20100907000545.gx28...@claws.fglan



Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)

2010-09-01 Thread Alexandre Rossi
Source: linux-2.6
Version: 2.6.32-21
Severity: normal

Hi,

Since the upgrade from 2.6.32-20 - 2.6.32-21, my laptop fails to resume
properly into Xorg : the cursor only appears when I move it and there is
nothing much more I can do than switching to the console, login as root and
reboot.

In the console, I can see a lot of OOpses that are visible in the attached log
file. This did not happend with 2.6.32-20, or at least not before many
suspend cycles.

Here is the first of the many similar stack traces :
--
Sep  1 21:29:47 annalee kernel: [ 1062.410659] BUG: unable to handle kernel 
NULL pointer dereference at 0020
Sep  1 21:29:47 annalee kernel: [ 1062.410665] IP: [810ba09c] 
file_ra_state_init+0x4/0x14
Sep  1 21:29:47 annalee kernel: [ 1062.410673] PGD 7c87e067 PUD 7c370067 PMD 0
Sep  1 21:29:47 annalee kernel: [ 1062.410678] Oops:  [#1] SMP
Sep  1 21:29:47 annalee kernel: [ 1062.410681] last sysfs file: 
/sys/devices/LNXSYSTM:00/LNXSYBUS:00/ACPI0003:00/power_supply/AC/uevent
Sep  1 21:29:47 annalee kernel: [ 1062.410685] CPU 1
Sep  1 21:29:47 annalee kernel: [ 1062.410687] Modules linked in: binfmt_misc 
acpi_cpufreq firewire_sbp2 firewire_core crc_itu_t loop sha256_generic 
aes_x86_64 aes_generic cbc dm_crypt arc4 snd_hda_codec_idt ecb iwl3945 
snd_hda_intel iwlcore snd_hda_codec snd_hwdep joydev mac80211 snd_pcm snd_seq 
snd_timer snd_seq_device led_class battery dell_laptop snd cfg80211 soundcore 
psmouse snd_page_alloc i2c_i801 evdev pcspkr rfkill dcdbas wmi serio_raw ac 
processor ext3 jbd mbcache dm_mod sd_mod crc_t10dif i915 drm_kms_helper drm 
ata_generic uhci_hcd i2c_algo_bit tg3 libphy thermal ata_piix button libata 
ehci_hcd scsi_mod i2c_core video thermal_sys output usbcore nls_base [last 
unloaded: scsi_wait_scan]
Sep  1 21:29:47 annalee kernel: [ 1062.410739] Pid: 2798, comm: date Not 
tainted 2.6.32-5-amd64 #1 Latitude D630
Sep  1 21:29:47 annalee kernel: [ 1062.410742] RIP: 0010:[810ba09c]  
[810ba09c] file_ra_state_init+0x4/0x14
Sep  1 21:29:47 annalee kernel: [ 1062.410747] RSP: 0018:88007df39db0  
EFLAGS: 00010206
Sep  1 21:29:47 annalee kernel: [ 1062.410749] RAX:  RBX: 
 RCX: 88007c89ba80
Sep  1 21:29:47 annalee kernel: [ 1062.410752] RDX: 88007c89ba80 RSI: 
88007ef53798 RDI: 88007c89baf0
Sep  1 21:29:47 annalee kernel: [ 1062.410754] RBP: 88007c89ba80 R08: 
 R09: 880037bd9c00
Sep  1 21:29:47 annalee kernel: [ 1062.410757] R10: 88007df39e48 R11: 
81151385 R12: 
Sep  1 21:29:47 annalee kernel: [ 1062.410759] R13: 88007ef53678 R14: 
0024 R15: 810eb47c
Sep  1 21:29:47 annalee kernel: [ 1062.410762] FS:  () 
GS:88000190() knlGS:
Sep  1 21:29:47 annalee kernel: [ 1062.410765] CS:  0010 DS:  ES:  CR0: 
80050033
Sep  1 21:29:47 annalee kernel: [ 1062.410768] CR2: 0020 CR3: 
7c70e000 CR4: 06e0
Sep  1 21:29:47 annalee kernel: [ 1062.410770] DR0:  DR1: 
 DR2: 
Sep  1 21:29:47 annalee kernel: [ 1062.410773] DR3:  DR6: 
0ff0 DR7: 0400
Sep  1 21:29:47 annalee kernel: [ 1062.410776] Process date (pid: 2798, 
threadinfo 88007df38000, task 88007db662e0)
Sep  1 21:29:47 annalee kernel: [ 1062.410778] Stack:
Sep  1 21:29:47 annalee kernel: [ 1062.410779]  810eb991 
880037bd9c00 880037a1aa00 88007a759600
Sep  1 21:29:47 annalee kernel: [ 1062.410783] 0  
88007df39e48 88007df39e48 8001
Sep  1 21:29:47 annalee kernel: [ 1062.410787] 0 0024 
ff9c 810f70bb 88007df39e78
Sep  1 21:29:47 annalee kernel: [ 1062.410792] Call Trace:
Sep  1 21:29:47 annalee kernel: [ 1062.410797]  [810eb991] ? 
__dentry_open+0x1c4/0x2bf
Sep  1 21:29:47 annalee kernel: [ 1062.410802]  [810f70bb] ? 
do_filp_open+0x4e4/0x94b
Sep  1 21:29:47 annalee kernel: [ 1062.410806]  [810e40dd] ? 
virt_to_head_page+0x9/0x2a
Sep  1 21:29:47 annalee kernel: [ 1062.410810]  [811000c5] ? 
alloc_fd+0x67/0x10c
Sep  1 21:29:47 annalee kernel: [ 1062.410813]  [810eb6fb] ? 
do_sys_open+0x55/0xfc
Sep  1 21:29:47 annalee kernel: [ 1062.410818]  [81010b42] ? 
system_call_fastpath+0x16/0x1b
Sep  1 21:29:47 annalee kernel: [ 1062.410821] Code: 89 d8 5b 5d 41 5c c3 53 89 
f3 e8 57 98 f9 ff 85 c0 75 0f 85 db 74 0b 48 c7 05 ed 86 59 00 00 00 00 00 5b 
c3 90 90 90 48 8b 46 68 48 8b 40 20 48 c7 47 18 ff ff ff ff 89 47 10 c3 65 8b 
04 25 98
Sep  1 21:29:47 annalee kernel: [ 1062.410853] RIP  [810ba09c] 
file_ra_state_init+0x4/0x14
Sep  1 21:29:47 annalee kernel: [ 1062.410857]  RSP 88007df39db0
Sep  1 21:29:47 annalee kernel: [ 1062.410859] CR2: 0020
Sep  1 21:29:47 annalee kernel: [ 1062.410862] ---[ end trace f71fcf8b8aa10d22 
]---
Sep  1