Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)
On Wed, 2010-10-06 at 09:53 -0700, Vagrant Cascadian wrote: On Mon, Sep 27, 2010 at 09:29:28AM -0700, Vagrant Cascadian wrote: On Tue, Sep 21, 2010 at 01:37:42PM -0700, Vagrant Cascadian wrote: On Mon, Sep 20, 2010 at 02:04:23AM +0100, Ben Hutchings wrote: Please add 'memory_corruption_check=1' to the kernel parameters and report whether that fixes the problem and whether it results in any new log messages. added this now, will reboot and see if that fixes it... presuming it just doesn't hide itself for another several weeks. ok, didn't have to wait for weeks, apparently. still getting crashes every week or two, but not sure if they're related to the initial problem... so i've been running with memory_corruption_check=1: Unlesss you also see messages reporting 'Corrupted low memory at ...' then what you're seeing is not the problem that memory_corruption_check deals with. [...] anything else that could possibly be of use in troubleshooting this? You could test a newer kernel package from experimental. Ben. -- Ben Hutchings Once a job is fouled up, anything done to improve it makes it worse. signature.asc Description: This is a digitally signed message part
Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)
On Mon, Sep 27, 2010 at 09:29:28AM -0700, Vagrant Cascadian wrote: On Tue, Sep 21, 2010 at 01:37:42PM -0700, Vagrant Cascadian wrote: On Mon, Sep 20, 2010 at 02:04:23AM +0100, Ben Hutchings wrote: Please add 'memory_corruption_check=1' to the kernel parameters and report whether that fixes the problem and whether it results in any new log messages. added this now, will reboot and see if that fixes it... presuming it just doesn't hide itself for another several weeks. ok, didn't have to wait for weeks, apparently. still getting crashes every week or two, but not sure if they're related to the initial problem... so i've been running with memory_corruption_check=1: cat /proc/cmdline BOOT_IMAGE=/vmlinuz-2.6.32-5-686 root=/dev/mapper/mneme-rwt ro quiet memory_corruption_check=1 Oct 5 19:14:30 mneme kernel: [ 8525.133061] BUG: unable to handle kernel NULL pointer dereference at (null) Oct 5 19:14:30 mneme kernel: [ 8525.133072] IP: [c113a802] strlen+0x8/0x11 Oct 5 19:14:30 mneme kernel: [ 8525.133083] *pde = Oct 5 19:14:30 mneme kernel: [ 8525.133088] Oops: [#1] SMP Oct 5 19:14:30 mneme kernel: [ 8525.133094] last sysfs file: /sys/devices/pci:00/:00:02.1/resource Oct 5 19:14:30 mneme kernel: [ 8525.133099] Modules linked in: usbhid hid tun ip6table_filter ip6_tables iptable_filter ip_tables x_tables sco bridge stp bnep rfcomm l2cap crc16 bluetooth acpi_cpufreq parport_pc ppdev lp cpufreq_stats cpufreq_powersave parport cpufreq_conservative cpufreq_userspace kvm_intel kvm uinpu t fuse dm_snapshot firewire_sbp2 loop snd_hda_codec_idt snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_midi snd_rawmidi snd_seq _midi_event i915 snd_seq arc4 drm_kms_helper ecb snd_timer b43 snd_seq_device drm snd yenta_socket rsrc_nonstatic i2c_i801 i2c_algo_bit soundcore mac80211 rng_c ore snd_page_alloc dell_laptop joydev i2c_core cfg80211 rfkill video battery processor button output dcdbas psmouse ac evdev serio_raw ext3 jbd mbcache sha256_g eneric aes_i586 aes_generic cbc dm_crypt dm_mod sd_mod crc_t10dif ata_generic ata_piix sdhci_pci sdhci thermal ssb firewire_ohci pcmcia libata mmc_core tg3 uhci _hcd firewire_core crc_itu_t pcmcia_core scsi_mod led_class ehci_h Oct 5 19:14:30 mneme kernel: cd thermal_sys libphy usbcore nls_base [last unloaded: scsi_wait_scan] Oct 5 19:14:30 mneme kernel: [ 8525.133242] Oct 5 19:14:30 mneme kernel: [ 8525.133248] Pid: 10491, comm: Xorg Not tainted (2.6.32-5-686 #1) Latitude D420 Oct 5 19:14:30 mneme kernel: [ 8525.133254] EIP: 0060:[c113a802] EFLAGS: 00213246 CPU: 0 Oct 5 19:14:30 mneme kernel: [ 8525.133259] EIP is at strlen+0x8/0x11 Oct 5 19:14:30 mneme kernel: [ 8525.133263] EAX: EBX: 0fd4 ECX: EDX: 0005 Oct 5 19:14:30 mneme kernel: [ 8525.133268] ESI: EDI: EBP: f6f533f0 ESP: d5871f4c Oct 5 19:14:30 mneme kernel: [ 8525.133272] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 Oct 5 19:14:30 mneme kernel: [ 8525.133278] Process Xorg (pid: 10491, ti=d587 task=f335d540 task.ti=d587) Oct 5 19:14:30 mneme kernel: [ 8525.133282] Stack: Oct 5 19:14:30 mneme kernel: [ 8525.133285] f0790e00 c10f2aee c10bd768 d5871f90 c128167c f0790e00 e0537d88 Oct 5 19:14:30 mneme kernel: [ 8525.133295] 0 e0537e00 c10bd9fb d5871f90 c10bd768 fff7 f0790e00 8000 Oct 5 19:14:30 mneme kernel: [ 8525.133306] 0 c10bda8b 0a0882a8 0a088288 7d28 ffea 000c 0a087fd0 b76afff4 Oct 5 19:14:30 mneme kernel: [ 8525.133318] Call Trace: Oct 5 19:14:30 mneme kernel: [ 8525.133328] [c10f2aee] ? sysfs_readdir+0xe0/0x13a Oct 5 19:14:30 mneme kernel: [ 8525.16] [c10bd768] ? filldir64+0x0/0xc5 Oct 5 19:14:30 mneme kernel: [ 8525.133342] [c10bd9fb] ? vfs_readdir+0x62/0x8c Oct 5 19:14:30 mneme kernel: [ 8525.133349] [c10bd768] ? filldir64+0x0/0xc5 Oct 5 19:14:30 mneme kernel: [ 8525.133355] [c10bda8b] ? sys_getdents64+0x66/0xa5 Oct 5 19:14:30 mneme kernel: [ 8525.133363] [c10030fb] ? sysenter_do_call+0x12/0x28 Oct 5 19:14:30 mneme kernel: [ 8525.133367] Code: eb 04 19 c0 0c 01 5e 5f c3 56 89 c6 89 d0 88 c4 ac 38 e0 74 09 84 c0 75 f7 be 01 00 00 00 89 f0 48 5e c3 57 8 3 c9 ff 89 c7 31 c0 f2 ae f7 d1 49 89 c8 5f c3 57 31 ff 85 c9 74 0e 89 c7 89 d0 f2 Oct 5 19:14:30 mneme kernel: [ 8525.133427] EIP: [c113a802] strlen+0x8/0x11 SS:ESP 0068:d5871f4c Oct 5 19:14:30 mneme kernel: [ 8525.133434] CR2: Oct 5 19:14:30 mneme kernel: [ 8525.133439] ---[ end trace e4f8adeee260d138 ]--- and again today: Oct 6 08:20:00 mneme kernel: [ 5643.886857] BUG: unable to handle kernel NULL pointer dereference at 0010 Oct 6 08:20:00 mneme kernel: [ 5643.887055] IP: [c108cf4b] file_ra_state_init+0x3/0x18 Oct 6 08:20:00 mneme kernel: [ 5643.890573] *pde = Oct 6 08:20:00 mneme kernel: [ 5643.890573] Oops: [#1] SMP Oct 6 08:20:00state_init+0x3/0x18 Oct 6 08:20:00 mneme kernel: [ 5643.890573]
Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)
On Tue, Sep 21, 2010 at 01:37:42PM -0700, Vagrant Cascadian wrote: On Mon, Sep 20, 2010 at 02:04:23AM +0100, Ben Hutchings wrote: Please add 'memory_corruption_check=1' to the kernel parameters and report whether that fixes the problem and whether it results in any new log messages. added this now, will reboot and see if that fixes it... presuming it just doesn't hide itself for another several weeks. ok, didn't have to wait for weeks, apparently. the error message was different this time... also different is that it failed on starting X.org, rather than starting my window manager (tritium) sucessfully, but failing to start an x-terminal-emulator. is it a new issue, or the same issue triggered differently due to memory_corruption_check=1 ? this froze the system with periodic disk activity shortly after resuming from disk on with linux-image-2.6.32-5-686 2.6.32-23: Sep 26 19:51:52 mneme kernel: [92514.768255] [ cut here ] Sep 26 19:51:52 mneme kernel: [92514.768268] WARNING: at /build/buildd-linux-2.6_2.6.32-23-i386-x1D1UQ/linux-2.6-2.6.32/debian/build/source_i386_none/fs/sysfs/file.c:355 sysfs_open_file+0x91/0x259() Sep 26 19:51:52 mneme kernel: [92514.768275] Hardware name: Latitude D420 Sep 26 19:51:52 mneme kernel: [92514.768278] missing sysfs attribute operations for kobject: NULL Sep 26 19:51:52 mneme kernel: [92514.768283] Modules linked in: mct_u232 usbserial ext2 hfs hfsplus vfat fat isofs nls_utf8 udf usb_storage usbhid hid tun ip6table_filter ip6_tables iptable_filter ip_tables x_tables sco bridge stp bnep parport_pc ppdev lp parport l2cap crc16 bluetooth acpi_cpufreq cpufreq_stats cpufreq_powersave cpufreq_conservative cpufreq_userspace kvm_intel kvm uinput fuse dm_snapshot firewire_sbp2 loop snd_hda_codec_idt snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss arc4 snd_mixer_oss ecb snd_pcm snd_seq_midi snd_rawmidi snd_seq_midi_event b43 i915 snd_seq drm_kms_helper snd_timer joydev snd_seq_device drm mac80211 yenta_socket rsrc_nonstatic i2c_i801 snd i2c_algo_bit soundcore snd_page_alloc psmouse i2c_core cfg80211 video battery processor button ac dell_laptop serio_raw evdev rng_core dcdbas output rfkill ext3 jbd mbcache sha256_generic aes_i586 aes_generic cbc dm_crypt dm_mod sd_mod crc_t10dif ata_generic ssb sdhci_pci firewire_ohci pcmcia thermal tg3 ata_piix uhci_hcd sdhci firewire Sep 26 19:51:52 mneme kernel: _core crc_itu_t mmc_core led_class thermal_sys libphy libata scsi_mod ehci_hcd pcmcia_core usbcore nls_base [last unloaded: scsi_wait_scan] Sep 26 19:51:52 mneme kernel: [92514.768448] Pid: 12333, comm: Xorg Not tainted 2.6.32-5-686 #1 Sep 26 19:51:52 mneme kernel: [92514.768452] Call Trace: Sep 26 19:51:52 mneme kernel: [92514.768461] [c103014d] ? warn_slowpath_common+0x5e/0x8a Sep 26 19:51:52 mneme kernel: [92514.768468] [c10301ab] ? warn_slowpath_fmt+0x26/0x2a Sep 26 19:51:52 mneme kernel: [92514.768475] [c10f2050] ? sysfs_open_file+0x91/0x259 Sep 26 19:51:52 mneme kernel: [92514.768482] [c10b1347] ? __dentry_open+0x156/0x246 Sep 26 19:51:52 mneme kernel: [92514.768489] [c10b14c8] ? nameidata_to_filp+0x29/0x3c Sep 26 19:51:52 mneme kernel: [92514.768495] [c10f1fbf] ? sysfs_open_file+0x0/0x259 Sep 26 19:51:52 mneme kernel: [92514.768502] [c10bb470] ? do_filp_open+0x43f/0x802 Sep 26 19:51:52 mneme kernel: [92514.768509] [c113ae03] ? copy_to_user+0x29/0xf8 Sep 26 19:51:52 mneme kernel: [92514.768515] [c10b9edc] ? vfs_readlink+0x2f/0x40 Sep 26 19:51:52 mneme kernel: [92514.768521] [c10b9f7a] ? generic_readlink+0x48/0x6f Sep 26 19:51:52 mneme kernel: [92514.768528] [c10c3181] ? alloc_fd+0x52/0xb7 Sep 26 19:51:52 mneme kernel: [92514.768534] [c10b10ff] ? do_sys_open+0x4c/0xdf Sep 26 19:51:52 mneme kernel: [92514.768540] [c10b11d6] ? sys_open+0x1e/0x23 Sep 26 19:51:52 mneme kernel: [92514.768547] [c10030fb] ? sysenter_do_call+0x12/0x28 Sep 26 19:51:52 mneme kernel: [92514.768552] ---[ end trace 9cabd45097cf0d9d ]--- Sep 26 19:51:52 mneme kernel: [92514.768704] [ cut here ] Sep 26 19:51:52 mneme kernel: [92514.768712] WARNING: at /build/buildd-linux-2.6_2.6.32-23-i386-x1D1UQ/linux-2.6-2.6.32/debian/build/source_i386_none/fs/sysfs/file.c:355 sysfs_open_file+0x91/0x259() Sep 26 19:51:52 mneme kernel: [92514.768718] Hardware name: Latitude D420 Sep 26 19:51:52 mneme kernel: [92514.768722] missing sysfs attribute operations for kobject: NULL Sep 26 19:51:52 mneme kernel: [92514.768725] Modules linked in: mct_u232 usbserial ext2 hfs hfsplus vfat fat isofs nls_utf8 udf usb_storage usbhid hid tun ip6table_filter ip6_tables iptable_filter ip_tables x_tables sco bridge stp bnep parport_pc ppdev lp parport l2cap crc16 bluetooth acpi_cpufreq cpufreq_stats cpufreq_powersave cpufreq_conservative cpufreq_userspace kvm_intel kvm uinput fuse dm_snapshot firewire_sbp2 loop snd_hda_codec_idt snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss arc4
Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)
On Mon, Sep 20, 2010 at 02:04:23AM +0100, Ben Hutchings wrote: On Mon, 2010-09-06 at 17:05 -0700, Vagrant Cascadian wrote: On Tue, Sep 07, 2010 at 12:10:09AM +0100, Ben Hutchings wrote: On Mon, 2010-09-06 at 12:27 -0700, Vagrant Cascadian wrote: ...snip... i also didn't mention, largely because i'm pretty unsure if it's really the case, but the issue *seems* to occur more reliably when hibernating (to disk) while on battery, but i'm really unsure of that. that didn't appear to really make any difference... i tried all combinations of hibernate with AC, with battery, and resume with AC, with battery, and was unable to reproduce it reliably. in fact, i hadn't seen it since my last comment on the bug report nearly two weeks ago... until today. gah. so it's not exactly easy to reproduce. :( i'll see if i can more reliably trigger the same problem on a clean squeeze install on the same hardware in another partition... where i'm not as worried about crashing. i was also unable to reproduce it on a clean squeeze install either, so it's probably something particular with my existing configuration or useage patterns. the test install was just on another partition, and my typical environment is using lvm on an encrypted volume, if that seems likely to make any difference. my typical environment is also upgraded from a lenny install. Please add 'memory_corruption_check=1' to the kernel parameters and report whether that fixes the problem and whether it results in any new log messages. added this now, will reboot and see if that fixes it... presuming it just doesn't hide itself for another several weeks. live well, vagrant -- To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org Archive: http://lists.debian.org/20100921203742.gb3...@talon.fglan
Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)
On Mon, 2010-09-06 at 17:05 -0700, Vagrant Cascadian wrote: On Tue, Sep 07, 2010 at 12:10:09AM +0100, Ben Hutchings wrote: On Mon, 2010-09-06 at 12:27 -0700, Vagrant Cascadian wrote: Package: linux-2.6 Version: 2.6.32-21 i'm experiencing very similar issues, although also with earlier versions of linux-image-2.6.32-5-686. i first started having this problem after upgrading to squeeze from lenny, although i was running the exact same linux-image-2.6.32-* directly on lenny for quite some time without problems. i had been using uswsusp, but after experiencing this problem several times, i purged uswsusp and switched to using the in-kernel resume with pm-hibernate. [...] And did that make any difference? heh. sorry for being unclear :) unfortunately, no, it's still Oops'ing after resume. i also didn't mention, largely because i'm pretty unsure if it's really the case, but the issue *seems* to occur more reliably when hibernating (to disk) while on battery, but i'm really unsure of that. i'll see if i can more reliably trigger the same problem on a clean squeeze install on the same hardware in another partition... where i'm not as worried about crashing. Please add 'memory_corruption_check=1' to the kernel parameters and report whether that fixes the problem and whether it results in any new log messages. Ben. -- Ben Hutchings Once a job is fouled up, anything done to improve it makes it worse. signature.asc Description: This is a digitally signed message part
Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)
Package: linux-2.6 Version: 2.6.32-21 i'm experiencing very similar issues, although also with earlier versions of linux-image-2.6.32-5-686. i first started having this problem after upgrading to squeeze from lenny, although i was running the exact same linux-image-2.6.32-* directly on lenny for quite some time without problems. i had been using uswsusp, but after experiencing this problem several times, i purged uswsusp and switched to using the in-kernel resume with pm-hibernate. an example OOPS below, if it's at all useful: Sep 5 19:00:35 mneme kernel: [ 7750.522234] BUG: unable to handle kernel NULL pointer dereference at 0010 Sep 5 19:00:35 mneme kernel: [ 7750.522243] IP: [c108cdc7] file_ra_state_init+0x3/0x18 Sep 5 19:00:35 mneme kernel: [ 7750.522256] *pde = Sep 5 19:00:35 mneme kernel: [ 7750.522261] Oops: [#3] SMP Sep 5 19:00:35 mneme kernel: [ 7750.522266] last sysfs file: /sys/devices/system/cpu/cpu1/cache/index2/shared_cpu_map Sep 5 19:00:35 mneme kernel: [ 7750.522272] Modules linked in: mct_u232 usbserial tun ip6table_filter ip6_tables iptable_filter ip_tables x_tables parport_pc ppdev lp parport sco bridge stp bnep rfcomm l2cap crc16 bluetooth acpi_cpufreq cpufreq_stats cpufreq_powersave cpufreq_conservative cpufreq_userspace kvm_intel kvm uinput fuse firewire_sbp2 loop snd_hda_codec_idt snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq i915 snd_timer drm_kms_helper joydev snd_seq_device arc4 ecb drm b43 snd yenta_socket soundcore i2c_i801 i2c_algo_bit psmouse snd_page_alloc rsrc_nonstatic mac80211 dell_laptop i2c_core rng_core serio_raw evdev dcdbas video cfg80211 output button processor battery ac rfkill ext3 jbd mbcache sha256_generic aes_i586 aes_generic cbc dm_crypt dm_mod sd_mod crc_t10dif ata_generic ssb sdhci_pci sdhci ata_piix mmc_core firewire_ohci tg3 libata pcmcia thermal uhci_hcd led_class firewire_core crc_itu_t libphy scsi_mod ehci_hcd therma Sep 5 19:00:35 mneme kernel: l_sys pcmcia_core usbcore nls_base [last unloaded: scsi_wait_scan] Sep 5 19:00:35 mneme kernel: [ 7750.522412] Sep 5 19:00:35 mneme kernel: [ 7750.522418] Pid: 10665, comm: x-terminal-emul Tainted: G D(2.6.32-5-686 #1) Latitude D420 Sep 5 19:00:35 mneme kernel: [ 7750.522424] EIP: 0060:[c108cdc7] EFLAGS: 00210202 CPU: 1 Sep 5 19:00:35 mneme kernel: [ 7750.522429] EIP is at file_ra_state_init+0x3/0x18 Sep 5 19:00:35 mneme kernel: [ 7750.522434] EAX: ef7f45c8 EBX: ECX: d5699800 EDX: Sep 5 19:00:35 mneme kernel: [ 7750.522438] ESI: ef7f4580 EDI: EBP: f6f73b18 ESP: c49a3ea0 Sep 5 19:00:35 mneme kernel: [ 7750.522443] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 Sep 5 19:00:35 mneme kernel: [ 7750.522448] Process x-terminal-emul (pid: 10665, ti=c49a2000 task=efaa1100 task.ti=c49a2000) Sep 5 19:00:35 mneme kernel: [ 7750.522452] Stack: Sep 5 19:00:35 mneme kernel: [ 7750.522455] c10b114b f6f28080 f68bb220 c49a3f00 ef7f4580 c49a3f00 c49a3f00 0003 Sep 5 19:00:35 mneme kernel: [ 7750.522465] 0 c10b12ac ef7f4580 c10b4c40 f6cdfe80 c49a3f00 c10bb254 Sep 5 19:00:35 mneme kernel: [ 7750.522476] 0 0002 efa7a000 ff9c c1c7cbc0 b7112000 fffa0844 ef992898 Sep 5 19:00:35 mneme kernel: [ 7750.522488] Call Trace: Sep 5 19:00:35 mneme kernel: [ 7750.522495] [c10b114b] ? __dentry_open+0x176/0x246 Sep 5 19:00:35 mneme kernel: [ 7750.522502] [c10b12ac] ? nameidata_to_filp+0x29/0x3c Sep 5 19:00:35 mneme kernel: [ 7750.522509] [c10b4c40] ? chrdev_open+0x0/0x116 Sep 5 19:00:35 mneme kernel: [ 7750.522516] [c10bb254] ? do_filp_open+0x43f/0x802 Sep 5 19:00:35 mneme kernel: [ 7750.522524] [c10c2f65] ? alloc_fd+0x52/0xb7 Sep 5 19:00:35 mneme kernel: [ 7750.522530] [c10b0ee3] ? do_sys_open+0x4c/0xdf Sep 5 19:00:35 mneme kernel: [ 7750.522536] [c10b0fba] ? sys_open+0x1e/0x23 Sep 5 19:00:35 mneme kernel: [ 7750.522543] [c10030fb] ? sysenter_do_call+0x12/0x28 Sep 5 19:00:35 mneme kernel: [ 7750.522547] Code: c3 53 89 d3 ff 74 24 0c ff 74 24 0c e8 0e 93 fa ff 5a 59 85 c0 75 0e 85 db 74 0a c7 05 34 fc 4a c1 00 00 00 00 5b c3 90 8b 52 40 8b 52 10 c7 40 14 ff ff ff ff c7 40 18 ff ff ff ff 89 50 0c c3 Sep 5 19:00:35 mneme kernel: [ 7750.522607] EIP: [c108cdc7] file_ra_state_init+0x3/0x18 SS:ESP 0068:c49a3ea0 Sep 5 19:00:35 mneme kernel: [ 7750.522616] CR2: 0010 Sep 5 19:00:35 mneme kernel: [ 7750.522620] ---[ end trace f383376f90ced1d2 ]--- live well, vagrant -- Package-specific info: ** Version: Linux version 2.6.32-5-686 (Debian 2.6.32-21) (b...@decadent.org.uk) (gcc version 4.3.5 (Debian 4.3.5-2) ) #1 SMP Wed Aug 25 14:28:12 UTC 2010 ** Command line: BOOT_IMAGE=/vmlinuz-2.6.32-5-686 root=/dev/mapper/mneme-rwt ro quiet ** Not tainted ** Kernel log: [ 5400.080272] CPU: Physical Processor ID: 0 [ 5400.080272] CPU: Processor Core ID: 1 [ 5400.080272] CPU1:
Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)
On Mon, 2010-09-06 at 12:27 -0700, Vagrant Cascadian wrote: Package: linux-2.6 Version: 2.6.32-21 i'm experiencing very similar issues, although also with earlier versions of linux-image-2.6.32-5-686. i first started having this problem after upgrading to squeeze from lenny, although i was running the exact same linux-image-2.6.32-* directly on lenny for quite some time without problems. i had been using uswsusp, but after experiencing this problem several times, i purged uswsusp and switched to using the in-kernel resume with pm-hibernate. [...] And did that make any difference? Ben. -- Ben Hutchings Once a job is fouled up, anything done to improve it makes it worse. signature.asc Description: This is a digitally signed message part
Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)
On Tue, Sep 07, 2010 at 12:10:09AM +0100, Ben Hutchings wrote: On Mon, 2010-09-06 at 12:27 -0700, Vagrant Cascadian wrote: Package: linux-2.6 Version: 2.6.32-21 i'm experiencing very similar issues, although also with earlier versions of linux-image-2.6.32-5-686. i first started having this problem after upgrading to squeeze from lenny, although i was running the exact same linux-image-2.6.32-* directly on lenny for quite some time without problems. i had been using uswsusp, but after experiencing this problem several times, i purged uswsusp and switched to using the in-kernel resume with pm-hibernate. [...] And did that make any difference? heh. sorry for being unclear :) unfortunately, no, it's still Oops'ing after resume. i also didn't mention, largely because i'm pretty unsure if it's really the case, but the issue *seems* to occur more reliably when hibernating (to disk) while on battery, but i'm really unsure of that. i'll see if i can more reliably trigger the same problem on a clean squeeze install on the same hardware in another partition... where i'm not as worried about crashing. live well, vagrant -- To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org Archive: http://lists.debian.org/20100907000545.gx28...@claws.fglan
Bug#595187: linux-2.6: lots of OOPses on resume (NULL pointer dereference in file_ra_state_init)
Source: linux-2.6 Version: 2.6.32-21 Severity: normal Hi, Since the upgrade from 2.6.32-20 - 2.6.32-21, my laptop fails to resume properly into Xorg : the cursor only appears when I move it and there is nothing much more I can do than switching to the console, login as root and reboot. In the console, I can see a lot of OOpses that are visible in the attached log file. This did not happend with 2.6.32-20, or at least not before many suspend cycles. Here is the first of the many similar stack traces : -- Sep 1 21:29:47 annalee kernel: [ 1062.410659] BUG: unable to handle kernel NULL pointer dereference at 0020 Sep 1 21:29:47 annalee kernel: [ 1062.410665] IP: [810ba09c] file_ra_state_init+0x4/0x14 Sep 1 21:29:47 annalee kernel: [ 1062.410673] PGD 7c87e067 PUD 7c370067 PMD 0 Sep 1 21:29:47 annalee kernel: [ 1062.410678] Oops: [#1] SMP Sep 1 21:29:47 annalee kernel: [ 1062.410681] last sysfs file: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/ACPI0003:00/power_supply/AC/uevent Sep 1 21:29:47 annalee kernel: [ 1062.410685] CPU 1 Sep 1 21:29:47 annalee kernel: [ 1062.410687] Modules linked in: binfmt_misc acpi_cpufreq firewire_sbp2 firewire_core crc_itu_t loop sha256_generic aes_x86_64 aes_generic cbc dm_crypt arc4 snd_hda_codec_idt ecb iwl3945 snd_hda_intel iwlcore snd_hda_codec snd_hwdep joydev mac80211 snd_pcm snd_seq snd_timer snd_seq_device led_class battery dell_laptop snd cfg80211 soundcore psmouse snd_page_alloc i2c_i801 evdev pcspkr rfkill dcdbas wmi serio_raw ac processor ext3 jbd mbcache dm_mod sd_mod crc_t10dif i915 drm_kms_helper drm ata_generic uhci_hcd i2c_algo_bit tg3 libphy thermal ata_piix button libata ehci_hcd scsi_mod i2c_core video thermal_sys output usbcore nls_base [last unloaded: scsi_wait_scan] Sep 1 21:29:47 annalee kernel: [ 1062.410739] Pid: 2798, comm: date Not tainted 2.6.32-5-amd64 #1 Latitude D630 Sep 1 21:29:47 annalee kernel: [ 1062.410742] RIP: 0010:[810ba09c] [810ba09c] file_ra_state_init+0x4/0x14 Sep 1 21:29:47 annalee kernel: [ 1062.410747] RSP: 0018:88007df39db0 EFLAGS: 00010206 Sep 1 21:29:47 annalee kernel: [ 1062.410749] RAX: RBX: RCX: 88007c89ba80 Sep 1 21:29:47 annalee kernel: [ 1062.410752] RDX: 88007c89ba80 RSI: 88007ef53798 RDI: 88007c89baf0 Sep 1 21:29:47 annalee kernel: [ 1062.410754] RBP: 88007c89ba80 R08: R09: 880037bd9c00 Sep 1 21:29:47 annalee kernel: [ 1062.410757] R10: 88007df39e48 R11: 81151385 R12: Sep 1 21:29:47 annalee kernel: [ 1062.410759] R13: 88007ef53678 R14: 0024 R15: 810eb47c Sep 1 21:29:47 annalee kernel: [ 1062.410762] FS: () GS:88000190() knlGS: Sep 1 21:29:47 annalee kernel: [ 1062.410765] CS: 0010 DS: ES: CR0: 80050033 Sep 1 21:29:47 annalee kernel: [ 1062.410768] CR2: 0020 CR3: 7c70e000 CR4: 06e0 Sep 1 21:29:47 annalee kernel: [ 1062.410770] DR0: DR1: DR2: Sep 1 21:29:47 annalee kernel: [ 1062.410773] DR3: DR6: 0ff0 DR7: 0400 Sep 1 21:29:47 annalee kernel: [ 1062.410776] Process date (pid: 2798, threadinfo 88007df38000, task 88007db662e0) Sep 1 21:29:47 annalee kernel: [ 1062.410778] Stack: Sep 1 21:29:47 annalee kernel: [ 1062.410779] 810eb991 880037bd9c00 880037a1aa00 88007a759600 Sep 1 21:29:47 annalee kernel: [ 1062.410783] 0 88007df39e48 88007df39e48 8001 Sep 1 21:29:47 annalee kernel: [ 1062.410787] 0 0024 ff9c 810f70bb 88007df39e78 Sep 1 21:29:47 annalee kernel: [ 1062.410792] Call Trace: Sep 1 21:29:47 annalee kernel: [ 1062.410797] [810eb991] ? __dentry_open+0x1c4/0x2bf Sep 1 21:29:47 annalee kernel: [ 1062.410802] [810f70bb] ? do_filp_open+0x4e4/0x94b Sep 1 21:29:47 annalee kernel: [ 1062.410806] [810e40dd] ? virt_to_head_page+0x9/0x2a Sep 1 21:29:47 annalee kernel: [ 1062.410810] [811000c5] ? alloc_fd+0x67/0x10c Sep 1 21:29:47 annalee kernel: [ 1062.410813] [810eb6fb] ? do_sys_open+0x55/0xfc Sep 1 21:29:47 annalee kernel: [ 1062.410818] [81010b42] ? system_call_fastpath+0x16/0x1b Sep 1 21:29:47 annalee kernel: [ 1062.410821] Code: 89 d8 5b 5d 41 5c c3 53 89 f3 e8 57 98 f9 ff 85 c0 75 0f 85 db 74 0b 48 c7 05 ed 86 59 00 00 00 00 00 5b c3 90 90 90 48 8b 46 68 48 8b 40 20 48 c7 47 18 ff ff ff ff 89 47 10 c3 65 8b 04 25 98 Sep 1 21:29:47 annalee kernel: [ 1062.410853] RIP [810ba09c] file_ra_state_init+0x4/0x14 Sep 1 21:29:47 annalee kernel: [ 1062.410857] RSP 88007df39db0 Sep 1 21:29:47 annalee kernel: [ 1062.410859] CR2: 0020 Sep 1 21:29:47 annalee kernel: [ 1062.410862] ---[ end trace f71fcf8b8aa10d22 ]--- Sep 1