Re: 2.6.35-rc2 : OOPS with LTP memcg regression test run.

2010-06-10 Thread Maciej Rutecki
I created a Bugzilla entry at 
https://bugzilla.kernel.org/show_bug.cgi?id=16178
for your bug report, please add your address to the CC list in there, thanks!

On Sunday, 6 June 2010 at 17:06:54 Sachin Sant wrote:
 While executing the LTP controller tests (memcg regression) on
 a POWER6 box, I came across the following oops.
 
 Memory cgroup out of memory: kill process 9139 (memcg_test_1) score 3 or a child
 Killed process 9139 (memcg_test_1) vsz:3456kB, anon-rss:448kB, file-rss:1088kB
 Memory cgroup out of memory: kill process 9140 (memcg_test_1) score 3 or a child
 Killed process 9140 (memcg_test_1) vsz:3456kB, anon-rss:448kB, file-rss:1088kB
 Unable to handle kernel paging request for data at address 0x720072007200720
 Faulting instruction address: 0xc015b778
 Oops: Kernel access of bad area, sig: 11 [#2]
 SMP NR_CPUS=1024 NUMA pSeries
 last sysfs file: /sys/devices/system/cpu/cpu1/cache/index1/shared_cpu_map
 Modules linked in: quota_v2 quota_tree ipv6 fuse loop dm_mod sr_mod cdrom sg sd_mod crc_t10dif ibmvscsic scsi_transport_srp scsi_tgt scsi_mod
 NIP: c015b778 LR: c015b740 CTR: 
 REGS: c9812ff0 TRAP: 0300   Tainted: G  D  (2.6.35-rc2-autotest)
 MSR: 80009032 EE,ME,IR,DR  CR: 44004424  XER: 0001
 DAR: 0720072007200720, DSISR: 4000
 TASK = c5fb1100[9155] 'umount' THREAD: c981 CPU: 0
 GPR00:  c9813270 c0d3d7a0 
 GPR04: 8050 0016 0027 cf2c6870
 GPR08: 06a5 c0b16870 c0cf0140 0e7b
 GPR12: 24004428 c744 8000 f000
 GPR16:  c98138f0 002d 0027
 GPR20:  0027  c7063138
 GPR24:   c019bafc ce02e000
 GPR28: 0001 8050 c0ca6b00 0720072007200720
 NIP [c015b778] .kmem_cache_alloc+0xb0/0x13c
 LR [c015b740] .kmem_cache_alloc+0x78/0x13c
 Call Trace:
 [c9813270] [c015b740] .kmem_cache_alloc+0x78/0x13c (unreliable)
 [c9813310] [c019bafc] .alloc_buffer_head+0x2c/0x78
 [c9813390] [c019c99c] .alloc_page_buffers+0x60/0x114
 [c9813450] [c019ca78] .create_empty_buffers+0x28/0x140
 [c98134e0] [c019f2ec] .__block_prepare_write+0xe4/0x4f0
 [c9813610] [c019f94c] .block_write_begin_newtrunc+0xa8/0x120
 [c98136d0] [c019fea0] .block_write_begin+0x34/0x8c
 [c9813770] [c022b458] .ext3_write_begin+0x13c/0x298
 [c9813880] [c0117500] .generic_file_buffered_write+0x13c/0x320
 [c98139b0] [c0119c80] .__generic_file_aio_write+0x378/0x3dc
 [c9813ab0] [c0119d68] .generic_file_aio_write+0x84/0xfc
 [c9813b60] [c016e460] .do_sync_write+0xac/0x10c
 [c9813ce0] [c016f204] .vfs_write+0xd0/0x1dc
 [c9813d80] [c016f418] .SyS_write+0x58/0xa0
 [c9813e30] [c00085b4] syscall_exit+0x0/0x40
 Instruction dump:
 3860 409e0090 3800 8b8d0212 980d0212 e96d0040 e93b 7ce95a14
 7fe9582a 2fbf 419e0014 e81b001a 7c1f002a 7c09592a 481c 7f46d378
 ---[ end trace f24cb0cb5729d2bb ]---
 
 And a few more of these. The previous snapshot release,
 2.6.35-rc1-git5 (6c5de280b6...), was good.
 
 Thanks
 -Sachin
 

-- 
Maciej Rutecki
http://www.maciek.unixy.pl
___
Linuxppc-dev mailing list
Linuxppc-dev@lists.ozlabs.org
https://lists.ozlabs.org/listinfo/linuxppc-dev


Re: 2.6.35-rc2 : OOPS with LTP memcg regression test run.

2010-06-10 Thread KAMEZAWA Hiroyuki
On Thu, 10 Jun 2010 22:00:57 +0200
Maciej Rutecki maciej.rute...@gmail.com wrote:

 I created a Bugzilla entry at 
 https://bugzilla.kernel.org/show_bug.cgi?id=16178
 for your bug report, please add your address to the CC list in there, thanks!
 

Hmm... It seems a panic in SLUB or SLAB.
Is .config available ?

-Kame





Re: 2.6.35-rc2 : OOPS with LTP memcg regression test run.

2010-06-10 Thread Sachin Sant

KAMEZAWA Hiroyuki wrote:
 On Thu, 10 Jun 2010 22:00:57 +0200
 Maciej Rutecki maciej.rute...@gmail.com wrote:

  I created a Bugzilla entry at
  https://bugzilla.kernel.org/show_bug.cgi?id=16178
  for your bug report, please add your address to the CC list in there, thanks!

 Hmm... It seems a panic in SLUB or SLAB.
 Is .config available ?


I think the root cause of this problem is the same as the one
discussed in this thread (Bug kmalloc-4096 : Poison overwritten):

http://marc.info/?l=linux-kernel&m=127586004308747&w=2

I verified that the problem goes away after applying commit 386f40c.

Thanks
-Sachin 



--

-
Sachin Sant
IBM Linux Technology Center
India Systems and Technology Labs
Bangalore, India
-



2.6.35-rc2 : OOPS with LTP memcg regression test run.

2010-06-06 Thread Sachin Sant

While executing the LTP controller tests (memcg regression) on
a POWER6 box, I came across the following oops.

Memory cgroup out of memory: kill process 9139 (memcg_test_1) score 3 or a child
Killed process 9139 (memcg_test_1) vsz:3456kB, anon-rss:448kB, file-rss:1088kB
Memory cgroup out of memory: kill process 9140 (memcg_test_1) score 3 or a child
Killed process 9140 (memcg_test_1) vsz:3456kB, anon-rss:448kB, file-rss:1088kB
Unable to handle kernel paging request for data at address 0x720072007200720
Faulting instruction address: 0xc015b778
Oops: Kernel access of bad area, sig: 11 [#2]
SMP NR_CPUS=1024 NUMA pSeries
last sysfs file: /sys/devices/system/cpu/cpu1/cache/index1/shared_cpu_map
Modules linked in: quota_v2 quota_tree ipv6 fuse loop dm_mod sr_mod cdrom sg 
sd_mod crc_t10dif ibmvscsic scsi_transport_srp scsi_tgt scsi_mod
NIP: c015b778 LR: c015b740 CTR: 
REGS: c9812ff0 TRAP: 0300   Tainted: G  D  (2.6.35-rc2-autotest)
MSR: 80009032 EE,ME,IR,DR  CR: 44004424  XER: 0001
DAR: 0720072007200720, DSISR: 4000
TASK = c5fb1100[9155] 'umount' THREAD: c981 CPU: 0
GPR00:  c9813270 c0d3d7a0 
GPR04: 8050 0016 0027 cf2c6870
GPR08: 06a5 c0b16870 c0cf0140 0e7b
GPR12: 24004428 c744 8000 f000
GPR16:  c98138f0 002d 0027
GPR20:  0027  c7063138
GPR24:   c019bafc ce02e000
GPR28: 0001 8050 c0ca6b00 0720072007200720
NIP [c015b778] .kmem_cache_alloc+0xb0/0x13c
LR [c015b740] .kmem_cache_alloc+0x78/0x13c
Call Trace:
[c9813270] [c015b740] .kmem_cache_alloc+0x78/0x13c (unreliable)
[c9813310] [c019bafc] .alloc_buffer_head+0x2c/0x78
[c9813390] [c019c99c] .alloc_page_buffers+0x60/0x114
[c9813450] [c019ca78] .create_empty_buffers+0x28/0x140
[c98134e0] [c019f2ec] .__block_prepare_write+0xe4/0x4f0
[c9813610] [c019f94c] .block_write_begin_newtrunc+0xa8/0x120
[c98136d0] [c019fea0] .block_write_begin+0x34/0x8c
[c9813770] [c022b458] .ext3_write_begin+0x13c/0x298
[c9813880] [c0117500] .generic_file_buffered_write+0x13c/0x320
[c98139b0] [c0119c80] .__generic_file_aio_write+0x378/0x3dc
[c9813ab0] [c0119d68] .generic_file_aio_write+0x84/0xfc
[c9813b60] [c016e460] .do_sync_write+0xac/0x10c
[c9813ce0] [c016f204] .vfs_write+0xd0/0x1dc
[c9813d80] [c016f418] .SyS_write+0x58/0xa0
[c9813e30] [c00085b4] syscall_exit+0x0/0x40
Instruction dump:
3860 409e0090 3800 8b8d0212 980d0212 e96d0040 e93b 7ce95a14
7fe9582a 2fbf 419e0014 e81b001a 7c1f002a 7c09592a 481c 7f46d378
---[ end trace f24cb0cb5729d2bb ]---

And a few more of these. The previous snapshot release,
2.6.35-rc1-git5 (6c5de280b6...), was good.

Thanks
-Sachin


--

-
Sachin Sant
IBM Linux Technology Center
India Systems and Technology Labs
Bangalore, India
-
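
Editorial note: the NIP and LR lines in the oops above both point into kmem_cache_alloc. Subtracting each symbol offset from its address should give the same function start, which is a quick consistency check on a trace. The following is a minimal sketch of that arithmetic; the helper name parse_frame and the regular expression are illustrative assumptions, not part of any kernel tooling, and the addresses are used exactly as they appear in this archive.

```python
import re

# One symbolized frame: "[address] .symbol+0xOFFSET/0xSIZE".
# The leading dot on the symbol (seen in the powerpc trace) is optional.
FRAME_RE = re.compile(
    r"\[(?P<addr>[0-9a-f]+)\]\s+\.?(?P<sym>\w+)"
    r"\+0x(?P<off>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
)


def parse_frame(line):
    """Hypothetical helper: parse one frame line, or return None if it does not match."""
    m = FRAME_RE.search(line)
    if m is None:
        return None
    addr = int(m.group("addr"), 16)
    off = int(m.group("off"), 16)
    return {
        "symbol": m.group("sym"),
        "address": addr,
        "offset": off,
        "size": int(m.group("size"), 16),
        # Start of the function = faulting address minus offset into it.
        "function_start": addr - off,
    }


nip = parse_frame("NIP [c015b778] .kmem_cache_alloc+0xb0/0x13c")
lr = parse_frame("LR [c015b740] .kmem_cache_alloc+0x78/0x13c")

# Both frames should resolve to the same start of kmem_cache_alloc.
print(nip["symbol"], hex(nip["function_start"]))
print(lr["symbol"], hex(lr["function_start"]))
```

Both frames agree on the function start, so the fault is 0xb0 bytes into kmem_cache_alloc, 0x38 bytes past the LR return address.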



Re: 2.6.35-rc2 : OOPS with LTP memcg regression test run.

2010-06-06 Thread Al Viro
On Sun, Jun 06, 2010 at 08:36:54PM +0530, Sachin Sant wrote:

 And few more of these. Previous snapshot release 
 2.6.35-rc1-git5(6c5de280b6...)
 was good.

That's very odd, since
; git diff --stat 6c5de280b6..v2.6.35-rc2
 Makefile                             |    2 +-
 drivers/gpu/drm/i915/intel_display.c |    9 +++
 fs/ext4/inode.c                      |   40 +++--
 fs/ext4/move_extent.c                |    3 ++
 4 files changed, 36 insertions(+), 18 deletions(-)
;
and none of that looks like a good candidate...
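
Editorial note: the diffstat above can be sanity-checked by adding up the per-file change counts and comparing them with the summary line. The sketch below does that on the text as quoted in this message; the function name check_diffstat is an illustrative assumption, and the regular expressions only cover the plain "path | count" form shown here.

```python
import re

# Diffstat as quoted in the message (histogram bars are truncated in the archive).
DIFFSTAT = """\
 Makefile                             |    2 +-
 drivers/gpu/drm/i915/intel_display.c |    9 +++
 fs/ext4/inode.c                      |   40 +++--
 fs/ext4/move_extent.c                |    3 ++
 4 files changed, 36 insertions(+), 18 deletions(-)
"""

FILE_RE = re.compile(r"\s*([^\s|]+)\s*\|\s*(\d+)")
SUMMARY_RE = re.compile(
    r"(\d+) files? changed, (\d+) insertions?\(\+\), (\d+) deletions?\(-\)"
)


def check_diffstat(text):
    """Hypothetical helper: return (per-file counts, (files, insertions, deletions))."""
    per_file = {}
    summary = None
    for line in text.splitlines():
        m = FILE_RE.match(line)
        if m:
            per_file[m.group(1)] = int(m.group(2))
            continue
        m = SUMMARY_RE.search(line)
        if m:
            summary = tuple(int(g) for g in m.groups())
    return per_file, summary


per_file, (files, insertions, deletions) = check_diffstat(DIFFSTAT)

# The per-file totals should equal insertions + deletions in the summary.
print(len(per_file) == files)
print(sum(per_file.values()) == insertions + deletions)
```

The four per-file counts add up to 54, matching 36 insertions plus 18 deletions, so the quoted diffstat is complete, and it touches nothing under mm/.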


Re: 2.6.35-rc2 : OOPS with LTP memcg regression test run.

2010-06-06 Thread Markus Trippelsdorf

  And few more of these. Previous snapshot release
  2.6.35-rc1-git5(6c5de280b6...)
  was good.

 That's very odd, since
 ; git diff --stat 6c5de280b6..v2.6.35-rc2
  Makefile |2 +-
  drivers/gpu/drm/i915/intel_display.c |9 +++
  fs/ext4/inode.c  |   40 +++--
  fs/ext4/move_extent.c|3 ++
  4 files changed, 36 insertions(+), 18 deletions(-)

 and nothing of that looks like good candidates...

I may have the same problem on my machine.
(See also the thread: ext4 2.6.35-rc2 regression (ext4: Make sure the MOVE_EXT 
ioctl...))

general protection fault:  [#1] SMP
last sysfs file: /sys/devices/pci:00/:00:11.0/host2/target2:0:0/2:0:0:0/block/sdb/size
CPU 2
Pid: 1683, comm: iptables-restor Not tainted 2.6.35-rc2-00033-gcc1f375 #46 M4A78T-E/System Product Name
RIP: 0010:[810cc6e6]  [810cc6e6] kmem_cache_alloc+0x59/0xda
RSP: 0018:88011c993d78  EFLAGS: 00010002
RAX:  RBX: 0720072007200720 RCX: 810bd4c9
RDX: 7f076cee3000 RSI: 00d0 RDI: 88011fc01800
RBP: 88011c993db8 R08: 880001b13f48 R09: 
R10: 88011d387c00 R11: 88011c983930 R12: 88011fc01800
R13: 0202 R14: 00d0 R15: 00d0
FS:  7f076dc43700() GS:880001b0() knlGS:
CS:  0010 DS:  ES:  CR0: 8005003b
CR2: 7f8595d364f8 CR3: 00011b8b CR4: 06e0
DR0:  DR1:  DR2: 
DR3:  DR6: 0ff0 DR7: 0400
Process iptables-restor (pid: 1683, threadinfo 88011c992000, task 88011ec09610)
Stack:
88011d387c10 88011c983930 88011c993d98 fffa
88011d387bd0 7f076cee3000 88011f77ea40 
88011c993e08 810bd4c9 88011b8f5cc0 810bd639
Call Trace:
[810bd4c9] __split_vma+0x33/0x18d
[810bd639] ? vma_merge+0x16/0x1fc
[810bdc01] split_vma+0x23/0x28
[810bf572] mprotect_fixup+0x146/0x54c
[810befff] ? do_mmap_pgoff+0x2a4/0x2fe
[810bfaf0] sys_mprotect+0x178/0x1f4
[8102b93b] system_call_fastpath+0x16/0x1b
Code: 65 4c 8b 04 25 88 d4 00 00 48 8b 07 49 01 c0 49 8b 18 48 85 db 75 10 83 ca ff 44 89 f6 e8 58 fa ff ff 48 89 c3 eb 0b 48 63 47 18 48 8b 04 03 49 89 00 41 55 9d 48 85 db 74 15 41 81 e6 00 80 00
RIP  [810cc6e6] kmem_cache_alloc+0x59/0xda
RSP 88011c993d78
---[ end trace e2fb1ccd3cb9dd77 ]---
-- 
Markus
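
Editorial note: the x86 oops above has RBX = 0720072007200720, the same value reported as the DAR in the POWER6 oops. That is the strongest hint in the thread that both machines hit the same corruption. The sketch below splits the value into 16-bit words to show the repeating pattern; the function name split_words is an illustrative assumption. Reading 0x0720 as a blank text-console cell (attribute 0x07, character 0x20, an ASCII space) is an interpretation not stated in the thread.

```python
def split_words(value, word_bits=16, total_bits=64):
    """Hypothetical helper: split an integer into fixed-width words, most significant first."""
    mask = (1 << word_bits) - 1
    return [
        (value >> shift) & mask
        for shift in range(total_bits - word_bits, -1, -word_bits)
    ]


DAR = 0x0720072007200720  # faulting data address from the POWER6 oops
RBX = 0x0720072007200720  # register value from the x86-64 oops

words = split_words(DAR)
same_pattern = len(set(words)) == 1 and DAR == RBX

# Split one word into its two bytes.
high_byte, low_byte = words[0] >> 8, words[0] & 0xFF

print([hex(w) for w in words], same_pattern)
# Assumption: 0x20 is read as an ASCII space character.
print(hex(high_byte), hex(low_byte), repr(chr(low_byte)))
```

A pointer made entirely of one repeated 16-bit word is not a plausible kernel address, which fits the later finding in this thread that something had overwritten slab memory.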