Re: [BUG] 2.6.25-rc5-mm1 kernel panic with Exception: 501 on powerpc

2008-03-13 Thread Kamalesh Babulal
Matthew Wilcox wrote:
 On Thu, Mar 13, 2008 at 09:26:09AM +1100, Michael Ellerman wrote:
 down() looks OK, but there's still a spin_lock_irq() in __down_common(),
 although I don't know if it makes sense for us to be in __down() at that
 stage.
 
 The spin_lock_irq in __down_common is correct.  We're going to schedule(),
 so we spin_unlock_irq() to save us passing the flags into the helper
 function.  If we had interrupts disabled on entry, there's an Aieee
 for that.
 

Hi All,

Sorry for all the noise made :-(, something wrong in the test setup from my 
end, 
the kernel was 2.6.25-rc3-mm1 not 2.6.25-rc5-mm1. This bug is not seen in the
2.6.25-rc5-mm1 kernel.

-- 
Thanks  Regards,
Kamalesh Babulal,
___
Linuxppc-dev mailing list
Linuxppc-dev@ozlabs.org
https://ozlabs.org/mailman/listinfo/linuxppc-dev


Re: [BUG] 2.6.25-rc5-mm1 kernel panic with Exception: 501 on powerpc

2008-03-12 Thread Andrew Morton
On Wed, 12 Mar 2008 18:25:37 +0530 Kamalesh Babulal [EMAIL PROTECTED] wrote:

 Hi Andrew,
 
 The 2.6.25-rc5-mm1 kernel panics while bootup on powerpc
 
 returning from prom_init
 Unable to handle kernel paging request for data at address 0x
 Faulting instruction address: 0xc000d5dc
 cpu 0x0: Vector: 300 (Data Access) at [c07636e0]
 pc: c000d5dc: .do_IRQ+0x74/0x1f4
 lr: c000d5a8: .do_IRQ+0x40/0x1f4
 sp: c0763960
msr: 80001032
dar: 0
  dsisr: 4000
   current = 0xc0688e60
   paca= 0xc0689900
 pid   = 0, comm = swapper
 enter ? for help
 [c0763a00] c0004c24 hardware_interrupt_entry+0x24/0x28
 --- Exception: 501 (Hardware Interrupt) at c06021b0 
 .free_bootmem_core+0x94/0xcc
 [link register   ] c060373c 
 .free_bootmem_with_active_regions+0x78/0xb8
 [c0763cf0] c0602610 .init_bootmem_core+0x5c/0xfc (unreliable)
 [c0763d80] c05eb68c .do_init_bootmem+0x964/0xaf0
 [c0763e50] c05e03b0 .setup_arch+0x1a4/0x218
 [c0763ee0] c05d76bc .start_kernel+0xe8/0x424
 [c0763f90] c0008590 .start_here_common+0x60/0xd0
 

Beats me.  Maybe we're still enabling interrupts too early.  But the new
semaphore code got fixed (didn't it?)

___
Linuxppc-dev mailing list
Linuxppc-dev@ozlabs.org
https://ozlabs.org/mailman/listinfo/linuxppc-dev


Re: [BUG] 2.6.25-rc5-mm1 kernel panic with Exception: 501 on powerpc

2008-03-12 Thread Matthew Wilcox
On Wed, Mar 12, 2008 at 10:46:45AM -0700, Andrew Morton wrote:
 On Wed, 12 Mar 2008 18:25:37 +0530 Kamalesh Babulal [EMAIL PROTECTED] wrote:
 Beats me.  Maybe we're still enabling interrupts too early.  But the new
 semaphore code got fixed (didn't it?)

On the 7th, according to my records.  Easy to check -- look in
kernel/semaphore.c and see whether down() is using spin_lock_irqsave
(good) or spin_lock_irq (bad).

-- 
Intel are signing my paycheques ... these opinions are still mine
Bill, look, we understand that you're interested in selling us this
operating system, but compare it to ours.  We can't possibly take such
a retrograde step.
___
Linuxppc-dev mailing list
Linuxppc-dev@ozlabs.org
https://ozlabs.org/mailman/listinfo/linuxppc-dev


Re: [BUG] 2.6.25-rc5-mm1 kernel panic with Exception: 501 on powerpc

2008-03-12 Thread Badari Pulavarty
On Wed, 2008-03-12 at 18:25 +0530, Kamalesh Babulal wrote:
 Hi Andrew,
 
 The 2.6.25-rc5-mm1 kernel panics while bootup on powerpc
 
 returning from prom_init
 Unable to handle kernel paging request for data at address 0x
 Faulting instruction address: 0xc000d5dc
 cpu 0x0: Vector: 300 (Data Access) at [c07636e0]
 pc: c000d5dc: .do_IRQ+0x74/0x1f4
 lr: c000d5a8: .do_IRQ+0x40/0x1f4
 sp: c0763960
msr: 80001032
dar: 0
  dsisr: 4000
   current = 0xc0688e60
   paca= 0xc0689900
 pid   = 0, comm = swapper
 enter ? for help
 [c0763a00] c0004c24 hardware_interrupt_entry+0x24/0x28
 --- Exception: 501 (Hardware Interrupt) at c06021b0 
 .free_bootmem_core+0x94/0xcc
 [link register   ] c060373c 
 .free_bootmem_with_active_regions+0x78/0xb8
 [c0763cf0] c0602610 .init_bootmem_core+0x5c/0xfc (unreliable)
 [c0763d80] c05eb68c .do_init_bootmem+0x964/0xaf0
 [c0763e50] c05e03b0 .setup_arch+0x1a4/0x218
 [c0763ee0] c05d76bc .start_kernel+0xe8/0x424
 [c0763f90] c0008590 .start_here_common+0x60/0xd0

Is this only on one machine ? happens all the time ?

I ran into similar issues on rc3-mm1. rc5-mm1 seems to be working fine
for me on ppc64.

Thanks,
Badari

___
Linuxppc-dev mailing list
Linuxppc-dev@ozlabs.org
https://ozlabs.org/mailman/listinfo/linuxppc-dev


Re: [BUG] 2.6.25-rc5-mm1 kernel panic with Exception: 501 on powerpc

2008-03-12 Thread Benjamin Herrenschmidt

 Beats me.  Maybe we're still enabling interrupts too early.  But the new
 semaphore code got fixed (didn't it?)

Won't lockdep/irqtrace warn if that happens ? You don't yet have the
lockdep patches for ppc64 (I'm still trying to find out why they break
iSeries) but it should warn of such a spurrious IRQ enable on other
archs too... At least, from a quick look at the code, it -seems- that it
does have such a test.

Cheers,
Ben.


___
Linuxppc-dev mailing list
Linuxppc-dev@ozlabs.org
https://ozlabs.org/mailman/listinfo/linuxppc-dev


Re: [BUG] 2.6.25-rc5-mm1 kernel panic with Exception: 501 on powerpc

2008-03-12 Thread Michael Ellerman
On Wed, 2008-03-12 at 11:51 -0600, Matthew Wilcox wrote:
 On Wed, Mar 12, 2008 at 10:46:45AM -0700, Andrew Morton wrote:
  On Wed, 12 Mar 2008 18:25:37 +0530 Kamalesh Babulal [EMAIL PROTECTED] 
  wrote:
  Beats me.  Maybe we're still enabling interrupts too early.  But the new
  semaphore code got fixed (didn't it?)
 
 On the 7th, according to my records.  Easy to check -- look in
 kernel/semaphore.c and see whether down() is using spin_lock_irqsave
 (good) or spin_lock_irq (bad).

down() looks OK, but there's still a spin_lock_irq() in __down_common(),
although I don't know if it makes sense for us to be in __down() at that
stage.

cheers

-- 
Michael Ellerman
OzLabs, IBM Australia Development Lab

wwweb: http://michael.ellerman.id.au
phone: +61 2 6212 1183 (tie line 70 21183)

We do not inherit the earth from our ancestors,
we borrow it from our children. - S.M.A.R.T Person


signature.asc
Description: This is a digitally signed message part
___
Linuxppc-dev mailing list
Linuxppc-dev@ozlabs.org
https://ozlabs.org/mailman/listinfo/linuxppc-dev