Re: [beagleboard] EHCI softirq kernel panic

2011-08-10 Thread Alan Stern
On Wed, 10 Aug 2011, Felipe Balbi wrote:

 Hi,
 
 On Tue, Aug 09, 2011 at 02:30:14PM -0400, Jason Kridner wrote:
  On Tue, Aug 9, 2011 at 1:51 PM, Joel A Fernandes agnel.j...@gmail.com 
  wrote:
   Anyone seen this before?
  
  A lot of the kernel developers don't frequent the beagleboard list.
  If you think it is a general kernel bug, I suspect you want to copy
  linux-omap.
 
 and linux-usb, and Alan Stern as he's the EHCI maintainer and myself for
 the OMAP USB part ;-)
 
   Trying to boot 3.0.0 with OE patches from an SD Card, and with a
   network cable connected results in the following traceback.
   Not connecting a network cable makes the errors go away.

   [ � 99.084899] Unable to handle kernel NULL pointer dereference at
   virtual address 
   [ � 99.093383] pgd = c0004000
   [ � 99.096191] [] *pgd=
   [ � 99.099945] Internal error: Oops: 17 [#2]
   [ � 99.104125] Modules linked in: ipv6
   [ � 99.107788] CPU: 0 � �Tainted: G � � �D � � �(3.0.0+ #1)
   [ � 99.113342] PC is at ehci_quiesce+0xc/0x94
   [ � 99.117614] LR is at ehci_stop+0x34/0x8c
   [ � 99.121734] pc : [c0325ce4] � �lr : [c0328bfc] � �psr: 21d3
   [ � 99.121734] sp : c064de70 �ip : 0108 �fp : c06b8624
   [ � 99.133728] r10: c064dec0 �r9 :  �r8 : dee08504
   [ � 99.139190] r7 : c0328b94 �r6 : 0100 �r5 : dee08504 �r4 : dee08608
   [ � 99.145996] r3 :  �r2 : dee086ec �r1 : dee086b8 �r0 : dee08608
   [ � 99.152832] Flags: nzCv �IRQs off �FIQs off �Mode SVC_32 �ISA ARM
   Segment kernel
   [ � 99.160644] Control: 10c5387d �Table: 9d804019 �DAC: 0015
   [ � 99.166656] Process swapper (pid: 0, stack limit = 0xc064c2f0)
   [ � 99.172760] Stack: (0xc064de70 to 0xc064e000)

   [ � 99.288482] [c0325ce4] (ehci_quiesce+0xc/0x94) from [c0328bfc]
   (ehci_stop+0x34/0x8c)
   [ � 99.296936] [c0328bfc] (ehci_stop+0x34/0x8c) from [c007a3d4]
   (run_timer_softirq+0x15c/0x1f8)
   [ � 99.306121] [c007a3d4] (run_timer_softirq+0x15c/0x1f8) from
   [c064dec0] (0xc064dec0)
   [ � 99.314483] Code: c05d7f9a e92d4073 e1a04000 e5903004 (e5933000)
   [ � 99.320892] ---[ end trace 4ae88755f08e391f ]---
   [ � 99.325714] Kernel panic - not syncing: Fatal exception in interrupt

I'm puzzled.  Why is ehci_stop getting called in a softirq context?  
That should never happen.  It should get called only when the driver is 
unbound from the controller.

Alan Stern

--
To unsubscribe from this list: send the line unsubscribe linux-omap in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [beagleboard] EHCI softirq kernel panic

2011-08-10 Thread Felipe Balbi
Hi,

On Wed, Aug 10, 2011 at 10:11:48AM -0400, Alan Stern wrote:
 On Wed, 10 Aug 2011, Felipe Balbi wrote:
 
  Hi,
  
  On Tue, Aug 09, 2011 at 02:30:14PM -0400, Jason Kridner wrote:
   On Tue, Aug 9, 2011 at 1:51 PM, Joel A Fernandes agnel.j...@gmail.com 
   wrote:
Anyone seen this before?
   
   A lot of the kernel developers don't frequent the beagleboard list.
   If you think it is a general kernel bug, I suspect you want to copy
   linux-omap.
  
  and linux-usb, and Alan Stern as he's the EHCI maintainer and myself for
  the OMAP USB part ;-)
  
Trying to boot 3.0.0 with OE patches from an SD Card, and with a
network cable connected results in the following traceback.
Not connecting a network cable makes the errors go away.
 
[ ? 99.084899] Unable to handle kernel NULL pointer dereference at
virtual address 
[ ? 99.093383] pgd = c0004000
[ ? 99.096191] [] *pgd=
[ ? 99.099945] Internal error: Oops: 17 [#2]
[ ? 99.104125] Modules linked in: ipv6
[ ? 99.107788] CPU: 0 ? ?Tainted: G ? ? ?D ? ? ?(3.0.0+ #1)
[ ? 99.113342] PC is at ehci_quiesce+0xc/0x94
[ ? 99.117614] LR is at ehci_stop+0x34/0x8c
[ ? 99.121734] pc : [c0325ce4] ? ?lr : [c0328bfc] ? ?psr: 21d3
[ ? 99.121734] sp : c064de70 ?ip : 0108 ?fp : c06b8624
[ ? 99.133728] r10: c064dec0 ?r9 :  ?r8 : dee08504
[ ? 99.139190] r7 : c0328b94 ?r6 : 0100 ?r5 : dee08504 ?r4 : 
dee08608
[ ? 99.145996] r3 :  ?r2 : dee086ec ?r1 : dee086b8 ?r0 : 
dee08608
[ ? 99.152832] Flags: nzCv ?IRQs off ?FIQs off ?Mode SVC_32 ?ISA ARM
Segment kernel
[ ? 99.160644] Control: 10c5387d ?Table: 9d804019 ?DAC: 0015
[ ? 99.166656] Process swapper (pid: 0, stack limit = 0xc064c2f0)
[ ? 99.172760] Stack: (0xc064de70 to 0xc064e000)
 
[ ? 99.288482] [c0325ce4] (ehci_quiesce+0xc/0x94) from [c0328bfc]
(ehci_stop+0x34/0x8c)
[ ? 99.296936] [c0328bfc] (ehci_stop+0x34/0x8c) from [c007a3d4]
(run_timer_softirq+0x15c/0x1f8)
[ ? 99.306121] [c007a3d4] (run_timer_softirq+0x15c/0x1f8) from
[c064dec0] (0xc064dec0)
[ ? 99.314483] Code: c05d7f9a e92d4073 e1a04000 e5903004 (e5933000)
[ ? 99.320892] ---[ end trace 4ae88755f08e391f ]---
[ ? 99.325714] Kernel panic - not syncing: Fatal exception in interrupt
 
 I'm puzzled.  Why is ehci_stop getting called in a softirq context?  
 That should never happen.  It should get called only when the driver is 
 unbound from the controller.

Maybe some OpenEmbedded patch which changed the behavior and ended up
breaking the driver ?

-- 
balbi


signature.asc
Description: Digital signature


Re: [beagleboard] EHCI softirq kernel panic

2011-08-10 Thread Joel A Fernandes
On Wed, Aug 10, 2011 at 4:35 PM, Felipe Balbi ba...@ti.com wrote:
 Hi,

 On Wed, Aug 10, 2011 at 10:11:48AM -0400, Alan Stern wrote:
 On Wed, 10 Aug 2011, Felipe Balbi wrote:

  Hi,
 
  On Tue, Aug 09, 2011 at 02:30:14PM -0400, Jason Kridner wrote:
   On Tue, Aug 9, 2011 at 1:51 PM, Joel A Fernandes agnel.j...@gmail.com 
   wrote:
Anyone seen this before?
  
   A lot of the kernel developers don't frequent the beagleboard list.
   If you think it is a general kernel bug, I suspect you want to copy
   linux-omap.
 
  and linux-usb, and Alan Stern as he's the EHCI maintainer and myself for
  the OMAP USB part ;-)
 
Trying to boot 3.0.0 with OE patches from an SD Card, and with a
network cable connected results in the following traceback.
Not connecting a network cable makes the errors go away.

[ ? 99.084899] Unable to handle kernel NULL pointer dereference at
virtual address 
[ ? 99.093383] pgd = c0004000
[ ? 99.096191] [] *pgd=
[ ? 99.099945] Internal error: Oops: 17 [#2]
[ ? 99.104125] Modules linked in: ipv6
[ ? 99.107788] CPU: 0 ? ?Tainted: G ? ? ?D ? ? ?(3.0.0+ #1)
[ ? 99.113342] PC is at ehci_quiesce+0xc/0x94
[ ? 99.117614] LR is at ehci_stop+0x34/0x8c
[ ? 99.121734] pc : [c0325ce4] ? ?lr : [c0328bfc] ? ?psr: 21d3
[ ? 99.121734] sp : c064de70 ?ip : 0108 ?fp : c06b8624
[ ? 99.133728] r10: c064dec0 ?r9 :  ?r8 : dee08504
[ ? 99.139190] r7 : c0328b94 ?r6 : 0100 ?r5 : dee08504 ?r4 : 
dee08608
[ ? 99.145996] r3 :  ?r2 : dee086ec ?r1 : dee086b8 ?r0 : 
dee08608
[ ? 99.152832] Flags: nzCv ?IRQs off ?FIQs off ?Mode SVC_32 ?ISA ARM
Segment kernel
[ ? 99.160644] Control: 10c5387d ?Table: 9d804019 ?DAC: 0015
[ ? 99.166656] Process swapper (pid: 0, stack limit = 0xc064c2f0)
[ ? 99.172760] Stack: (0xc064de70 to 0xc064e000)

[ ? 99.288482] [c0325ce4] (ehci_quiesce+0xc/0x94) from [c0328bfc]
(ehci_stop+0x34/0x8c)
[ ? 99.296936] [c0328bfc] (ehci_stop+0x34/0x8c) from [c007a3d4]
(run_timer_softirq+0x15c/0x1f8)
[ ? 99.306121] [c007a3d4] (run_timer_softirq+0x15c/0x1f8) from
[c064dec0] (0xc064dec0)
[ ? 99.314483] Code: c05d7f9a e92d4073 e1a04000 e5903004 (e5933000)
[ ? 99.320892] ---[ end trace 4ae88755f08e391f ]---
[ ? 99.325714] Kernel panic - not syncing: Fatal exception in interrupt

 I'm puzzled.  Why is ehci_stop getting called in a softirq context?
 That should never happen.  It should get called only when the driver is
 unbound from the controller.

 Maybe some OpenEmbedded patch which changed the behavior and ended up
 breaking the driver ?


Hi Felipe,

Thanks for looking into this.

It could be the 1GHz OPP patch in OE, after reverting it [1] I haven't
seen this issue since.

Thanks,
Joel

[1] 
https://github.com/joelagnel/meta-texasinstruments/commit/95fc251b1aeafc1ef774659a8e8654e11b620778
--
To unsubscribe from this list: send the line unsubscribe linux-omap in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [beagleboard] EHCI softirq kernel panic

2011-08-09 Thread Jason Kridner
On Tue, Aug 9, 2011 at 1:51 PM, Joel A Fernandes agnel.j...@gmail.com wrote:
 Anyone seen this before?

A lot of the kernel developers don't frequent the beagleboard list.
If you think it is a general kernel bug, I suspect you want to copy
linux-omap.


 Trying to boot 3.0.0 with OE patches from an SD Card, and with a
 network cable connected results in the following traceback.
 Not connecting a network cable makes the errors go away.

 [   83.386779] Process gtk-update-icon (pid: 351, stack limit = 0xdd95c2f0)
 [   83.393798] Stack: (0xdd95dfb0 to 0xdd95e000)
 [   83.398345] dfa0:                                     007f9225
 007ba664 00bf 2fff
 [   83.406890] dfc0: 00ff2f2f 00ff 00ff 00ff 00ff
 00ff  00799225
 [   83.415435] dfe0: 00ff be9f8780 00ff 4027e084 2010
   
 [   83.423980] Code: bad PC value
 [   83.427490] ---[ end trace 4ae88755f08e391e ]---
 [   83.434570] S98configure[58]: Segmentation fault
 [   83.840148] S98configure[58]: Segmentation fault
 [   83.867004] S98configure[58]: gtk-update-icon-cache: Failed to open
 file /usr/share/icons/gnome/.icon-theme.cache : File exists
 [   89.605407] S98configure[58]: gtk-update-icon-cache: Failed to open
 file /usr/share/icons/HighContrastLargePrint/.icon-theme.cache : File
 exists
 [   89.932525] S98configure[58]: gtk-update-icon-cache: Failed to open
 file /usr/share/icons/Mist/.icon-theme.cache : File exists
 [   89.967773] S98configure[58]: gtk-update-icon-cache: Failed to open
 file /usr/share/icons/gnome/.icon-theme.cache : File exists
 [   91.477386] S98configure[58]: gtk-update-icon-cache: Failed to open
 file /usr/share/icons/HighContrastLargePrint/.icon-theme.cache : File
 exists
 [   91.788269] S98configure[58]: gtk-update-icon-cache: Failed to open
 file /usr/share/icons/Mist/.icon-theme.cache : File exists
 [   91.822631] S98configure[58]: gtk-update-icon-cache: Failed to open
 file /usr/share/icons/gnome/.icon-theme.cache : File exists
 [   94.313598] S98configure[58]: gtk-update-icon-cache: Failed to open
 file /usr/share/icons/HighContrastLargePrint/.icon-theme.cache : File
 exists
 [   95.381011] S98configure[58]: gtk-update-icon-cache: Failed to open
 file /usr/share/icons/Mist/.icon-theme.cache : File exists
 [   95.415405] S98configure[58]: gtk-update-icon-cache: Failed to open
 file /usr/share/icons/gnome/.icon-theme.cache : File exists
 [   99.084899] Unable to handle kernel NULL pointer dereference at
 virtual address 
 [   99.093383] pgd = c0004000
 [   99.096191] [] *pgd=
 [   99.099945] Internal error: Oops: 17 [#2]
 [   99.104125] Modules linked in: ipv6
 [   99.107788] CPU: 0    Tainted: G      D      (3.0.0+ #1)
 [   99.113342] PC is at ehci_quiesce+0xc/0x94
 [   99.117614] LR is at ehci_stop+0x34/0x8c
 [   99.121734] pc : [c0325ce4]    lr : [c0328bfc]    psr: 21d3
 [   99.121734] sp : c064de70  ip : 0108  fp : c06b8624
 [   99.133728] r10: c064dec0  r9 :   r8 : dee08504
 [   99.139190] r7 : c0328b94  r6 : 0100  r5 : dee08504  r4 : dee08608
 [   99.145996] r3 :   r2 : dee086ec  r1 : dee086b8  r0 : dee08608
 [   99.152832] Flags: nzCv  IRQs off  FIQs off  Mode SVC_32  ISA ARM
 Segment kernel
 [   99.160644] Control: 10c5387d  Table: 9d804019  DAC: 0015
 [   99.166656] Process swapper (pid: 0, stack limit = 0xc064c2f0)
 [   99.172760] Stack: (0xc064de70 to 0xc064e000)
 [   99.177307] de60:                                     dee08608
 dee086b8 dee08608 dee08504
 [   99.185852] de80: 0100 c0328bfc 0001 a153 dee08504
 c007a3d4 00200200 6153
 [   99.194396] dea0: dee08504 c007a3d4 00200200 c06b87a0 c064c000
 c007a3d4 c064c000 0003
 [   99.202941] dec0: c064dec0 c064dec0 000a 0001 c06b8628
 c064c000 0100 c06b8600
 [   99.211517] dee0: 000a c0075ec0 a246 c0677bdc 
 0001 0003 0025
 [   99.220062] df00:  c0654258 0003 0003 413fc082
   c0076290
 [   99.228607] df20: 0025 c0040064 6053  fa20
 c0044ff8 00051f4d 
 [   99.237152] df40: 00051f4d  c06a662c c0654258 c0654258
 0003 0003 413fc082
 [   99.245697] df60:    c064df80 c0054dcc
 c0054dd8 6053 
 [   99.254272] df80:  00051f4d 0063 04aae30b 0063
 04a5c3be 0001 
 [   99.262817] dfa0: c0654248 c0654258 c06d7a6c c0377404 c064c000
 c0652254 c06a5d04 c065224c
 [   99.271362] dfc0: 80004059 c0045f40 c064e9b4 c003341c c0ae1140
 c0008868 c00082c8 060a
 [   99.279907] dfe0: 8100 c003341c 10c53c7d c064e060 c0033418
 8000803c  
 [   99.288482] [c0325ce4] (ehci_quiesce+0xc/0x94) from [c0328bfc]
 (ehci_stop+0x34/0x8c)
 [   99.296936] [c0328bfc] (ehci_stop+0x34/0x8c) from [c007a3d4]
 (run_timer_softirq+0x15c/0x1f8)
 [   99.306121] [c007a3d4] (run_timer_softirq+0x15c/0x1f8) from
 [c064dec0] (0xc064dec0)
 [   99.314483] Code: c05d7f9a e92d4073 

Re: [beagleboard] EHCI softirq kernel panic

2011-08-09 Thread Felipe Balbi
Hi,

On Tue, Aug 09, 2011 at 02:30:14PM -0400, Jason Kridner wrote:
 On Tue, Aug 9, 2011 at 1:51 PM, Joel A Fernandes agnel.j...@gmail.com wrote:
  Anyone seen this before?
 
 A lot of the kernel developers don't frequent the beagleboard list.
 If you think it is a general kernel bug, I suspect you want to copy
 linux-omap.

and linux-usb, and Alan Stern as he's the EHCI maintainer and myself for
the OMAP USB part ;-)

  Trying to boot 3.0.0 with OE patches from an SD Card, and with a
  network cable connected results in the following traceback.
  Not connecting a network cable makes the errors go away.
 
  [   83.386779] Process gtk-update-icon (pid: 351, stack limit = 0xdd95c2f0)
  [   83.393798] Stack: (0xdd95dfb0 to 0xdd95e000)
  [   83.398345] dfa0:                                     007f9225
  007ba664 00bf 2fff
  [   83.406890] dfc0: 00ff2f2f 00ff 00ff 00ff 00ff
  00ff  00799225
  [   83.415435] dfe0: 00ff be9f8780 00ff 4027e084 2010
    
  [   83.423980] Code: bad PC value
  [   83.427490] ---[ end trace 4ae88755f08e391e ]---
  [   83.434570] S98configure[58]: Segmentation fault
  [   83.840148] S98configure[58]: Segmentation fault
  [   83.867004] S98configure[58]: gtk-update-icon-cache: Failed to open
  file /usr/share/icons/gnome/.icon-theme.cache : File exists
  [   89.605407] S98configure[58]: gtk-update-icon-cache: Failed to open
  file /usr/share/icons/HighContrastLargePrint/.icon-theme.cache : File
  exists
  [   89.932525] S98configure[58]: gtk-update-icon-cache: Failed to open
  file /usr/share/icons/Mist/.icon-theme.cache : File exists
  [   89.967773] S98configure[58]: gtk-update-icon-cache: Failed to open
  file /usr/share/icons/gnome/.icon-theme.cache : File exists
  [   91.477386] S98configure[58]: gtk-update-icon-cache: Failed to open
  file /usr/share/icons/HighContrastLargePrint/.icon-theme.cache : File
  exists
  [   91.788269] S98configure[58]: gtk-update-icon-cache: Failed to open
  file /usr/share/icons/Mist/.icon-theme.cache : File exists
  [   91.822631] S98configure[58]: gtk-update-icon-cache: Failed to open
  file /usr/share/icons/gnome/.icon-theme.cache : File exists
  [   94.313598] S98configure[58]: gtk-update-icon-cache: Failed to open
  file /usr/share/icons/HighContrastLargePrint/.icon-theme.cache : File
  exists
  [   95.381011] S98configure[58]: gtk-update-icon-cache: Failed to open
  file /usr/share/icons/Mist/.icon-theme.cache : File exists
  [   95.415405] S98configure[58]: gtk-update-icon-cache: Failed to open
  file /usr/share/icons/gnome/.icon-theme.cache : File exists
  [   99.084899] Unable to handle kernel NULL pointer dereference at
  virtual address 
  [   99.093383] pgd = c0004000
  [   99.096191] [] *pgd=
  [   99.099945] Internal error: Oops: 17 [#2]
  [   99.104125] Modules linked in: ipv6
  [   99.107788] CPU: 0    Tainted: G      D      (3.0.0+ #1)
  [   99.113342] PC is at ehci_quiesce+0xc/0x94
  [   99.117614] LR is at ehci_stop+0x34/0x8c
  [   99.121734] pc : [c0325ce4]    lr : [c0328bfc]    psr: 21d3
  [   99.121734] sp : c064de70  ip : 0108  fp : c06b8624
  [   99.133728] r10: c064dec0  r9 :   r8 : dee08504
  [   99.139190] r7 : c0328b94  r6 : 0100  r5 : dee08504  r4 : dee08608
  [   99.145996] r3 :   r2 : dee086ec  r1 : dee086b8  r0 : dee08608
  [   99.152832] Flags: nzCv  IRQs off  FIQs off  Mode SVC_32  ISA ARM
  Segment kernel
  [   99.160644] Control: 10c5387d  Table: 9d804019  DAC: 0015
  [   99.166656] Process swapper (pid: 0, stack limit = 0xc064c2f0)
  [   99.172760] Stack: (0xc064de70 to 0xc064e000)
  [   99.177307] de60:                                     dee08608
  dee086b8 dee08608 dee08504
  [   99.185852] de80: 0100 c0328bfc 0001 a153 dee08504
  c007a3d4 00200200 6153
  [   99.194396] dea0: dee08504 c007a3d4 00200200 c06b87a0 c064c000
  c007a3d4 c064c000 0003
  [   99.202941] dec0: c064dec0 c064dec0 000a 0001 c06b8628
  c064c000 0100 c06b8600
  [   99.211517] dee0: 000a c0075ec0 a246 c0677bdc 
  0001 0003 0025
  [   99.220062] df00:  c0654258 0003 0003 413fc082
    c0076290
  [   99.228607] df20: 0025 c0040064 6053  fa20
  c0044ff8 00051f4d 
  [   99.237152] df40: 00051f4d  c06a662c c0654258 c0654258
  0003 0003 413fc082
  [   99.245697] df60:    c064df80 c0054dcc
  c0054dd8 6053 
  [   99.254272] df80:  00051f4d 0063 04aae30b 0063
  04a5c3be 0001 
  [   99.262817] dfa0: c0654248 c0654258 c06d7a6c c0377404 c064c000
  c0652254 c06a5d04 c065224c
  [   99.271362] dfc0: 80004059 c0045f40 c064e9b4 c003341c c0ae1140
  c0008868 c00082c8 060a
  [   99.279907] dfe0: 8100 c003341c 10c53c7d c064e060 c0033418
  8000803c  
  [   99.288482] [c0325ce4] (ehci_quiesce+0xc/0x94) from [c0328bfc]