On Wed, 2011-06-01 at 21:27 -0700, Christian Kujau wrote: > On Thu, 2 Jun 2011 at 12:57, Benjamin Herrenschmidt wrote: > > Ok, thanks a lot, It looks rather trivial actually: That new workaround > > is PCIe specific but is called unconditionally, and will do bad things > > non-PCIe implementations. > > OK, with your patch applied to Linus' latest git tree the machine > continues to boot. Also, with the latest tree, the "machine is stuck after > ide-cd init" problem[0] went away. > > For this particular problem and patch, feel free to add: > > Tested-by: Christian Kujau <li...@nerdbynature.de> > > However, shortly after boot and loggin in to the box remotely, the bux did > not respond any more. I'm not sure if these are related to those SSB/PCIe > changes, but somehow I hope they are - bisecting those would take much > longer, as it's not an "instant" death: > > * http://nerdbynature.de/bits/3.0-rc1/linux-3.0-rc1_stuck1.jpg > * http://nerdbynature.de/bits/3.0-rc1/linux-3.0-rc1_stuck2.jpg > > This is what an OCR program made of it:
I think this is another problem that I'm in the middle of trying to figure out. It -looks- to me that something goes wrong in the tty code when a large file is piped through a pty, causing the kernel to hang for minutes in the workqueue / ldisk flush code. I've just sent an initial report to Alan Cox about it and am currently bisecting it. Cheers, Ben. > irq euent stamp: 185804850 > hardirqs last enabled at (185904849): [<c04005b0>] > _raw_spin_unlock_irqrestore+0x40/0x?e > hardirqs last disabled at (185904850): [<c00120b8>] reenable_mmu+0x24/0x78 > Softirqs last enabled at (185892414): [<c000fe8c>] call_do_softirq+0x14/0x24 > softirqs last disabled at (18589240?): [<c000fe8c>] call_do_softirq+0x14/0x24 > NIP: e04005b4 LR: e04005b0 CTR: 00000000 > REGS: ef92be10 TRHP: 0901 Not tainted (3.0.0-rel-00049-g1fa?b6a-dirtg) > MSB: 00009032 <EE.ME.IR.DR> CR: 42002084 > TRSK = ef8d0000[38B] ’kuorker/0:2’ THREAD: > GPR00: c04005b0 ef92bec0 efBd0000 00000001 > GPR08: 00000000 0b14aed0 0049a306 00030600 > HIP [c01005b1] _rau_spin_unlock_irqrestore+0x44/0x?c > LR [c04005b0] _rau_spin_unlock_irqrestore+0x40/0x?c > Call Trace: > [ef92bec0] [c04005b0] _raw_spin_unlock_irqrestore+0x40/0x?c (unreliable) > [ef92bed0] [c029c504] flush_tu_ldisc+0x121/0x230 > [ef92bf10] [c001c86c] process_one_uork+0x1c1/0x4cB > [ef92bfS0] [c004efac] worker_thread+0x1?8/0x3c1 > [ef92bf90] [c0051148] kthread+0x81/0x88 > [ef92hff0] [c0810390] kernel_thread+0x1c/0x68 > > XER: 20000000 > ef92a000 ef8d0660 00000006 00000000 18614000 22002088 > Instruction dump: > ??? 93e1060c ?c9f23?B 38800001 90010011 4bc6e9a9 ?fc3i`3?8 4be61a69 > ?3e08080 11820021 1bc6b515 ?fe00124 > B8c16008 ?c0803a6 83c1000c > > Well, the picture is way better :-\ > > Thanks, > Christian. > > [0] http://nerdbynature.de/bits/3.0-rc1/linux-3.0-rc1-cdrom.jpg _______________________________________________ Linuxppc-dev mailing list Linuxppc-dev@lists.ozlabs.org https://lists.ozlabs.org/listinfo/linuxppc-dev