Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-13 Thread Borislav Petkov
On Mon, Feb 13, 2017 at 02:26:20AM +0100, Gabriel C wrote: > I didn't tested your patch yet but did a boot with mce=off and nomce > which seems to not really works since is still want to mc_device_add() > even when off. mc_device_add() is microcode loader's ->add_dev() subsys pointer and that's

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-13 Thread Borislav Petkov
On Mon, Feb 13, 2017 at 02:26:20AM +0100, Gabriel C wrote: > I didn't tested your patch yet but did a boot with mce=off and nomce > which seems to not really works since is still want to mc_device_add() > even when off. mc_device_add() is microcode loader's ->add_dev() subsys pointer and that's

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-13 Thread Thomas Gleixner
On Mon, 13 Feb 2017, Mike Galbraith wrote: > kernel/time/tick-broadcast.c |5 +++-- > 1 file changed, 3 insertions(+), 2 deletions(-) > > --- a/kernel/time/tick-broadcast.c > +++ b/kernel/time/tick-broadcast.c > @@ -357,6 +357,7 @@ void tick_broadcast_control(enum tick_br > struct

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-13 Thread Thomas Gleixner
On Mon, 13 Feb 2017, Mike Galbraith wrote: > kernel/time/tick-broadcast.c |5 +++-- > 1 file changed, 3 insertions(+), 2 deletions(-) > > --- a/kernel/time/tick-broadcast.c > +++ b/kernel/time/tick-broadcast.c > @@ -357,6 +357,7 @@ void tick_broadcast_control(enum tick_br > struct

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-12 Thread Mike Galbraith
On Mon, 2017-02-13 at 02:26 +0100, Gabriel C wrote: > [5.276704]CPU0 > [5.312400] > [5.347605] lock(tick_broadcast_lock); > [5.383163] > [5.418457] lock(tick_broadcast_lock); > [5.454015] > *** DEADLOCK *** > > [5.557982]

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-12 Thread Mike Galbraith
On Mon, 2017-02-13 at 02:26 +0100, Gabriel C wrote: > [5.276704]CPU0 > [5.312400] > [5.347605] lock(tick_broadcast_lock); > [5.383163] > [5.418457] lock(tick_broadcast_lock); > [5.454015] > *** DEADLOCK *** > > [5.557982]

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-12 Thread Gabriel C
On 13.02.2017 01:38, Borislav Petkov wrote: On Sun, Feb 12, 2017 at 11:21:13PM +0100, Gabriel C wrote: http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/crash_initcall_debug.mp4 http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/crash_intcall_debug_ucode_off.mp4 Thanks and

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-12 Thread Gabriel C
On 13.02.2017 01:38, Borislav Petkov wrote: On Sun, Feb 12, 2017 at 11:21:13PM +0100, Gabriel C wrote: http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/crash_initcall_debug.mp4 http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/crash_intcall_debug_ucode_off.mp4 Thanks and

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-12 Thread Borislav Petkov
On Sun, Feb 12, 2017 at 11:21:13PM +0100, Gabriel C wrote: > http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/crash_initcall_debug.mp4 > http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/crash_intcall_debug_ucode_off.mp4 Thanks and interesting. In both cases, mcheck_init_device()

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-12 Thread Borislav Petkov
On Sun, Feb 12, 2017 at 11:21:13PM +0100, Gabriel C wrote: > http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/crash_initcall_debug.mp4 > http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/crash_intcall_debug_ucode_off.mp4 Thanks and interesting. In both cases, mcheck_init_device()

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-12 Thread Gabriel C
On 12.02.2017 22:12, Borislav Petkov wrote: On Sun, Feb 12, 2017 at 09:21:53PM +0100, Gabriel C wrote: There is what I get : http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/crash2.mp4 Ok, I'm watching it frame-by-frame. I can see the microcode getting updated to revision 0x19 as

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-12 Thread Gabriel C
On 12.02.2017 22:12, Borislav Petkov wrote: On Sun, Feb 12, 2017 at 09:21:53PM +0100, Gabriel C wrote: There is what I get : http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/crash2.mp4 Ok, I'm watching it frame-by-frame. I can see the microcode getting updated to revision 0x19 as

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-12 Thread Borislav Petkov
On Sun, Feb 12, 2017 at 09:21:53PM +0100, Gabriel C wrote: > There is what I get : > > http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/crash2.mp4 Ok, I'm watching it frame-by-frame. I can see the microcode getting updated to revision 0x19 as in your working dmesg. The machine hangs

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-12 Thread Borislav Petkov
On Sun, Feb 12, 2017 at 09:21:53PM +0100, Gabriel C wrote: > There is what I get : > > http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/crash2.mp4 Ok, I'm watching it frame-by-frame. I can see the microcode getting updated to revision 0x19 as in your working dmesg. The machine hangs

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-12 Thread Gabriel C
On 11.02.2017 22:32, Borislav Petkov wrote: On Sat, Feb 11, 2017 at 09:58:26PM +0100, Gabriel C wrote: Yes , it will hang before tsc message .. Also sometimes I have same trace sometimes it just hangs forever. It doesn't sound like dis_ucode_ldr changes anything. Or maybe it does, maybe the

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-12 Thread Gabriel C
On 11.02.2017 22:32, Borislav Petkov wrote: On Sat, Feb 11, 2017 at 09:58:26PM +0100, Gabriel C wrote: Yes , it will hang before tsc message .. Also sometimes I have same trace sometimes it just hangs forever. It doesn't sound like dis_ucode_ldr changes anything. Or maybe it does, maybe the

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-11 Thread Borislav Petkov
On Sat, Feb 11, 2017 at 09:58:26PM +0100, Gabriel C wrote: > Yes , it will hang before tsc message .. > Also sometimes I have same trace sometimes it just hangs forever. It doesn't sound like dis_ucode_ldr changes anything. Or maybe it does, maybe the microcode applies some fix for some erratum

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-11 Thread Borislav Petkov
On Sat, Feb 11, 2017 at 09:58:26PM +0100, Gabriel C wrote: > Yes , it will hang before tsc message .. > Also sometimes I have same trace sometimes it just hangs forever. It doesn't sound like dis_ucode_ldr changes anything. Or maybe it does, maybe the microcode applies some fix for some erratum

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-11 Thread Gabriel C
On 11.02.2017 15:21, Borislav Petkov wrote: On Sat, Feb 11, 2017 at 02:09:14PM +0100, Gabriel C wrote: Adding ' dis_ucode_ldr ' to commandline makes the kernel hangs right after : Wait a minute, are you saying that without dis_ucode_ldr you can't even boot so far? Yes , it will hang

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-11 Thread Gabriel C
On 11.02.2017 15:21, Borislav Petkov wrote: On Sat, Feb 11, 2017 at 02:09:14PM +0100, Gabriel C wrote: Adding ' dis_ucode_ldr ' to commandline makes the kernel hangs right after : Wait a minute, are you saying that without dis_ucode_ldr you can't even boot so far? Yes , it will hang

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-11 Thread Borislav Petkov
On Sat, Feb 11, 2017 at 02:09:14PM +0100, Gabriel C wrote: > Adding ' dis_ucode_ldr ' to commandline makes the kernel hangs right after : Wait a minute, are you saying that without dis_ucode_ldr you can't even boot so far? > clocksource: tsc: mask: 0x max_cycles: 0x20ac7f6ecc6,

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-11 Thread Borislav Petkov
On Sat, Feb 11, 2017 at 02:09:14PM +0100, Gabriel C wrote: > Adding ' dis_ucode_ldr ' to commandline makes the kernel hangs right after : Wait a minute, are you saying that without dis_ucode_ldr you can't even boot so far? > clocksource: tsc: mask: 0x max_cycles: 0x20ac7f6ecc6,

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-11 Thread Gabriel C
On 11.02.2017 09:26, Thomas Gleixner wrote: You might try with 'earlyprintk' on the command line. That should tell more. With that I have some more output.. and after lots more boots I found out there are really at least 2 bugs triggered by this in 4.10. When just boothing with

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-11 Thread Gabriel C
On 11.02.2017 09:26, Thomas Gleixner wrote: You might try with 'earlyprintk' on the command line. That should tell more. With that I have some more output.. and after lots more boots I found out there are really at least 2 bugs triggered by this in 4.10. When just boothing with

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-11 Thread Thomas Gleixner
On Sat, 11 Feb 2017, Gabriel C wrote: > On 07.02.2017 22:25, Thomas Gleixner wrote: > Hi Thomas , > > Sorry I was travelling.. Nothing to be sorry about. > > Btw, how far in the boot process is the machine when this happens? > > Right after : > > Uncompressing Linux. > Booting the

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-11 Thread Thomas Gleixner
On Sat, 11 Feb 2017, Gabriel C wrote: > On 07.02.2017 22:25, Thomas Gleixner wrote: > Hi Thomas , > > Sorry I was travelling.. Nothing to be sorry about. > > Btw, how far in the boot process is the machine when this happens? > > Right after : > > Uncompressing Linux. > Booting the

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-10 Thread Gabriel C
On 11.02.2017 00:17, Gabriel C wrote: Btw, how far in the boot process is the machine when this happens? Right after : Uncompressing Linux. Booting the kernel.. So early.. After lots more boots .. I found out sometimes it gets to : .. [4.656826] Key type dns_resolver

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-10 Thread Gabriel C
On 11.02.2017 00:17, Gabriel C wrote: Btw, how far in the boot process is the machine when this happens? Right after : Uncompressing Linux. Booting the kernel.. So early.. After lots more boots .. I found out sometimes it gets to : .. [4.656826] Key type dns_resolver

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-10 Thread Gabriel C
On 07.02.2017 22:25, Thomas Gleixner wrote: On Tue, 7 Feb 2017, Thomas Gleixner wrote: Hi Thomas , Sorry I was travelling.. Gabriel, can you please send me the bootlog from a working kernel? http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/dmesg ( If you wish I can send you

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-10 Thread Gabriel C
On 07.02.2017 22:25, Thomas Gleixner wrote: On Tue, 7 Feb 2017, Thomas Gleixner wrote: Hi Thomas , Sorry I was travelling.. Gabriel, can you please send me the bootlog from a working kernel? http://ftp.frugalware.org/pub/other/people/crazy/kernel/t/dmesg ( If you wish I can send you

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-07 Thread Thomas Gleixner
On Mon, 6 Feb 2017, Linus Torvalds wrote: > That said, it also strikes me that the implicated > irq_chip_retrigger_hierarchy() function looks really very suspicious > indeed. > > Most of the other users don't seem to traverse the parent all the way > until they find something. They just do the

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-07 Thread Thomas Gleixner
On Mon, 6 Feb 2017, Linus Torvalds wrote: > That said, it also strikes me that the implicated > irq_chip_retrigger_hierarchy() function looks really very suspicious > indeed. > > Most of the other users don't seem to traverse the parent all the way > until they find something. They just do the

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-07 Thread Thomas Gleixner
On Tue, 7 Feb 2017, Thomas Gleixner wrote: > Gabriel, can you please send me the bootlog from a working kernel? Plus content of /proc/interrupts. Btw, how far in the boot process is the machine when this happens? Thanks, tglx

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-07 Thread Thomas Gleixner
On Tue, 7 Feb 2017, Thomas Gleixner wrote: > Gabriel, can you please send me the bootlog from a working kernel? Plus content of /proc/interrupts. Btw, how far in the boot process is the machine when this happens? Thanks, tglx

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-07 Thread Thomas Gleixner
On Mon, 6 Feb 2017, Linus Torvalds wrote: > But for now, I'd be inclined to just revert it unless somebody has a > "Duh!" moment and can tell me what's wrong with that commit with an > obvious fix. I have no "Duh!" moment even after staring at the code for quite a while. Gabriel, can you please

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-07 Thread Thomas Gleixner
On Mon, 6 Feb 2017, Linus Torvalds wrote: > But for now, I'd be inclined to just revert it unless somebody has a > "Duh!" moment and can tell me what's wrong with that commit with an > obvious fix. I have no "Duh!" moment even after staring at the code for quite a while. Gabriel, can you please

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-06 Thread Linus Torvalds
On Mon, Feb 6, 2017 at 9:30 AM, Gabriel C wrote: > > Somewhat late , however I didn't tested 4.9.6 but jumped from 4.9.5 to 4.9.7 > and found out by box won't boot anymore. > > It hangs early and freeze with a lot RCU warnings. > Since I cannot setup a netconsole right now I

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-06 Thread Linus Torvalds
On Mon, Feb 6, 2017 at 9:30 AM, Gabriel C wrote: > > Somewhat late , however I didn't tested 4.9.6 but jumped from 4.9.5 to 4.9.7 > and found out by box won't boot anymore. > > It hangs early and freeze with a lot RCU warnings. > Since I cannot setup a netconsole right now I cannot post the

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-06 Thread Gabriel C
On 06.02.2017 20:05, Ruslan Ruslichenko -X (rruslich - GLOBALLOGIC INC at Cisco) wrote: On 02/06/2017 07:41 PM, Greg KH wrote: On Mon, Feb 06, 2017 at 06:30:15PM +0100, Gabriel C wrote: On 26.01.2017 08:48, Greg KH wrote: Hi Greg, I'm announcing the release of the 4.9.6 kernel.

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-06 Thread Gabriel C
On 06.02.2017 20:05, Ruslan Ruslichenko -X (rruslich - GLOBALLOGIC INC at Cisco) wrote: On 02/06/2017 07:41 PM, Greg KH wrote: On Mon, Feb 06, 2017 at 06:30:15PM +0100, Gabriel C wrote: On 26.01.2017 08:48, Greg KH wrote: Hi Greg, I'm announcing the release of the 4.9.6 kernel.

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-06 Thread Ruslan Ruslichenko -X (rruslich - GLOBALLOGIC INC at Cisco)
On 02/06/2017 07:41 PM, Greg KH wrote: On Mon, Feb 06, 2017 at 06:30:15PM +0100, Gabriel C wrote: On 26.01.2017 08:48, Greg KH wrote: Hi Greg, I'm announcing the release of the 4.9.6 kernel. Somewhat late , however I didn't tested 4.9.6 but jumped from 4.9.5 to 4.9.7 and found out by box

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-06 Thread Ruslan Ruslichenko -X (rruslich - GLOBALLOGIC INC at Cisco)
On 02/06/2017 07:41 PM, Greg KH wrote: On Mon, Feb 06, 2017 at 06:30:15PM +0100, Gabriel C wrote: On 26.01.2017 08:48, Greg KH wrote: Hi Greg, I'm announcing the release of the 4.9.6 kernel. Somewhat late , however I didn't tested 4.9.6 but jumped from 4.9.5 to 4.9.7 and found out by box

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-06 Thread Greg KH
On Mon, Feb 06, 2017 at 06:30:15PM +0100, Gabriel C wrote: > > On 26.01.2017 08:48, Greg KH wrote: > > Hi Greg, > > > I'm announcing the release of the 4.9.6 kernel. > > > Somewhat late , however I didn't tested 4.9.6 but jumped from 4.9.5 to 4.9.7 > and found out by box won't boot anymore. >

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-06 Thread Greg KH
On Mon, Feb 06, 2017 at 06:30:15PM +0100, Gabriel C wrote: > > On 26.01.2017 08:48, Greg KH wrote: > > Hi Greg, > > > I'm announcing the release of the 4.9.6 kernel. > > > Somewhat late , however I didn't tested 4.9.6 but jumped from 4.9.5 to 4.9.7 > and found out by box won't boot anymore. >

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-06 Thread Gabriel C
On 26.01.2017 08:48, Greg KH wrote: Hi Greg, I'm announcing the release of the 4.9.6 kernel. Somewhat late , however I didn't tested 4.9.6 but jumped from 4.9.5 to 4.9.7 and found out by box won't boot anymore. It hangs early and freeze with a lot RCU warnings. Since I cannot setup a

Re: Linux 4.9.6 ( Restore IO-APIC irq_chip retrigger callback , breaks my box )

2017-02-06 Thread Gabriel C
On 26.01.2017 08:48, Greg KH wrote: Hi Greg, I'm announcing the release of the 4.9.6 kernel. Somewhat late , however I didn't tested 4.9.6 but jumped from 4.9.5 to 4.9.7 and found out by box won't boot anymore. It hangs early and freeze with a lot RCU warnings. Since I cannot setup a