Re: [LKP] [x86LKP] PANIC: double fault, error_code: 0xffffffffffffffff

2014-10-08 Thread Andi Kleen
> As the table shows, it's not an easily reproducible bug -- the probability is
> less than 10%:

I think I found the problem. Thanks. Hopefully tests will pass now.

-Andi
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [LKP] [x86LKP] PANIC: double fault, error_code: 0xffffffffffffffff

2014-10-07 Thread Fengguang Wu
Hi Andi,

On Mon, Oct 06, 2014 at 01:37:31PM -0700, Andi Kleen wrote:
> 
> I cannot reproduce with the config/kvm script. 
> 
> Well the user space in your initrd hangs after boot, but I presume that's some
> different issue.

It is not hung; the tests just take some time to run, and the system will reboot
after about 1 minute:

[    1.047088] Write protecting the kernel read-only data: 10240k
[    1.050967] Freeing unused kernel memory: 1248K (8800014c8000 - 88000160)
[    1.056489] Freeing unused kernel memory: 1936K (88000181c000 - 880001a0)
[    1.061008] random: init urandom read with 3 bits of entropy available
[   62.208919] reboot: Restarting system

As the table shows, it's not an easily reproducible bug -- the probability is
less than 10%:

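+------------------------------------------+------------+------------+------------+
|                                          | 598d570a05 | b8a868e9ea | 8048975233 |
+------------------------------------------+------------+------------+------------+
| boot_successes                           | 900        | 280        | 79         |
| boot_failures                            | 0          | 20         | 2          |
| PANIC:double_fault,                      | 0          | 12         | 2          |
| Kernel_panic-not_syncing:Machine_halted  | 0          | 11         | 2          |
| BUG:unable_to_handle_kernel              | 0          | 5          |            |
| Oops                                     | 0          | 3          |            |
| RIP:pgd_free                             | 0          | 1          |            |
| BUG:kernel_boot_crashed                  | 0          | 4          |            |
| RIP:show_stack_log_lvl                   | 0          | 1          |            |
| Kernel_panic-not_syncing:Fatal_exception | 0          | 1          |            |
+------------------------------------------+------------+------------+------------+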
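
To put numbers on that, the per-commit failure rate can be worked out from the
counts in the table. The following is a minimal sketch in Python, not part of
the LKP tooling: the name failure_rate is made up for illustration, and it
assumes each commit's total boot count is boot_successes + boot_failures.

```python
# Boot counts copied from the table above: commit -> (boot_successes, boot_failures).
BOOTS = {
    "598d570a05": (900, 0),
    "b8a868e9ea": (280, 20),
    "8048975233": (79, 2),
}


def failure_rate(successes, failures):
    """Fraction of boots that failed; assumes total = successes + failures."""
    total = successes + failures
    if total == 0:
        raise ValueError("no boots recorded")
    return failures / total


# Failure rate per commit, computed once so it can be inspected or printed.
RATES = {commit: failure_rate(ok, bad) for commit, (ok, bad) in BOOTS.items()}

if __name__ == "__main__":
    for commit, rate in RATES.items():
        print(f"{commit}: {rate:.1%} of boots failed")
```

With these counts, both failing commits stay under 10%: about 6.7% for
b8a868e9ea and about 2.5% for 8048975233.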

> Do you have a different config or setup that shows it too?

The next email report, titled "[x86] double fault:  [#1]", happens on another
config and has a much higher reproduction rate:

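+-------------------------------------------------------+------------+------------+------------------+
|                                                       | 598d570a05 | b8a868e9ea | v3.17-rc7_100409 |
+-------------------------------------------------------+------------+------------+------------------+
| boot_successes                                        | 207        | 24         | 17               |
| boot_failures                                         | 3          | 46         | 4                |
| BUG:kernel_boot_crashed                               | 3          |            |                  |
| double_fault:                                         | 0          | 39         | 4                |
| RIP:trace_hardirqs_off_thunk                          | 0          | 22         | 1                |
| BUG:unable_to_handle_kernel                           | 0          | 17         | 1                |
| Oops                                                  | 0          | 15         | 1                |
| RIP:show_stack_log_lvl                                | 0          | 14         | 1                |
| Kernel_panic-not_syncing:Fatal_exception_in_interrupt | 0          | 20         | 2                |
| RIP:trace_hardirqs_off_caller                         | 0          | 6          | 1                |
| BUG:kernel_boot_hang                                  | 0          | 23         | 2                |
| backtrace:rescuer_thread                              | 0          | 1          |                  |
+-------------------------------------------------------+------------+------------+------------------+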
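
As a rough guide to how many boots a reproduction attempt may need, here is a
minimal sketch in Python. It assumes boots fail independently with a fixed
probability, which is a simplification; the name boots_needed and the 95%
target are illustrative choices, not something the test setup defines.

```python
import math


def boots_needed(failure_rate, confidence=0.95):
    """Boots needed to see at least one failure with the given confidence.

    Assumes every boot fails independently with probability failure_rate.
    """
    if not 0.0 < failure_rate <= 1.0:
        raise ValueError("failure_rate must be in (0, 1]")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must be in (0, 1)")
    if failure_rate == 1.0:
        return 1
    # P(no failure in n boots) = (1 - p)^n; solve (1 - p)^n <= 1 - confidence.
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - failure_rate))


if __name__ == "__main__":
    # b8a868e9ea on this config: 46 failures out of 24 + 46 boots.
    print(boots_needed(46 / 70))
    # b8a868e9ea on the earlier config: 20 failures out of 280 + 20 boots.
    print(boots_needed(20 / 300))
```

Under that assumption, 3 boots give a 95% chance of hitting the failure on this
config, versus 44 boots on the earlier one.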

You can use the same script to reproduce it. Only the kernel needs to be
rebuilt with the config attached to that email.

Thanks,
Fengguang


Re: [LKP] [x86LKP] PANIC: double fault, error_code: 0xffffffffffffffff

2014-10-06 Thread Andi Kleen

I cannot reproduce with the config/kvm script. 

Well the user space in your initrd hangs after boot, but I presume that's some
different issue.

Do you have a different config or setup that shows it too?

-Andi

