Hi Artem,

On Mon, Aug 04, 2014 at 12:20:45PM +0000, Artem Fetishev wrote:
> Are we talking about error in rapl_pmu_init()? I ran your script on v3.14 and 
> the error (in rapl_pmu_init) did not appear. Once again, for 3.14-rc8 the 
> error does show up. My patch was merged between v3.14-rc8 and v3.14. The 
> purpose of the patch was fixing that GPF in rapl_pmu_init. So I doubt the 
> bisect is correct.

> +-----------------------------------------------------------+------------+------------+---------------+
> |                                                           | d9060742fb | 
> 825600c0f2 | next-20140723 |
> +-----------------------------------------------------------+------------+------------+---------------+
> | boot_successes                                            | 167        | 48 
>         | 9             |
> | boot_failures                                             | 97         | 18 
>         | 2             |
> | BUG:unable_to_handle_kernel_NULL_pointer_dereference      | 97         | 14 
>         | 2             |
> | Oops                                                      | 97         | 3  
>         | 2             |

> | RIP:rapl_pmu_init                                         | 97         |    
>         |               |
> | Kernel_panic-not_syncing:Attempted_to_kill_init_exitcode= | 97         |    
>         |               |
> | backtrace:rapl_pmu_init                                   | 97         |    
>         |               |

The rapl_pmu_init bug is confirmed to be fixed judging from the above
3 lines.

> | backtrace:kernel_init_freeable                            | 97         | 3  
>         |               |

However the below BUGs are new in commit 825600c0f2:

> | BUG:unable_to_handle_kernel_paging_request                | 0          | 17 
>         | 2             |
> | BUG:unable_to_handle_kernel_NULL_pointer                  | 0          | 1  
>         |               |
> | RIP:slob_page_alloc                                       | 0          | 3  
>         |               |
> | backtrace:proc_create_data                                | 0          | 3  
>         |               |
> | backtrace:init_timer_list_procfs                          | 0          | 3  
>         |               |
> | BUG:unable_to_handle                                      | 0          | 1  
>         |               |
> | BUG:unable_to_handl                                       | 0          | 1  
>         |               |
> | BUG:unable_to_handle_ker                                  | 0          | 1  
>         |               |
> | BUG:kernel_boot_hang                                      | 0          | 1  
>         |               |
> | RIP:memset                                                | 0          | 0  
>         | 2             |
> | RIP:__rmqueue                                             | 0          | 0  
>         | 2             |
> | backtrace:cryptomgr_test                                  | 0          | 0  
>         | 2             |
> | backtrace:do_fork                                         | 0          | 0  
>         | 2             |
> +-----------------------------------------------------------+------------+------------+---------------+

BUT I find cryptomgr_test showing up in the dmesg, which is very good
at producing all kinds of bug messages. So please ignore this bug
report -- it has too strong noises and may not be correct.

> [    1.026545] cryptomgr_test (20) used greatest stack depth: 5632 bytes left
> [    1.030491] AVX version of gcm_enc/dec engaged.
> [    1.033812] alg: No test for __gcm-aes-aesni (__driver-gcm-aes-aesni)
> [    1.038806] BUG: unable to handle kernel paging request at ffff880012770000

Thanks,
Fengguang

> [    1.040058] IP: [<ffffffff8161d18f>] memset+0x1f/0xb0
> [    1.040058] PGD 4170067 PUD 4171067 PMD 13bee067 PTE 8000000012770060
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] BUG: unable to handle kernel paging request at ffff880012776440
> [    1.040058] IP: [<ffffffff810fb846>] match_held_lock+0x16/0x2f0
> [    1.040058] PGD 4170067 PUD 4171067 PMD 13bee067 PTE 8000000012776060
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000068
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 0000000000000072
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> [    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
> [    1.040058] PGD 0 
> [    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
> 000000000000006a
> 
> git bisect start v3.14 v3.13 --
> git bisect good 494479038d97f1b9f76fc633a360a681acdf035c  # 20:32     66+     
> 28  Merge tag 'pinctrl-v3.14-2' of 
> git://git.kernel.org/pub/scm/linux/kernel/git/linusw/linux-pinctrl
> git bisect good 751a03c3728ed393287374078b98c3094a0b3bd2  # 20:36     66+     
> 67  Merge tag 'pm+acpi-3.14-rc5' of 
> git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm
> git bisect good 18f2af2d68815e1c4d5c275ebd030e27ef627582  # 20:43     66+     
> 55  Merge tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm
> git bisect good 708f04d2abf4e90abee61d9ffb1f165038017ecf  # 20:46     66+     
> 58  block: free q->flush_rq in blk_init_allocated_queue error paths
> git bisect good 75c5a52da3fc2a06abb6c6192bdf5d680e56d37d  # 20:48     66+     
> 31  vfs: Allocate anon_inode_inode in anon_inode_init()
> git bisect  bad 0f2776e6151a123552fd06b666fe755fa780a967  # 20:48      0-     
> 18  Merge branch 'for-linus' of 
> git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client
> git bisect good 1fac1fa965a9fbdb38d8a4b945db3521f801c946  # 20:57     66+     
> 26  Merge branch 'drm-fixes' of git://people.freedesktop.org/~airlied/linux
> git bisect  bad bc53267e97e4942e7f22c31cd8cf87840796accb  # 21:01      1-     
>  5  Merge branch 'akpm' (patches from Andrew Morton)
> git bisect good 2946369ee277fa9fcc3372aabddc9c15dfabf744  # 21:11     66+     
>  0  Merge branch 'for-linus' of 
> git://git.kernel.org/pub/scm/linux/kernel/git/dtor/input
> git bisect  bad 825600c0f20e595daaa7a6dd8970f84fa2a2ee57  # 21:30      0-     
>  7  x86: fix boot on uniprocessor systems
> git bisect good d9060742fbf630fe31951dfc10b798deb2813f01  # 22:39     66+     
> 29  ocfs2: check if cluster name exists before deref
> # first bad commit: [825600c0f20e595daaa7a6dd8970f84fa2a2ee57] x86: fix boot 
> on uniprocessor systems
> git bisect good d9060742fbf630fe31951dfc10b798deb2813f01  # 22:49    198+     
> 97  ocfs2: check if cluster name exists before deref
> git bisect  bad 5eb00b037d9bb650b18b8f331bb9fb7a66559b5f  # 22:51      0-     
>  2  Add linux-next specific files for 20140723
> git bisect  bad 82e13c71bc655b6dc7110da4e164079dadb44892  # 23:12      0-     
> 77  Merge branch 'for-3.16' of git://linux-nfs.org/~bfields/linux
> git bisect  bad 1a58d9909611972fd1c081bb04a9f7dc2571e612  # 23:23      2-     
>  1  Add linux-next specific files for 20140724
> 
> 
> This script may reproduce the error.
> 
> ----------------------------------------------------------------------------
> #!/bin/bash
> 
> kernel=$1
> initrd=yocto-minimal-x86_64.cgz
> 
> wget --no-clobber 
> https://github.com/fengguang/reproduce-kernel-bug/blob/master/initrd/$initrd
> 
> kvm=(
>       qemu-system-x86_64
>       -enable-kvm
>       -cpu Haswell,+smep,+smap
>       -kernel $kernel
>       -initrd $initrd
>       -m 320
>       -smp 1
>       -net nic,vlan=1,model=e1000
>       -net user,vlan=1
>       -boot order=nc
>       -no-reboot
>       -watchdog i6300esb
>       -rtc base=localtime
>       -serial stdio
>       -display none
>       -monitor null 
> )
> 
> append=(
>       hung_task_panic=1
>       earlyprintk=ttyS0,115200
>       debug
>       apic=debug
>       sysrq_always_enabled
>       rcupdate.rcu_cpu_stall_timeout=100
>       panic=10
>       softlockup_panic=1
>       nmi_watchdog=panic
>       prompt_ramdisk=0
>       console=ttyS0,115200
>       console=tty0
>       vga=normal
>       root=/dev/ram0
>       rw
>       drbd.minor_count=8
> )
> 
> "${kvm[@]}" --append "${append[*]}"
> ----------------------------------------------------------------------------
> 
> Thanks,
> Fengguang
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to