Hi Fengguang,

Reproduced with your config and script. Working on locating the cause...

Thanks,
-Artem

-----Original Message-----
From: Fengguang Wu [mailto:fengguang...@intel.com] 
Sent: Wednesday, July 30, 2014 7:08 AM
To: Artem Fetishev
Cc: Jet Chen; Su Tao; Yuanhan Liu; LKP; linux-kernel@vger.kernel.org
Subject: [x86] BUG: unable to handle kernel paging request at ffff880012770000

Hi Artem,

FYI, this bisect has noise errors, so I'm not sure if it's a correct bisect.

git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git master
commit 825600c0f20e595daaa7a6dd8970f84fa2a2ee57
Author:     Artem Fetishev <artem_fetis...@epam.com>
AuthorDate: Fri Mar 28 13:33:39 2014 -0700
Commit:     Linus Torvalds <torva...@linux-foundation.org>
CommitDate: Fri Mar 28 13:56:58 2014 -0700

    x86: fix boot on uniprocessor systems
    
    On x86 uniprocessor systems topology_physical_package_id() returns -1
    which causes rapl_cpu_prepare() to leave rapl_pmu variable uninitialized
    which leads to GPF in rapl_pmu_init().
    
    See arch/x86/kernel/cpu/perf_event_intel_rapl.c.
    
    It turns out that physical_package_id and core_id can actually be
    retreived for uniprocessor systems too.  Enabling them also fixes
    rapl_pmu code.
    
    Signed-off-by: Artem Fetishev <artem_fetis...@epam.com>
    Cc: Stephane Eranian <eran...@google.com>
    Cc: Ingo Molnar <mi...@elte.hu>
    Cc: "H. Peter Anvin" <h...@zytor.com>
    Cc: Thomas Gleixner <t...@linutronix.de>
    Cc: Peter Zijlstra <a.p.zijls...@chello.nl>
    Cc: <sta...@vger.kernel.org>
    Signed-off-by: Andrew Morton <a...@linux-foundation.org>
    Signed-off-by: Linus Torvalds <torva...@linux-foundation.org>

===================================================
PARENT COMMIT NOT CLEAN. LOOK OUT FOR WRONG BISECT!
===================================================
Attached dmesg for the parent commit, too, to help confirm whether it is a 
noise error.

+-----------------------------------------------------------+------------+------------+---------------+
|                                                           | d9060742fb | 
825600c0f2 | next-20140723 |
+-----------------------------------------------------------+------------+------------+---------------+
| boot_successes                                            | 167        | 48   
      | 9             |
| boot_failures                                             | 97         | 18   
      | 2             |
| BUG:unable_to_handle_kernel_NULL_pointer_dereference      | 97         | 14   
      | 2             |
| Oops                                                      | 97         | 3    
      | 2             |
| RIP:rapl_pmu_init                                         | 97         |      
      |               |
| Kernel_panic-not_syncing:Attempted_to_kill_init_exitcode= | 97         |      
      |               |
| backtrace:rapl_pmu_init                                   | 97         |      
      |               |
| backtrace:kernel_init_freeable                            | 97         | 3    
      |               |
| BUG:unable_to_handle_kernel_paging_request                | 0          | 17   
      | 2             |
| BUG:unable_to_handle_kernel_NULL_pointer                  | 0          | 1    
      |               |
| RIP:slob_page_alloc                                       | 0          | 3    
      |               |
| backtrace:proc_create_data                                | 0          | 3    
      |               |
| backtrace:init_timer_list_procfs                          | 0          | 3    
      |               |
| BUG:unable_to_handle                                      | 0          | 1    
      |               |
| BUG:unable_to_handl                                       | 0          | 1    
      |               |
| BUG:unable_to_handle_ker                                  | 0          | 1    
      |               |
| BUG:kernel_boot_hang                                      | 0          | 1    
      |               |
| RIP:memset                                                | 0          | 0    
      | 2             |
| RIP:__rmqueue                                             | 0          | 0    
      | 2             |
| backtrace:cryptomgr_test                                  | 0          | 0    
      | 2             |
| backtrace:do_fork                                         | 0          | 0    
      | 2             |
+-----------------------------------------------------------+------------+------------+---------------+

[    1.026545] cryptomgr_test (20) used greatest stack depth: 5632 bytes left
[    1.030491] AVX version of gcm_enc/dec engaged.
[    1.033812] alg: No test for __gcm-aes-aesni (__driver-gcm-aes-aesni)
[    1.038806] BUG: unable to handle kernel paging request at ffff880012770000
[    1.040058] IP: [<ffffffff8161d18f>] memset+0x1f/0xb0
[    1.040058] PGD 4170067 PUD 4171067 PMD 13bee067 PTE 8000000012770060
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] BUG: unable to handle kernel paging request at ffff880012776440
[    1.040058] IP: [<ffffffff810fb846>] match_held_lock+0x16/0x2f0
[    1.040058] PGD 4170067 PUD 4171067 PMD 13bee067 PTE 8000000012776060
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000068
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
0000000000000072
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a
[    1.040058] IP: [<ffffffff820d66f6>] no_context+0x34e/0x3f4
[    1.040058] PGD 0 
[    1.040058] BUG: unable to handle kernel NULL pointer dereference at 
000000000000006a

git bisect start v3.14 v3.13 --
git bisect good 494479038d97f1b9f76fc633a360a681acdf035c  # 20:32     66+     
28  Merge tag 'pinctrl-v3.14-2' of 
git://git.kernel.org/pub/scm/linux/kernel/git/linusw/linux-pinctrl
git bisect good 751a03c3728ed393287374078b98c3094a0b3bd2  # 20:36     66+     
67  Merge tag 'pm+acpi-3.14-rc5' of 
git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm
git bisect good 18f2af2d68815e1c4d5c275ebd030e27ef627582  # 20:43     66+     
55  Merge tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm
git bisect good 708f04d2abf4e90abee61d9ffb1f165038017ecf  # 20:46     66+     
58  block: free q->flush_rq in blk_init_allocated_queue error paths
git bisect good 75c5a52da3fc2a06abb6c6192bdf5d680e56d37d  # 20:48     66+     
31  vfs: Allocate anon_inode_inode in anon_inode_init()
git bisect  bad 0f2776e6151a123552fd06b666fe755fa780a967  # 20:48      0-     
18  Merge branch 'for-linus' of 
git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client
git bisect good 1fac1fa965a9fbdb38d8a4b945db3521f801c946  # 20:57     66+     
26  Merge branch 'drm-fixes' of git://people.freedesktop.org/~airlied/linux
git bisect  bad bc53267e97e4942e7f22c31cd8cf87840796accb  # 21:01      1-      
5  Merge branch 'akpm' (patches from Andrew Morton)
git bisect good 2946369ee277fa9fcc3372aabddc9c15dfabf744  # 21:11     66+      
0  Merge branch 'for-linus' of 
git://git.kernel.org/pub/scm/linux/kernel/git/dtor/input
git bisect  bad 825600c0f20e595daaa7a6dd8970f84fa2a2ee57  # 21:30      0-      
7  x86: fix boot on uniprocessor systems
git bisect good d9060742fbf630fe31951dfc10b798deb2813f01  # 22:39     66+     
29  ocfs2: check if cluster name exists before deref
# first bad commit: [825600c0f20e595daaa7a6dd8970f84fa2a2ee57] x86: fix boot on 
uniprocessor systems
git bisect good d9060742fbf630fe31951dfc10b798deb2813f01  # 22:49    198+     
97  ocfs2: check if cluster name exists before deref
git bisect  bad 5eb00b037d9bb650b18b8f331bb9fb7a66559b5f  # 22:51      0-      
2  Add linux-next specific files for 20140723
git bisect  bad 82e13c71bc655b6dc7110da4e164079dadb44892  # 23:12      0-     
77  Merge branch 'for-3.16' of git://linux-nfs.org/~bfields/linux
git bisect  bad 1a58d9909611972fd1c081bb04a9f7dc2571e612  # 23:23      2-      
1  Add linux-next specific files for 20140724


This script may reproduce the error.

----------------------------------------------------------------------------
#!/bin/bash

kernel=$1
initrd=yocto-minimal-x86_64.cgz

wget --no-clobber 
https://github.com/fengguang/reproduce-kernel-bug/blob/master/initrd/$initrd

kvm=(
        qemu-system-x86_64
        -enable-kvm
        -cpu Haswell,+smep,+smap
        -kernel $kernel
        -initrd $initrd
        -m 320
        -smp 1
        -net nic,vlan=1,model=e1000
        -net user,vlan=1
        -boot order=nc
        -no-reboot
        -watchdog i6300esb
        -rtc base=localtime
        -serial stdio
        -display none
        -monitor null 
)

append=(
        hung_task_panic=1
        earlyprintk=ttyS0,115200
        debug
        apic=debug
        sysrq_always_enabled
        rcupdate.rcu_cpu_stall_timeout=100
        panic=10
        softlockup_panic=1
        nmi_watchdog=panic
        prompt_ramdisk=0
        console=ttyS0,115200
        console=tty0
        vga=normal
        root=/dev/ram0
        rw
        drbd.minor_count=8
)

"${kvm[@]}" --append "${append[*]}"
----------------------------------------------------------------------------

Thanks,
Fengguang
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to