I've been able to reproduce what looks like a race-condition bug. The
crash is intermittent: I can run many traces without a crash, yet
sometimes it crashes on the first attempt.
I've attached the dmesg output showing what is happening. Please let me
know if there is anything else I can provide that might help diagnose
it.
----------------------------------------
From: Manu Awasthi <manu.awas...@gmail.com>
Date: Tue, Nov 3, 2009 at 10:31 AM
Subject: kernel panic with monitoring DRAM events
To: perfmon2-devel@lists.sourceforge.net
Hi all,
I have been measuring memory events for the PARSEC benchmark suite on a
dual-socket, quad-core Opteron machine with pfmlib version 3.9, kernel
pfmon version 2.82 and kernel version 2.6.29.6. This is the command
line I use:
>> pfmon --with-header --outfile=test1 --verbose -u --switch-timeout=100
-eDRAM_ACCESSES_PAGE:HIT,DRAM_ACCESSES_PAGE:MISS,DRAM_ACCESSES_PAGE:CONFLICT,DRAM_ACCESSES_PAGE:ALL
-eDRAM_ACCESSES_PAGE:DCT1_PAGE_HIT,DRAM_ACCESSES_PAGE:DCT1_PAGE_MISS,DRAM_ACCESSES_PAGE:DCT1_PAGE_CONFLICT,DRAM_ACCESSES_PAGE:ALL
$PASEC_COMMAND
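
Because the command above wraps across several lines in the mail, here is a minimal Python sketch that assembles the same argument list, so the two -e event lists can be read separately. The flags and event names are copied from the command line above; the function name build_pfmon_argv and the placeholder benchmark command are only illustrative assumptions and are not part of pfmon.

```python
import shlex

# Event lists copied verbatim from the two -e options in the command above.
EVENT_SETS = [
    ["DRAM_ACCESSES_PAGE:HIT", "DRAM_ACCESSES_PAGE:MISS",
     "DRAM_ACCESSES_PAGE:CONFLICT", "DRAM_ACCESSES_PAGE:ALL"],
    ["DRAM_ACCESSES_PAGE:DCT1_PAGE_HIT", "DRAM_ACCESSES_PAGE:DCT1_PAGE_MISS",
     "DRAM_ACCESSES_PAGE:DCT1_PAGE_CONFLICT", "DRAM_ACCESSES_PAGE:ALL"],
]


def build_pfmon_argv(benchmark_cmd, outfile="test1", switch_timeout=100):
    """Return the pfmon argument list for the given benchmark command.

    build_pfmon_argv is a hypothetical helper; it only rebuilds the
    command shown in the mail and does not run anything.
    """
    argv = ["pfmon", "--with-header", f"--outfile={outfile}", "--verbose",
            "-u", f"--switch-timeout={switch_timeout}"]
    for events in EVENT_SETS:
        # One -e option per event list, events joined by commas.
        argv.append("-e" + ",".join(events))
    # The benchmark command goes last, split the way a shell would split it.
    argv.extend(shlex.split(benchmark_cmd))
    return argv


# "./my_benchmark --threads 8" is a placeholder, not the real PARSEC command.
print(shlex.join(build_pfmon_argv("./my_benchmark --threads 8")))
```

The resulting list can be handed to subprocess.run() unchanged, which avoids any shell quoting problems with the long -e arguments.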
The problem is that on some runs of the same (multi-threaded)
benchmark, the kernel panics and the machine freezes up. Has anybody
experienced something like this before? Or am I doing something wrong?
Is there a better way to measure these stats (system-wide monitoring,
perhaps)?
Any help is appreciated.
Thanks,
Manu
-----------------------------
David W Nellans
dnell...@cs.utah.edu
[ 214.111050] BUG: using smp_processor_id() in preemptible [00000000]
code: pfmon/2344
[ 214.111070] caller is pfm_arch_get_pmu_module_name+0x1a/0xe8
[ 214.111077] Pid: 2344, comm: pfmon Not tainted 2.6.29.6 #1
[ 214.111082] Call Trace:
[ 214.111104] [<ffffffff803d6956>] debug_smp_processor_id+0xca/0xe0
[ 214.111113] [<ffffffff80227863>]
pfm_arch_get_pmu_module_name+0x1a/0xe8
[ 214.111126] [<ffffffff803c756f>] pfm_pmu_conf_get+0x4b/0x117
[ 214.111138] [<ffffffff803cbf0a>] __pfm_create_context+0x2b/0x26d
[ 214.111145] [<ffffffff803c8897>] sys_pfm_create_context+0x8e/0xaa
[ 214.111155] [<ffffffff802bb45a>] ? vfs_write+0xce/0x106
[ 214.111163] [<ffffffff8020b79b>] system_call_fastpath+0x16/0x1b
[ 214.111169] BUG: using smp_processor_id() in preemptible [00000000]
code: pfmon/2344
[ 214.111177] caller is pfm_arch_get_pmu_module_name+0x93/0xe8
[ 214.111182] Pid: 2344, comm: pfmon Not tainted 2.6.29.6 #1
[ 214.111186] Call Trace:
[ 214.111191] [<ffffffff803d6956>] debug_smp_processor_id+0xca/0xe0
[ 214.111201] [<ffffffff802278dc>]
pfm_arch_get_pmu_module_name+0x93/0xe8
[ 214.111209] [<ffffffff803c756f>] pfm_pmu_conf_get+0x4b/0x117
[ 214.111219] [<ffffffff803cbf0a>] __pfm_create_context+0x2b/0x26d
[ 214.111228] [<ffffffff803c8897>] sys_pfm_create_context+0x8e/0xaa
[ 214.111237] [<ffffffff802bb45a>] ? vfs_write+0xce/0x106
[ 214.111246] [<ffffffff8020b79b>] system_call_fastpath+0x16/0x1b
[ 214.142267] BUG: using smp_processor_id() in preemptible [00000000]
code: modprobe/2346
[ 214.142281] caller is pfm_amd64_probe_pmu+0x22/0x3f7 [perfmon_amd64]
[ 214.142287] Pid: 2346, comm: modprobe Not tainted 2.6.29.6 #1
[ 214.142292] Call Trace:
[ 214.142304] [<ffffffff803d6956>] debug_smp_processor_id+0xca/0xe0
[ 214.142316] [<ffffffffa02b5727>] pfm_amd64_probe_pmu+0x22/0x3f7
[perfmon_amd64]
[ 214.142328] [<ffffffff80280a32>] ?
tracepoint_update_probe_range+0x92/0x9b
[ 214.142342] [<ffffffffa02ba000>] ?
pfm_amd64_pmu_init_module+0x0/0x12 [perfmon_amd64]
[ 214.142350] [<ffffffff803c770a>] pfm_pmu_register+0x5f/0x301
[ 214.142359] [<ffffffffa02ba000>] ?
pfm_amd64_pmu_init_module+0x0/0x12 [perfmon_amd64]
[ 214.142367] [<ffffffffa02ba010>] pfm_amd64_pmu_init_module+0x10/0x12
[perfmon_amd64]
[ 214.142375] [<ffffffff80209056>] _stext+0x56/0x12b
[ 214.142383] [<ffffffff80251987>] ? up_read+0x9/0xb
[ 214.142391] [<ffffffff802523e8>] ?
__blocking_notifier_call_chain+0x56/0x62
[ 214.142403] [<ffffffff80261741>] sys_init_module+0xa5/0x1c8
[ 214.142409] [<ffffffff8020b79b>] system_call_fastpath+0x16/0x1b
[ 214.142419] BUG: using smp_processor_id() in preemptible [00000000]
code: modprobe/2346
[ 214.142428] caller is pfm_amd64_probe_pmu+0x4a/0x3f7 [perfmon_amd64]
[ 214.142450] Pid: 2346, comm: modprobe Not tainted 2.6.29.6 #1
[ 214.142457] Call Trace:
[ 214.142463] [<ffffffff803d6956>] debug_smp_processor_id+0xca/0xe0
[ 214.142475] [<ffffffffa02b574f>] pfm_amd64_probe_pmu+0x4a/0x3f7
[perfmon_amd64]
[ 214.142487] [<ffffffff80280a32>] ?
tracepoint_update_probe_range+0x92/0x9b
[ 214.142498] [<ffffffffa02ba000>] ?
pfm_amd64_pmu_init_module+0x0/0x12 [perfmon_amd64]
[ 214.142511] [<ffffffff803c770a>] pfm_pmu_register+0x5f/0x301
[ 214.142521] [<ffffffffa02ba000>] ?
pfm_amd64_pmu_init_module+0x0/0x12 [perfmon_amd64]
[ 214.142535] [<ffffffffa02ba010>] pfm_amd64_pmu_init_module+0x10/0x12
[perfmon_amd64]
[ 214.142542] [<ffffffff80209056>] _stext+0x56/0x12b
[ 214.142551] [<ffffffff80251987>] ? up_read+0x9/0xb
[ 214.142561] [<ffffffff802523e8>] ?
__blocking_notifier_call_chain+0x56/0x62
[ 214.142573] [<ffffffff80261741>] sys_init_module+0xa5/0x1c8
[ 214.142585] [<ffffffff8020b79b>] system_call_fastpath+0x16/0x1b
[ 214.142593] BUG: using smp_processor_id() in preemptible [00000000]
code: modprobe/2346
[ 214.142601] caller is pfm_amd64_probe_pmu+0xae/0x3f7 [perfmon_amd64]
[ 214.142609] Pid: 2346, comm: modprobe Not tainted 2.6.29.6 #1
[ 214.142612] Call Trace:
[ 214.142617] [<ffffffff803d6956>] debug_smp_processor_id+0xca/0xe0
[ 214.142631] [<ffffffffa02b57b3>] pfm_amd64_probe_pmu+0xae/0x3f7
[perfmon_amd64]
[ 214.142643] [<ffffffff80280a32>] ?
tracepoint_update_probe_range+0x92/0x9b
[ 214.142654] [<ffffffffa02ba000>] ?
pfm_amd64_pmu_init_module+0x0/0x12 [perfmon_amd64]
[ 214.142667] [<ffffffff803c770a>] pfm_pmu_register+0x5f/0x301
[ 214.142678] [<ffffffffa02ba000>] ?
pfm_amd64_pmu_init_module+0x0/0x12 [perfmon_amd64]
[ 214.142689] [<ffffffffa02ba010>] pfm_amd64_pmu_init_module+0x10/0x12
[perfmon_amd64]
[ 214.142701] [<ffffffff80209056>] _stext+0x56/0x12b
[ 214.142706] [<ffffffff80251987>] ? up_read+0x9/0xb
[ 214.142715] [<ffffffff802523e8>] ?
__blocking_notifier_call_chain+0x56/0x62
[ 214.142726] [<ffffffff80261741>] sys_init_module+0xa5/0x1c8
[ 214.142735] [<ffffffff8020b79b>] system_call_fastpath+0x16/0x1b
[ 214.142744] BUG: using smp_processor_id() in preemptible [00000000]
code: modprobe/2346
[ 214.142752] caller is pfm_amd64_probe_pmu+0xef/0x3f7 [perfmon_amd64]
[ 214.142758] Pid: 2346, comm: modprobe Not tainted 2.6.29.6 #1
[ 214.142761] Call Trace:
[ 214.142766] [<ffffffff803d6956>] debug_smp_processor_id+0xca/0xe0
[ 214.142778] [<ffffffffa02b57f4>] pfm_amd64_probe_pmu+0xef/0x3f7
[perfmon_amd64]
[ 214.142790] [<ffffffff80280a32>] ?
tracepoint_update_probe_range+0x92/0x9b
[ 214.142801] [<ffffffffa02ba000>] ?
pfm_amd64_pmu_init_module+0x0/0x12 [perfmon_amd64]
[ 214.142814] [<ffffffff803c770a>] pfm_pmu_register+0x5f/0x301
[ 214.142824] [<ffffffffa02ba000>] ?
pfm_amd64_pmu_init_module+0x0/0x12 [perfmon_amd64]
[ 214.142838] [<ffffffffa02ba010>] pfm_amd64_pmu_init_module+0x10/0x12
[perfmon_amd64]
[ 214.142851] [<ffffffff80209056>] _stext+0x56/0x12b
[ 214.142860] [<ffffffff80251987>] ? up_read+0x9/0xb
[ 214.142871] [<ffffffff802523e8>] ?
__blocking_notifier_call_chain+0x56/0x62
[ 214.142891] [<ffffffff80261741>] sys_init_module+0xa5/0x1c8
[ 214.142904] [<ffffffff8020b79b>] system_call_fastpath+0x16/0x1b
[ 214.142916] found family=16 IBSCTL=0x0
[ 214.148340] BUG: using smp_processor_id() in preemptible [00000000]
code: modprobe/2346
[ 214.148357] caller is pfm_amd64_probe_pmu+0x13a/0x3f7 [perfmon_amd64]
[ 214.148365] Pid: 2346, comm: modprobe Not tainted 2.6.29.6 #1
[ 214.148370] Call Trace:
[ 214.148386] [<ffffffff803d6956>] debug_smp_processor_id+0xca/0xe0
[ 214.148397] [<ffffffffa02b583f>] pfm_amd64_probe_pmu+0x13a/0x3f7
[perfmon_amd64]
[ 214.148409] [<ffffffff80280a32>] ?
tracepoint_update_probe_range+0x92/0x9b
[ 214.148425] [<ffffffffa02ba000>] ?
pfm_amd64_pmu_init_module+0x0/0x12 [perfmon_amd64]
[ 214.148436] [<ffffffff803c770a>] pfm_pmu_register+0x5f/0x301
[ 214.148445] [<ffffffffa02ba000>] ?
pfm_amd64_pmu_init_module+0x0/0x12 [perfmon_amd64]
[ 214.148453] [<ffffffffa02ba010>] pfm_amd64_pmu_init_module+0x10/0x12
[perfmon_amd64]
[ 214.148465] [<ffffffff80209056>] _stext+0x56/0x12b
[ 214.148473] [<ffffffff80251987>] ? up_read+0x9/0xb
[ 214.148480] [<ffffffff802523e8>] ?
__blocking_notifier_call_chain+0x56/0x62
[ 214.148491] [<ffffffff80261741>] sys_init_module+0xa5/0x1c8
[ 214.148499] [<ffffffff8020b79b>] system_call_fastpath+0x16/0x1b
[ 214.148508] NorthBridge socket-level event access control: enabled
[ 214.148534] Failed to initialize extended interrupts for IBS
[ 214.148538] Unable to use IBS
[ 214.148542] BUG: using smp_processor_id() in preemptible [00000000]
code: modprobe/2346
[ 214.148551] caller is pfm_amd64_probe_pmu+0x3a3/0x3f7 [perfmon_amd64]
[ 214.148556] Pid: 2346, comm: modprobe Not tainted 2.6.29.6 #1
[ 214.148560] Call Trace:
[ 214.148566] [<ffffffff803d6956>] debug_smp_processor_id+0xca/0xe0
[ 214.148575] [<ffffffffa02b5aa8>] pfm_amd64_probe_pmu+0x3a3/0x3f7
[perfmon_amd64]
[ 214.148582] [<ffffffff80280a32>] ?
tracepoint_update_probe_range+0x92/0x9b
[ 214.148591] [<ffffffffa02ba000>] ?
pfm_amd64_pmu_init_module+0x0/0x12 [perfmon_amd64]
[ 214.148603] [<ffffffff803c770a>] pfm_pmu_register+0x5f/0x301
[ 214.148611] [<ffffffffa02ba000>] ?
pfm_amd64_pmu_init_module+0x0/0x12 [perfmon_amd64]
[ 214.148620] [<ffffffffa02ba010>] pfm_amd64_pmu_init_module+0x10/0x12
[perfmon_amd64]
[ 214.148626] [<ffffffff80209056>] _stext+0x56/0x12b
[ 214.148632] [<ffffffff80251987>] ? up_read+0x9/0xb
[ 214.148638] [<ffffffff802523e8>] ?
__blocking_notifier_call_chain+0x56/0x62
[ 214.148646] [<ffffffff80261741>] sys_init_module+0xa5/0x1c8
[ 214.148654] [<ffffffff8020b79b>] system_call_fastpath+0x16/0x1b
[ 214.148763] AMD64 PMU installed
[ 543.442145] 3w-9xxx: scsi0: AEN: INFO (0x04:0x0029): Verify
started:unit=0.
[ 571.776021] BUG: using smp_processor_id() in preemptible [00000000]
code: facesim/2355
[ 571.776035] caller is __pfm_exit_thread+0xaf/0xe8
[ 571.776041] Pid: 2355, comm: facesim Not tainted 2.6.29.6 #1
[ 571.776044] Call Trace:
[ 571.776053] [<ffffffff803d6956>] debug_smp_processor_id+0xca/0xe0
[ 571.776058] [<ffffffff803cb99f>] __pfm_exit_thread+0xaf/0xe8
[ 571.776064] [<ffffffff8020a5a4>] exit_thread+0x127/0x130
[ 571.776071] [<ffffffff8023e980>] do_exit+0x235/0x755
[ 571.776076] [<ffffffff802a1273>] ? do_munmap+0x2f1/0x313
[ 571.776082] [<ffffffff805df544>] ? _spin_lock_irqsave+0x18/0x34
[ 571.776086] [<ffffffff8023ef13>] do_group_exit+0x73/0xa0
[ 571.776090] [<ffffffff8023ef52>] sys_exit_group+0x12/0x16
[ 571.776094] [<ffffffff8020b79b>] system_call_fastpath+0x16/0x1b
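
For anyone triaging the log above: every "BUG: using smp_processor_id() in preemptible" entry is followed by a "caller is ..." line, and in this log those callers are pfm_arch_get_pmu_module_name, pfm_amd64_probe_pmu and __pfm_exit_thread. A minimal Python sketch such as the following tallies the callers from a saved dmesg dump. The names CALLER_RE and count_callers are my own and not part of perfmon, and the sketch only counts warnings; it does not show whether they are related to the panic.

```python
import re
from collections import Counter

# Matches e.g. "caller is pfm_amd64_probe_pmu+0x22/0x3f7" and captures
# the function name in front of the "+offset/size" suffix.
CALLER_RE = re.compile(r"caller is (\S+?)\+0x[0-9a-f]+/0x[0-9a-f]+")


def count_callers(dmesg_text):
    """Count how often each function is reported as the caller."""
    return Counter(m.group(1) for m in CALLER_RE.finditer(dmesg_text))


# A few lines taken from the log above, used here as sample input.
sample = """\
[ 214.111070] caller is pfm_arch_get_pmu_module_name+0x1a/0xe8
[ 214.142281] caller is pfm_amd64_probe_pmu+0x22/0x3f7 [perfmon_amd64]
[ 214.142428] caller is pfm_amd64_probe_pmu+0x4a/0x3f7 [perfmon_amd64]
[ 571.776035] caller is __pfm_exit_thread+0xaf/0xe8
"""

for func, n in count_callers(sample).most_common():
    print(f"{n:3d}  {func}")
```

To run it on a full log, read the file into a string and pass it to count_callers() in place of the sample text.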