Re: system 20% busy at all times?

2013-02-20 Thread Eggert, Lars
Hi,

On Feb 19, 2013, at 17:58, Adrian Chadd adr...@freebsd.org wrote:
 Try top -HS .. to try and break down the kernel threads.

ACPI is eating the cycles, according to top:

0 root 80 0K   496K -   2   1:13 27.88% 
kernel{acpi_task_2}
0 root 80 0K   496K -   0   1:13 25.68% 
kernel{acpi_task_1}
0 root 80 0K   496K CPU11   1:07 23.68% 
kernel{acpi_task_0}

I got an off-list hint that the machine in question requires device mptable 
instead of relying on ACPI. I will try that.

As for dtrace, a complete buildworld/installworld cycle didn't change things, I 
still get:

# dtrace -n 'syscall:::entry { @num[execname] = count(); }'
dtrace: invalid probe specifier syscall:::entry { @num[execname] = count(); }: 
/usr/lib/dtrace/psinfo.d, line 90: failed to resolve type kernel`struct 
thread * for identifier curthread: Module is no longer loaded

Thanks for all the help!

Lars
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


system 20% busy at all times?

2013-02-19 Thread Eggert, Lars
Hi,

I have a system running -CURRENT that in top(1) is showing ~20% CPU usage for 
the system at all times. Any ideas what could be causing this, or how I would 
go about diagnosing this further? Nothing in the logs.

Thanks,
Lars

PS: dmesg attached, in case it helps:

Copyright (c) 1992-2013 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 10.0-CURRENT #11 r+2fc9b3d: Tue Feb 12 19:32:15 CET 2013

el...@stanley.muccbc.hq.netapp.com:/home/elars/obj/usr/home/elars/src/sys/FAS3270
 amd64
FreeBSD clang version 3.2 (tags/RELEASE_32/final 170710) 20121221
CPU: Intel(R) Xeon(R) CPU   E5240  @ 3.00GHz (3000.17-MHz K8-class CPU)
  Origin = GenuineIntel  Id = 0x1067a  Family = 0x6  Model = 0x17  Stepping = 
10
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  
Features2=0xc0ce3bdSSE3,DTES64,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,DCA,SSE4.1,XSAVE,OSXSAVE
  AMD Features=0x20100800SYSCALL,NX,LM
  AMD Features2=0x1LAHF
  TSC: P-state invariant, performance statistics
real memory  = 18253611008 (17408 MB)
avail memory = 16526143488 (15760 MB)
Event timer LAPIC quality 400
ACPI APIC Table: PTLTD  CARNEGIE
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
FreeBSD/SMP: 2 package(s) x 2 core(s)
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  1
 cpu2 (AP): APIC ID:  6
 cpu3 (AP): APIC ID:  7
ioapic0 Version 2.0 irqs 0-23 on motherboard
kbd0 at kbdmux0
ctl: CAM Target Layer loaded
smbios0: System Management BIOS at iomem 0xf6c00-0xf6c1e on motherboard
smbios0: Version: 2.5
cryptosoft0: software crypto on motherboard
acpi0: PTLTD CARNEGIE on motherboard
acpi0: Power Button (fixed)
cpu0: ACPI CPU on acpi0
ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
(20130117/psargs-393)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
(20130117/psargs-393)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._PDC] (Node 
0xfe0007630c40), AE_NOT_FOUND (20130117/psparse-560)
cpu1: ACPI CPU on acpi0
cpu2: ACPI CPU on acpi0
cpu3: ACPI CPU on acpi0
atrtc0: AT realtime clock port 0x70-0x71 irq 8 on acpi0
Event timer RTC frequency 32768 Hz quality 0
attimer0: AT timer port 0x40-0x43,0x50-0x53 irq 0 on acpi0
Timecounter i8254 frequency 1193182 Hz quality 0
Event timer i8254 frequency 1193182 Hz quality 100
Timecounter ACPI-safe frequency 3579545 Hz quality 850
acpi_timer0: 24-bit timer at 3.579545MHz port 0x1008-0x100b on acpi0
pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
pci0: ACPI PCI bus on pcib0
pcib1: ACPI PCI-PCI bridge at device 2.0 on pci0
pci1: ACPI PCI bus on pcib1
pci1: network, ethernet at device 0.0 (no driver attached)
pcib2: PCI-PCI bridge at device 3.0 on pci0
pci2: PCI bus on pcib2
pcib3: ACPI PCI-PCI bridge at device 4.0 on pci0
pci3: ACPI PCI bus on pcib3
pcib4: ACPI PCI-PCI bridge mem 0xdeb0-0xdeb1 irq 16 at device 0.0 on 
pci3
pci4: ACPI PCI bus on pcib4
pcib4: no PRT entry for 4.4.INTA
pcib4: no PRT entry for 4.5.INTA
pcib4: no PRT entry for 4.8.INTA
pcib5: PCI-PCI bridge irq 5 at device 4.0 on pci4
pci5: PCI bus on pcib5
pcib6: PCI-PCI bridge irq 10 at device 5.0 on pci4
pci6: PCI bus on pcib6
pcib4: no PRT entry for 4.5.INTA
pcib4: no PRT entry for 4.5.INTB
ix0: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 2.5.0 mem 
0xdec0-0xdec7,0xded0-0xded03fff irq 10 at device 0.0 on pci6
ix0: Using MSIX interrupts with 5 vectors
ix0: Ethernet address: 90:e2:ba:2b:3b:6c
ix0: PCI Express Bus: Speed 5.0Gb/s Width x8
ix1: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 2.5.0 mem 
0xdec8-0xdecf,0xded04000-0xded07fff irq 11 at device 0.1 on pci6
ix1: Using MSIX interrupts with 5 vectors
ix1: Ethernet address: 90:e2:ba:2b:3b:6d
ix1: PCI Express Bus: Speed 5.0Gb/s Width x8
pcib7: ACPI PCI-PCI bridge irq 5 at device 8.0 on pci4
pci7: ACPI PCI bus on pcib7
pcib8: PCI-PCI bridge at device 5.0 on pci0
pci8: PCI bus on pcib8
pcib9: PCI-PCI bridge at device 6.0 on pci0
pci9: PCI bus on pcib9
pcib10: PCI-PCI bridge mem 0xdee0-0xdee1 irq 16 at device 0.0 on pci9
pci10: PCI bus on pcib10
pcib11: PCI-PCI bridge irq 16 at device 0.0 on pci10
pci11: PCI bus on pcib11
pcib12: PCI-PCI bridge mem 0xdef0-0xdef1 irq 16 at device 0.0 on pci11
pci12: PCI bus on pcib12
pcib13: PCI-PCI bridge irq 17 at device 1.0 on pci12
pci13: PCI bus on pcib13
pcib14: PCI-PCI bridge irq 16 at device 4.0 on pci12
pci14: PCI bus on pcib14
pcib15: PCI-PCI bridge irq 17 at device 5.0 on pci12

Re: system 20% busy at all times?

2013-02-19 Thread Fleuriot Damien

On Feb 19, 2013, at 10:36 AM, Eggert, Lars l...@netapp.com wrote:

 Hi,
 
 I have a system running -CURRENT that in top(1) is showing ~20% CPU usage for 
 the system at all times. Any ideas what could be causing this, or how I would 
 go about diagnosing this further? Nothing in the logs.
 
 Thanks,
 Lars
 
 PS: dmesg attached, in case it helps:
 
 Copyright (c) 1992-2013 The FreeBSD Project.
 Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
   The Regents of the University of California. All rights reserved.
 FreeBSD is a registered trademark of The FreeBSD Foundation.
 FreeBSD 10.0-CURRENT #11 r+2fc9b3d: Tue Feb 12 19:32:15 CET 2013

 el...@stanley.muccbc.hq.netapp.com:/home/elars/obj/usr/home/elars/src/sys/FAS3270
  amd64
 FreeBSD clang version 3.2 (tags/RELEASE_32/final 170710) 20121221
 CPU: Intel(R) Xeon(R) CPU   E5240  @ 3.00GHz (3000.17-MHz K8-class 
 CPU)
  Origin = GenuineIntel  Id = 0x1067a  Family = 0x6  Model = 0x17  Stepping 
 = 10
  
 Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  
 Features2=0xc0ce3bdSSE3,DTES64,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,DCA,SSE4.1,XSAVE,OSXSAVE
  AMD Features=0x20100800SYSCALL,NX,LM
  AMD Features2=0x1LAHF
  TSC: P-state invariant, performance statistics
 real memory  = 18253611008 (17408 MB)
 avail memory = 16526143488 (15760 MB)
 Event timer LAPIC quality 400
 ACPI APIC Table: PTLTD  CARNEGIE
 FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 FreeBSD/SMP: 2 package(s) x 2 core(s)
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  1
 cpu2 (AP): APIC ID:  6
 cpu3 (AP): APIC ID:  7
 ioapic0 Version 2.0 irqs 0-23 on motherboard
 kbd0 at kbdmux0
 ctl: CAM Target Layer loaded
 smbios0: System Management BIOS at iomem 0xf6c00-0xf6c1e on motherboard
 smbios0: Version: 2.5
 cryptosoft0: software crypto on motherboard
 acpi0: PTLTD CARNEGIE on motherboard
 acpi0: Power Button (fixed)
 cpu0: ACPI CPU on acpi0
 ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
 (20130117/psargs-393)
 ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
 0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
 ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
 (20130117/psargs-393)
 ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
 0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
 ACPI Error: Method parse/execution failed [\134_PR_.CPU0._PDC] (Node 
 0xfe0007630c40), AE_NOT_FOUND (20130117/psparse-560)
 cpu1: ACPI CPU on acpi0
 cpu2: ACPI CPU on acpi0
 cpu3: ACPI CPU on acpi0
 atrtc0: AT realtime clock port 0x70-0x71 irq 8 on acpi0
 Event timer RTC frequency 32768 Hz quality 0
 attimer0: AT timer port 0x40-0x43,0x50-0x53 irq 0 on acpi0
 Timecounter i8254 frequency 1193182 Hz quality 0
 Event timer i8254 frequency 1193182 Hz quality 100
 Timecounter ACPI-safe frequency 3579545 Hz quality 850
 acpi_timer0: 24-bit timer at 3.579545MHz port 0x1008-0x100b on acpi0
 pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
 pci0: ACPI PCI bus on pcib0
 pcib1: ACPI PCI-PCI bridge at device 2.0 on pci0
 pci1: ACPI PCI bus on pcib1
 pci1: network, ethernet at device 0.0 (no driver attached)
 pcib2: PCI-PCI bridge at device 3.0 on pci0
 pci2: PCI bus on pcib2
 pcib3: ACPI PCI-PCI bridge at device 4.0 on pci0
 pci3: ACPI PCI bus on pcib3
 pcib4: ACPI PCI-PCI bridge mem 0xdeb0-0xdeb1 irq 16 at device 0.0 
 on pci3
 pci4: ACPI PCI bus on pcib4
 pcib4: no PRT entry for 4.4.INTA
 pcib4: no PRT entry for 4.5.INTA
 pcib4: no PRT entry for 4.8.INTA
 pcib5: PCI-PCI bridge irq 5 at device 4.0 on pci4
 pci5: PCI bus on pcib5
 pcib6: PCI-PCI bridge irq 10 at device 5.0 on pci4
 pci6: PCI bus on pcib6
 pcib4: no PRT entry for 4.5.INTA
 pcib4: no PRT entry for 4.5.INTB
 ix0: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 2.5.0 mem 
 0xdec0-0xdec7,0xded0-0xded03fff irq 10 at device 0.0 on pci6
 ix0: Using MSIX interrupts with 5 vectors
 ix0: Ethernet address: 90:e2:ba:2b:3b:6c
 ix0: PCI Express Bus: Speed 5.0Gb/s Width x8
 ix1: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 2.5.0 mem 
 0xdec8-0xdecf,0xded04000-0xded07fff irq 11 at device 0.1 on pci6
 ix1: Using MSIX interrupts with 5 vectors
 ix1: Ethernet address: 90:e2:ba:2b:3b:6d
 ix1: PCI Express Bus: Speed 5.0Gb/s Width x8
 pcib7: ACPI PCI-PCI bridge irq 5 at device 8.0 on pci4
 pci7: ACPI PCI bus on pcib7
 pcib8: PCI-PCI bridge at device 5.0 on pci0
 pci8: PCI bus on pcib8
 pcib9: PCI-PCI bridge at device 6.0 on pci0
 pci9: PCI bus on pcib9
 pcib10: PCI-PCI bridge mem 0xdee0-0xdee1 irq 16 at device 0.0 on 
 pci9
 pci10: PCI bus on pcib10
 pcib11: PCI-PCI bridge irq 16 at device 0.0 on pci10
 pci11: PCI bus on pcib11
 pcib12: PCI-PCI bridge mem 0xdef0-0xdef1 irq 16 at device 0.0 on 
 pci11
 pci12: PCI bus on pcib12
 pcib13: PCI-PCI bridge irq 17 at device 

Re: system 20% busy at all times?

2013-02-19 Thread Eggert, Lars
Hi,

On Feb 19, 2013, at 10:40, Fleuriot Damien m...@my.gd
 wrote:
 What about reviewing top(1) ?

top shows the ~20% I mentioned:

last pid:  3176;  load averages:  0.79,  0.80,  0.84
 up 0+14:49:49  09:43:51
17 processes:  1 running, 16 sleeping
CPU:  0.0% user,  0.0% nice, 18.7% system,  0.0% interrupt, 81.3% idle
Mem: 32M Active, 9456K Inact, 196M Wired, 19M Buf, 15G Free
Swap: 

  PID USERNAMETHR PRI NICE   SIZERES STATE   C   TIME   WCPU COMMAND
 3002 root  1  200 14264K  1664K select  0   0:02  0.00% powerd
 2999 root  1  200 25120K  3304K select  3   0:01  0.00% ntpd
 3084 root  1  200 81420K  6120K select  0   0:00  0.00% sshd
 3094 root  1  200 17180K  3956K pause   1   0:00  0.00% csh
 3062 root  1  210 17180K  3900K ttyin   2   0:00  0.00% csh
 2867 root  1  200 14296K  2028K select  1   0:00  0.00% syslogd
 2959 root  1  520 20500K  7512K rpcsvc  3   0:00  0.00% rpc.lockd
 2943 root  1  200 16376K  2064K select  2   0:00  0.00% rpcbind
 3061 root  1  200 47504K  2604K wait3   0:00  0.00% login
 2945 root  1  200   274M  7448K select  1   0:00  0.00% rpc.statd
 3176 root  1  200 19608K  2996K CPU31   0:00  0.00% top
 2676 root  1  200  9016K  4652K select  0   0:00  0.00% devd
 3014 root  1  200 56152K  4964K select  2   0:00  0.00% sshd
 2562 root  1  290 14416K  2224K select  1   0:00  0.00% dhclient
 2629 _dhcp 1  200 14416K  2240K select  2   0:00  0.00% dhclient
 3065 root  1  200 14528K  1708K select  2   0:00  0.00% netserver
 2708 root  1  200 14232K  1568K select  2   0:00  0.00% rtsold


 or possibly ps(1) aufx

# ps -aufx
USER   PID  %CPU %MEMVSZ  RSS TT  STAT STARTED   TIME COMMAND
root10 346.8  0.0  0   64  -  RL6:54PM 2862:46.43 [idle]
root 0  64.1  0.0  0  496  -  DLs   6:54PM  694:47.32 [kernel]
root 1   0.0  0.0   9344  792  -  ILs   6:54PM0:00.09 /sbin/init --
root 2   0.0  0.0  0   16  -  DL6:54PM0:00.00 [crypto]
root 3   0.0  0.0  0   16  -  DL6:54PM0:00.00 [crypto returns]
root 4   0.0  0.0  0   16  -  DL6:54PM0:00.00 [ctl_thrd]
root 5   0.0  0.0  0   16  -  DL6:54PM0:00.00 [xpt_thrd]
root 6   0.0  0.0  0   16  -  DL6:54PM0:00.00 [ipmi0: kcs]
root 7   0.0  0.0  0   16  -  DL6:54PM0:00.04 [pagedaemon]
root 8   0.0  0.0  0   16  -  DL6:54PM0:00.00 [pagezero]
root 9   0.0  0.0  0   16  -  DL6:54PM0:00.15 [bufdaemon]
root11   0.0  0.0  0  416  -  WL6:54PM0:10.47 [intr]
root12   0.0  0.0  0   48  -  DL6:54PM0:00.03 [geom]
root13   0.0  0.0  0   16  -  DL6:54PM0:01.87 [yarrow]
root14   0.0  0.0  0  256  -  DL6:54PM0:00.50 [usb]
root15   0.0  0.0  0   16  -  DL6:54PM0:00.17 [vnlru]
root16   0.0  0.0  0   16  -  DL6:54PM0:00.56 [syncer]
root17   0.0  0.0  0   16  -  DL6:54PM0:00.21 [softdepflush]
root42   0.0  0.0  0   16  -  DL6:54PM0:00.03 [md0]
root53   0.0  0.0  0   16  -  DL6:54PM0:00.00 [md1]
root   120   0.0  0.0  0   16  -  DL6:54PM0:00.00 [md2]
root   125   0.0  0.0  0   16  -  DL6:54PM0:00.00 [md3]
root  2562   0.0  0.0  14416 2224  -  Is6:54PM0:00.00 dhclient: em4 
[priv] (dhclient)
_dhcp 2629   0.0  0.0  14416 2240  -  Is6:54PM0:00.00 dhclient: em4 
(dhclient)
root  2676   0.0  0.0   9016 4652  -  Is6:54PM0:00.01 /sbin/devd
root  2708   0.0  0.0  14232 1568  -  Is6:54PM0:00.00 /usr/sbin/rtsold 
-a
root  2867   0.0  0.0  14296 2028  -  Ss6:54PM0:00.04 /usr/sbin/syslogd 
-s
root  2943   0.0  0.0  16376 2064  -  Ss6:54PM0:00.03 /usr/sbin/rpcbind
root  2945   0.0  0.0 280472 7448  -  Ss6:54PM0:00.02 
/usr/sbin/rpc.statd
root  2959   0.0  0.0  20500 7512  -  Ss6:54PM0:00.04 
/usr/sbin/rpc.lockd
root  2999   0.0  0.0  25120 3304  -  Ss6:54PM0:00.85 /usr/sbin/ntpd -g 
-c /etc/ntp.conf -p /var/run/ntpd.pid -f /var/db/ntpd.drift
root  3002   0.0  0.0  14264 1664  -  Ss6:54PM0:01.61 /usr/sbin/powerd
root  3014   0.0  0.0  56152 4964  -  Is6:54PM0:00.00 /usr/sbin/sshd -o 
PermitRootLogin=without-password
root  3065   0.0  0.0  14528 1708  -  Is6:54PM0:00.00 netserver
root  3084   0.0  0.0  81420 6120  -  Ss9:21AM0:00.09 sshd: root@pts/0 
(sshd)
root  3061   0.0  0.0  47504 2604 u0  Is6:54PM0:00.02 login [pam] 
(login)
root  3062   0.0  0.0  17180 3900 u0  I+6:54PM0:00.05 -csh (csh)
root  3094   0.0  0.0  17180 3956  0  Ss9:32AM0:00.05 -csh (csh)
root  3177   0.0  0.0  16436 1900  0  R+9:44AM0:00.00 ps -aufx


 At least you 

Re: system 20% busy at all times?

2013-02-19 Thread Fleuriot Damien

On Feb 19, 2013, at 10:44 AM, Eggert, Lars l...@netapp.com wrote:

 Hi,
 
 On Feb 19, 2013, at 10:40, Fleuriot Damien m...@my.gd
 wrote:
 What about reviewing top(1) ?
 
 top shows the ~20% I mentioned:
 
 last pid:  3176;  load averages:  0.79,  0.80,  0.84  
up 0+14:49:49  
 09:43:51
 17 processes:  1 running, 16 sleeping
 CPU:  0.0% user,  0.0% nice, 18.7% system,  0.0% interrupt, 81.3% idle
 Mem: 32M Active, 9456K Inact, 196M Wired, 19M Buf, 15G Free
 Swap: 
 


Ok the ~20% is from the system itself.


 
 or possibly ps(1) aufx
 
 # ps -aufx
 USER   PID  %CPU %MEMVSZ  RSS TT  STAT STARTED   TIME COMMAND
 root10 346.8  0.0  0   64  -  RL6:54PM 2862:46.43 [idle]
 root 0  64.1  0.0  0  496  -  DLs   6:54PM  694:47.32 [kernel]
 

65% of a CPU core only for the kernel...


 
 
 At least you should be able to see what takes up CPU:
 - system
 - user processes
 - interrupts
 
 # vmstat -i
 interrupt  total   rate
 irq3: uart111535  0
 irq4: uart0 1227  0
 irq9: acpi0   1989762564  37379
 irq16: uhci0 uhci1+  393  0
 cpu0:timer  32147924603
 irq270: em4  1907258 35
 cpu3:timer  63027976   1184
 cpu2:timer  56428246   1060
 cpu1:timer  44799884841
 Total 2188087007  41104
 
 So it seems that irq 9 is firing a whole lot. Why?


And indeed we find your answer here, acpi0 firing up a lot of interrupts.

Don't you get any message about that in dmesg -a or /var/log/messages ?

I'd expect something like interrupt storm blabla… source throttled blabla..



From man 4 acpi , in /boot/loader.conf :
hint.acpi.0.disabled=1
 Set this to 1 to disable all of ACPI.  If ACPI has been disabled
 on your system due to a blacklist entry for your BIOS, you can
 set this to 0 to re-enable ACPI for testing.


Any chance you could reboot the host with ACPI disabled ?


If that helps your CPU load, try setting this in /boot/loader.conf :
hw.acpi.verbose=1
Turn on verbose debugging information about what ACPI is doing.


Hoping this gets some logs :)

___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: system 20% busy at all times?

2013-02-19 Thread Eggert, Lars
Hi,

On Feb 19, 2013, at 10:54, Fleuriot Damien m...@my.gd
 wrote:
 And indeed we find your answer here, acpi0 firing up a lot of interrupts.
 
 Don't you get any message about that in dmesg -a or /var/log/messages ?
 
 I'd expect something like interrupt storm blabla… source throttled blabla..

nope. The only odd ACPI-related messages I see in dmesg are these:

ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
(20130117/psargs-393)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
(20130117/psargs-393)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._PDC] (Node 
0xfe0007630c40), AE_NOT_FOUND (20130117/psparse-560)

Nothing in syslog.

 From man 4 acpi , in /boot/loader.conf :
 hint.acpi.0.disabled=1
 Set this to 1 to disable all of ACPI.  If ACPI has been disabled
 on your system due to a blacklist entry for your BIOS, you can
 set this to 0 to re-enable ACPI for testing.
 
 Any chance you could reboot the host with ACPI disabled ?

If I do that, I get an early kernel crash:

Loading 10.11.12.13/~elars/kernel/kernel:0x20/7634255 0xb47d50/473552 
0xbbb720/890736 Entry at 0x802746f0
Closing network.
Starting program at 0x802746f0
GDB: no debug ports present
KDB: debugger backends: ddb
KDB: current backend: ddb
panic: running without device atpic requires a local APIC
cpuid = 0
KDB: stack backtrace:
kernel trap 12 with interrupts disabled


Fatal trap 12: page fault while in kernel mode
cpuid = 0; apic id = 00
fault virtual address   = 0x0
fault code  = supervisor read data, page not present
instruction pointer = 0x20:0x805c2973
stack pointer   = 0x28:0x80c9a960
frame pointer   = 0x28:0x80c9aa80
code segment= base 0x0, limit 0xf, type 0x1b
= DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags= resume, IOPL = 0
current process = 0 ()
[ thread pid 0 tid 0 ]
Stopped at  0x805c2973: movzbl  (%rdi),%ecx


 If that helps your CPU load, try setting this in /boot/loader.conf :
 hw.acpi.verbose=1
   Turn on verbose debugging information about what ACPI is doing.

Done, but it doesn't really result in any additional messages:

# dmesg | grep -i acpi
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
ACPI APIC Table: PTLTD  CARNEGIE
acpi0: PTLTD CARNEGIE on motherboard
acpi0: Power Button (fixed)
cpu0: ACPI CPU on acpi0
ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
(20130117/psargs-393)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
(20130117/psargs-393)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._PDC] (Node 
0xfe0007630c40), AE_NOT_FOUND (20130117/psparse-560)
cpu1: ACPI CPU on acpi0
cpu2: ACPI CPU on acpi0
cpu3: ACPI CPU on acpi0
atrtc0: AT realtime clock port 0x70-0x71 irq 8 on acpi0
attimer0: AT timer port 0x40-0x43,0x50-0x53 irq 0 on acpi0
Timecounter ACPI-fast frequency 3579545 Hz quality 900
acpi_timer0: 24-bit timer at 3.579545MHz port 0x1008-0x100b on acpi0
pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
pci0: ACPI PCI bus on pcib0
pcib1: ACPI PCI-PCI bridge at device 2.0 on pci0
pci1: ACPI PCI bus on pcib1
pcib3: ACPI PCI-PCI bridge at device 4.0 on pci0
pci3: ACPI PCI bus on pcib3
pcib4: ACPI PCI-PCI bridge mem 0xdeb0-0xdeb1 irq 16 at device 0.0 on 
pci3
pci4: ACPI PCI bus on pcib4
pcib7: ACPI PCI-PCI bridge irq 5 at device 8.0 on pci4
pci7: ACPI PCI bus on pcib7
pcib29: ACPI PCI-PCI bridge irq 16 at device 28.0 on pci0
pci29: ACPI PCI bus on pcib29
pcib30: ACPI PCI-PCI bridge irq 16 at device 28.4 on pci0
pci30: ACPI PCI bus on pcib30
pcib31: ACPI PCI-PCI bridge irq 17 at device 28.5 on pci0
pci31: ACPI PCI bus on pcib31
pcib32: ACPI PCI-PCI bridge at device 30.0 on pci0
pci32: ACPI PCI bus on pcib32
acpi_button0: Power Button on acpi0
uart0: 16550 or compatible port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
uart1: 16550 or compatible port 0x2f8-0x2ff irq 3 on acpi0

Lars

___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: system 20% busy at all times?

2013-02-19 Thread Fleuriot Damien

On Feb 19, 2013, at 11:16 AM, Eggert, Lars l...@netapp.com wrote:

 Hi,
 
 On Feb 19, 2013, at 10:54, Fleuriot Damien m...@my.gd
 wrote:
 And indeed we find your answer here, acpi0 firing up a lot of interrupts.
 
 Don't you get any message about that in dmesg -a or /var/log/messages ?
 
 I'd expect something like interrupt storm blabla… source throttled blabla..
 
 nope. The only odd ACPI-related messages I see in dmesg are these:
 
 ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
 (20130117/psargs-393)
 ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
 0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
 ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
 (20130117/psargs-393)
 ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
 0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
 ACPI Error: Method parse/execution failed [\134_PR_.CPU0._PDC] (Node 
 0xfe0007630c40), AE_NOT_FOUND (20130117/psparse-560)
 
 Nothing in syslog.
 
 From man 4 acpi , in /boot/loader.conf :
 hint.acpi.0.disabled=1
Set this to 1 to disable all of ACPI.  If ACPI has been disabled
on your system due to a blacklist entry for your BIOS, you can
set this to 0 to re-enable ACPI for testing.
 
 Any chance you could reboot the host with ACPI disabled ?
 
 If I do that, I get an early kernel crash:
 
 Loading 10.11.12.13/~elars/kernel/kernel:0x20/7634255 0xb47d50/473552 
 0xbbb720/890736 Entry at 0x802746f0
 Closing network.
 Starting program at 0x802746f0
 GDB: no debug ports present
 KDB: debugger backends: ddb
 KDB: current backend: ddb
 panic: running without device atpic requires a local APIC
 cpuid = 0
 KDB: stack backtrace:
 kernel trap 12 with interrupts disabled
 
 
 Fatal trap 12: page fault while in kernel mode
 cpuid = 0; apic id = 00
 fault virtual address = 0x0
 fault code= supervisor read data, page not present
 instruction pointer   = 0x20:0x805c2973
 stack pointer = 0x28:0x80c9a960
 frame pointer = 0x28:0x80c9aa80
 code segment  = base 0x0, limit 0xf, type 0x1b
   = DPL 0, pres 1, long 1, def32 0, gran 1
 processor eflags  = resume, IOPL = 0
 current process   = 0 ()
 [ thread pid 0 tid 0 ]
 Stopped at  0x805c2973: movzbl  (%rdi),%ecx
 
 
 If that helps your CPU load, try setting this in /boot/loader.conf :
 hw.acpi.verbose=1
  Turn on verbose debugging information about what ACPI is doing.
 
 Done, but it doesn't really result in any additional messages:
 
 # dmesg | grep -i acpi
  
 Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
 ACPI APIC Table: PTLTD  CARNEGIE
 acpi0: PTLTD CARNEGIE on motherboard
 acpi0: Power Button (fixed)
 cpu0: ACPI CPU on acpi0
 ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
 (20130117/psargs-393)
 ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
 0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
 ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
 (20130117/psargs-393)
 ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
 0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
 ACPI Error: Method parse/execution failed [\134_PR_.CPU0._PDC] (Node 
 0xfe0007630c40), AE_NOT_FOUND (20130117/psparse-560)
 cpu1: ACPI CPU on acpi0
 cpu2: ACPI CPU on acpi0
 cpu3: ACPI CPU on acpi0
 atrtc0: AT realtime clock port 0x70-0x71 irq 8 on acpi0
 attimer0: AT timer port 0x40-0x43,0x50-0x53 irq 0 on acpi0
 Timecounter ACPI-fast frequency 3579545 Hz quality 900
 acpi_timer0: 24-bit timer at 3.579545MHz port 0x1008-0x100b on acpi0
 pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
 pci0: ACPI PCI bus on pcib0
 pcib1: ACPI PCI-PCI bridge at device 2.0 on pci0
 pci1: ACPI PCI bus on pcib1
 pcib3: ACPI PCI-PCI bridge at device 4.0 on pci0
 pci3: ACPI PCI bus on pcib3
 pcib4: ACPI PCI-PCI bridge mem 0xdeb0-0xdeb1 irq 16 at device 0.0 
 on pci3
 pci4: ACPI PCI bus on pcib4
 pcib7: ACPI PCI-PCI bridge irq 5 at device 8.0 on pci4
 pci7: ACPI PCI bus on pcib7
 pcib29: ACPI PCI-PCI bridge irq 16 at device 28.0 on pci0
 pci29: ACPI PCI bus on pcib29
 pcib30: ACPI PCI-PCI bridge irq 16 at device 28.4 on pci0
 pci30: ACPI PCI bus on pcib30
 pcib31: ACPI PCI-PCI bridge irq 17 at device 28.5 on pci0
 pci31: ACPI PCI bus on pcib31
 pcib32: ACPI PCI-PCI bridge at device 30.0 on pci0
 pci32: ACPI PCI bus on pcib32
 acpi_button0: Power Button on acpi0
 uart0: 16550 or compatible port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
 uart1: 16550 or compatible port 0x2f8-0x2ff irq 3 on acpi0
 



Jeez, I certainly hope people more knowledgeable than me about the kernel will 
be able to make something of all this.


What about a newly build kernel without the line 

Re: system 20% busy at all times?

2013-02-19 Thread Eggert, Lars
Hi,

On Feb 19, 2013, at 11:21, Fleuriot Damien m...@my.gd wrote:
 What about a newly build kernel without the line device acpi and without 
 the options ACPI_DEBUG ?
 Hoping that this kernel:
 1/ won't crash on boot
 2/ will make the 20% cpu load and high interrupt rates disappear

I added device atpic to my kernel config and rebooted with 
hint.acpi.0.disabled=1 in the loader. I get further during boot, but then get a 
panic: No usable event timer found! Also, my is devices showed errors trying 
to allocate bus resources.

Lars
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: system 20% busy at all times?

2013-02-19 Thread Andriy Gapon

Completely disabling ACPI rarely works with modern machines.

Please try to run the following DTrace script (dtrace -s script-file) and
capture its output.

#pragma D option flowindent

fbt::acpi_intr_handler:entry
{
self-trace = 1;
}

fbt:::entry
/self-trace/
{
printf(arg0 = %#x, arg1 = %#x, arg2 = %#x, arg0, arg1, arg2);
}

fbt:::return
/self-trace/
{
printf(@%p ret = %u, arg0, arg1);
}

fbt::acpi_intr_handler:return
{
self-trace = 0;
exit(0);
}


-- 
Andriy Gapon
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: system 20% busy at all times?

2013-02-19 Thread Eggert, Lars
Hi,

thanks for looking into this!

On Feb 19, 2013, at 12:14, Andriy Gapon a...@freebsd.org wrote:
 Please try to run the following DTrace script (dtrace -s script-file) and
 capture its output.

I get this error:

# dtrace -s x
dtrace: failed to compile script x: /usr/lib/dtrace/psinfo.d, line 90: failed 
to resolve type kernel`struct thread * for identifier curthread: Module is no 
longer loaded

(New to dtrace, so no clue what this means.)

Lars
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: system 20% busy at all times?

2013-02-19 Thread Lars Engels
On Tue, Feb 19, 2013 at 12:12:32PM +, Eggert, Lars wrote:
 Hi,
 
 thanks for looking into this!
 
 On Feb 19, 2013, at 12:14, Andriy Gapon a...@freebsd.org wrote:
  Please try to run the following DTrace script (dtrace -s script-file) and
  capture its output.
 
 I get this error:
 
 # dtrace -s x
 dtrace: failed to compile script x: /usr/lib/dtrace/psinfo.d, line 90: 
 failed to resolve type kernel`struct thread * for identifier curthread: 
 Module is no longer loaded
 
 (New to dtrace, so no clue what this means.)

You need to recompile your Kernel to use DTrace:

https://wiki.freebsd.org/DTrace


The other Lars :)


pgpcWfuHeIgaQ.pgp
Description: PGP signature


Re: system 20% busy at all times?

2013-02-19 Thread Eggert, Lars
Hi,

On Feb 19, 2013, at 13:15, Lars Engels lars.eng...@0x20.net wrote:
 You need to recompile your Kernel to use DTrace:
 https://wiki.freebsd.org/DTrace

I did. But I still get that error, even with the sample from the wiki:

# dtrace -n 'syscall:::entry { @num[execname] = count(); }'
dtrace: invalid probe specifier syscall:::entry { @num[execname] = count(); }: 
/usr/lib/dtrace/psinfo.d, line 90: failed to resolve type kernel`struct 
thread * for identifier curthread: Module is no longer loaded

I cross-compile the -CURRENT world and kernel under -STABLE for netbooting. 
Could doing that cause this issue?

Lars
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: system 20% busy at all times?

2013-02-19 Thread Eggert, Lars
Hi,

On Feb 19, 2013, at 13:37, Eggert, Lars l...@netapp.com wrote:
 On Feb 19, 2013, at 13:15, Lars Engels lars.eng...@0x20.net wrote:
 You need to recompile your Kernel to use DTrace:
 https://wiki.freebsd.org/DTrace
 
 I did. But I still get that error, even with the sample from the wiki:
 
 # dtrace -n 'syscall:::entry { @num[execname] = count(); }'
 dtrace: invalid probe specifier syscall:::entry { @num[execname] = count(); 
 }: /usr/lib/dtrace/psinfo.d, line 90: failed to resolve type kernel`struct 
 thread * for identifier curthread: Module is no longer loaded
 
 I cross-compile the -CURRENT world and kernel under -STABLE for netbooting. 
 Could doing that cause this issue?

FWIW, a full buildworld/installworld of the latest -CURRENT also didn't help, 
the error remains.

Lars
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: system 20% busy at all times?

2013-02-19 Thread Adrian Chadd
Try top -HS .. to try and break down the kernel threads.



Adrian
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: system 20% busy at all times?

2013-02-19 Thread Oliver Pinter
try other event timer source

http://comments.gmane.org/gmane.os.freebsd.bugs/59695

On 2/19/13, Eggert, Lars l...@netapp.com wrote:
 Hi,

 I have a system running -CURRENT that in top(1) is showing ~20% CPU usage
 for the system at all times. Any ideas what could be causing this, or how I
 would go about diagnosing this further? Nothing in the logs.

 Thanks,
 Lars

 PS: dmesg attached, in case it helps:

 Copyright (c) 1992-2013 The FreeBSD Project.
 Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
   The Regents of the University of California. All rights reserved.
 FreeBSD is a registered trademark of The FreeBSD Foundation.
 FreeBSD 10.0-CURRENT #11 r+2fc9b3d: Tue Feb 12 19:32:15 CET 2013

 el...@stanley.muccbc.hq.netapp.com:/home/elars/obj/usr/home/elars/src/sys/FAS3270
 amd64
 FreeBSD clang version 3.2 (tags/RELEASE_32/final 170710) 20121221
 CPU: Intel(R) Xeon(R) CPU   E5240  @ 3.00GHz (3000.17-MHz K8-class
 CPU)
   Origin = GenuineIntel  Id = 0x1067a  Family = 0x6  Model = 0x17
 Stepping = 10

 Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE

 Features2=0xc0ce3bdSSE3,DTES64,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,DCA,SSE4.1,XSAVE,OSXSAVE
   AMD Features=0x20100800SYSCALL,NX,LM
   AMD Features2=0x1LAHF
   TSC: P-state invariant, performance statistics
 real memory  = 18253611008 (17408 MB)
 avail memory = 16526143488 (15760 MB)
 Event timer LAPIC quality 400
 ACPI APIC Table: PTLTD  CARNEGIE
 FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 FreeBSD/SMP: 2 package(s) x 2 core(s)
  cpu0 (BSP): APIC ID:  0
  cpu1 (AP): APIC ID:  1
  cpu2 (AP): APIC ID:  6
  cpu3 (AP): APIC ID:  7
 ioapic0 Version 2.0 irqs 0-23 on motherboard
 kbd0 at kbdmux0
 ctl: CAM Target Layer loaded
 smbios0: System Management BIOS at iomem 0xf6c00-0xf6c1e on motherboard
 smbios0: Version: 2.5
 cryptosoft0: software crypto on motherboard
 acpi0: PTLTD CARNEGIE on motherboard
 acpi0: Power Button (fixed)
 cpu0: ACPI CPU on acpi0
 ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND
 (20130117/psargs-393)
 ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node
 0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
 ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND
 (20130117/psargs-393)
 ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node
 0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
 ACPI Error: Method parse/execution failed [\134_PR_.CPU0._PDC] (Node
 0xfe0007630c40), AE_NOT_FOUND (20130117/psparse-560)
 cpu1: ACPI CPU on acpi0
 cpu2: ACPI CPU on acpi0
 cpu3: ACPI CPU on acpi0
 atrtc0: AT realtime clock port 0x70-0x71 irq 8 on acpi0
 Event timer RTC frequency 32768 Hz quality 0
 attimer0: AT timer port 0x40-0x43,0x50-0x53 irq 0 on acpi0
 Timecounter i8254 frequency 1193182 Hz quality 0
 Event timer i8254 frequency 1193182 Hz quality 100
 Timecounter ACPI-safe frequency 3579545 Hz quality 850
 acpi_timer0: 24-bit timer at 3.579545MHz port 0x1008-0x100b on acpi0
 pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
 pci0: ACPI PCI bus on pcib0
 pcib1: ACPI PCI-PCI bridge at device 2.0 on pci0
 pci1: ACPI PCI bus on pcib1
 pci1: network, ethernet at device 0.0 (no driver attached)
 pcib2: PCI-PCI bridge at device 3.0 on pci0
 pci2: PCI bus on pcib2
 pcib3: ACPI PCI-PCI bridge at device 4.0 on pci0
 pci3: ACPI PCI bus on pcib3
 pcib4: ACPI PCI-PCI bridge mem 0xdeb0-0xdeb1 irq 16 at device 0.0
 on pci3
 pci4: ACPI PCI bus on pcib4
 pcib4: no PRT entry for 4.4.INTA
 pcib4: no PRT entry for 4.5.INTA
 pcib4: no PRT entry for 4.8.INTA
 pcib5: PCI-PCI bridge irq 5 at device 4.0 on pci4
 pci5: PCI bus on pcib5
 pcib6: PCI-PCI bridge irq 10 at device 5.0 on pci4
 pci6: PCI bus on pcib6
 pcib4: no PRT entry for 4.5.INTA
 pcib4: no PRT entry for 4.5.INTB
 ix0: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 2.5.0 mem
 0xdec0-0xdec7,0xded0-0xded03fff irq 10 at device 0.0 on pci6
 ix0: Using MSIX interrupts with 5 vectors
 ix0: Ethernet address: 90:e2:ba:2b:3b:6c
 ix0: PCI Express Bus: Speed 5.0Gb/s Width x8
 ix1: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 2.5.0 mem
 0xdec8-0xdecf,0xded04000-0xded07fff irq 11 at device 0.1 on pci6
 ix1: Using MSIX interrupts with 5 vectors
 ix1: Ethernet address: 90:e2:ba:2b:3b:6d
 ix1: PCI Express Bus: Speed 5.0Gb/s Width x8
 pcib7: ACPI PCI-PCI bridge irq 5 at device 8.0 on pci4
 pci7: ACPI PCI bus on pcib7
 pcib8: PCI-PCI bridge at device 5.0 on pci0
 pci8: PCI bus on pcib8
 pcib9: PCI-PCI bridge at device 6.0 on pci0
 pci9: PCI bus on pcib9
 pcib10: PCI-PCI bridge mem 0xdee0-0xdee1 irq 16 at device 0.0 on
 pci9
 pci10: PCI bus on pcib10
 pcib11: PCI-PCI bridge irq 16 at device 0.0 on pci10
 pci11: PCI bus on pcib11
 pcib12: PCI-PCI bridge mem 0xdef0-0xdef1 irq 16 at device 0.0 on
 pci11
 pci12: PCI bus on 

Re: system 20% busy at all times?

2013-02-19 Thread Julian Elischer

On 2/19/13 2:21 AM, Fleuriot Damien wrote:

On Feb 19, 2013, at 11:16 AM, Eggert, Lars l...@netapp.com wrote:


Hi,

On Feb 19, 2013, at 10:54, Fleuriot Damien m...@my.gd
wrote:

And indeed we find your answer here, acpi0 firing up a lot of interrupts.

Don't you get any message about that in dmesg -a or /var/log/messages ?

I'd expect something like interrupt storm blabla… source throttled blabla..

nope. The only odd ACPI-related messages I see in dmesg are these:

ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
(20130117/psargs-393)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
(20130117/psargs-393)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._PDC] (Node 
0xfe0007630c40), AE_NOT_FOUND (20130117/psparse-560)

Nothing in syslog.


 From man 4 acpi , in /boot/loader.conf :
hint.acpi.0.disabled=1
Set this to 1 to disable all of ACPI.  If ACPI has been disabled
on your system due to a blacklist entry for your BIOS, you can
set this to 0 to re-enable ACPI for testing.

Any chance you could reboot the host with ACPI disabled ?

If I do that, I get an early kernel crash:

Loading 10.11.12.13/~elars/kernel/kernel:0x20/7634255 0xb47d50/473552 
0xbbb720/890736 Entry at 0x802746f0
Closing network.
Starting program at 0x802746f0
GDB: no debug ports present
KDB: debugger backends: ddb
KDB: current backend: ddb
panic: running without device atpic requires a local APIC
cpuid = 0
KDB: stack backtrace:
kernel trap 12 with interrupts disabled


Fatal trap 12: page fault while in kernel mode
cpuid = 0; apic id = 00
fault virtual address   = 0x0
fault code  = supervisor read data, page not present
instruction pointer = 0x20:0x805c2973
stack pointer   = 0x28:0x80c9a960
frame pointer   = 0x28:0x80c9aa80
code segment= base 0x0, limit 0xf, type 0x1b
= DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags= resume, IOPL = 0
current process = 0 ()
[ thread pid 0 tid 0 ]
Stopped at  0x805c2973: movzbl  (%rdi),%ecx



If that helps your CPU load, try setting this in /boot/loader.conf :
hw.acpi.verbose=1
Turn on verbose debugging information about what ACPI is doing.

Done, but it doesn't really result in any additional messages:

# dmesg | grep -i acpi
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
ACPI APIC Table: PTLTD  CARNEGIE
acpi0: PTLTD CARNEGIE on motherboard
acpi0: Power Button (fixed)
cpu0: ACPI CPU on acpi0
ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
(20130117/psargs-393)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
ACPI Error: [\134_SB_.PCI0.LPC0.BCMD] Namespace lookup failure, AE_NOT_FOUND 
(20130117/psargs-393)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._OSC] (Node 
0xfe0007630c00), AE_NOT_FOUND (20130117/psparse-560)
ACPI Error: Method parse/execution failed [\134_PR_.CPU0._PDC] (Node 
0xfe0007630c40), AE_NOT_FOUND (20130117/psparse-560)
cpu1: ACPI CPU on acpi0
cpu2: ACPI CPU on acpi0
cpu3: ACPI CPU on acpi0
atrtc0: AT realtime clock port 0x70-0x71 irq 8 on acpi0
attimer0: AT timer port 0x40-0x43,0x50-0x53 irq 0 on acpi0
Timecounter ACPI-fast frequency 3579545 Hz quality 900
acpi_timer0: 24-bit timer at 3.579545MHz port 0x1008-0x100b on acpi0
pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
pci0: ACPI PCI bus on pcib0
pcib1: ACPI PCI-PCI bridge at device 2.0 on pci0
pci1: ACPI PCI bus on pcib1
pcib3: ACPI PCI-PCI bridge at device 4.0 on pci0
pci3: ACPI PCI bus on pcib3
pcib4: ACPI PCI-PCI bridge mem 0xdeb0-0xdeb1 irq 16 at device 0.0 on 
pci3
pci4: ACPI PCI bus on pcib4
pcib7: ACPI PCI-PCI bridge irq 5 at device 8.0 on pci4
pci7: ACPI PCI bus on pcib7
pcib29: ACPI PCI-PCI bridge irq 16 at device 28.0 on pci0
pci29: ACPI PCI bus on pcib29
pcib30: ACPI PCI-PCI bridge irq 16 at device 28.4 on pci0
pci30: ACPI PCI bus on pcib30
pcib31: ACPI PCI-PCI bridge irq 17 at device 28.5 on pci0
pci31: ACPI PCI bus on pcib31
pcib32: ACPI PCI-PCI bridge at device 30.0 on pci0
pci32: ACPI PCI bus on pcib32
acpi_button0: Power Button on acpi0
uart0: 16550 or compatible port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
uart1: 16550 or compatible port 0x2f8-0x2ff irq 3 on acpi0




Jeez, I certainly hope people more knowledgeable than me about the kernel will 
be able to make something of all this.


What about a newly build kernel without the line device acpi and without the 
options ACPI_DEBUG ?