Bug#838491: linux-image-4.7.0-0.bpo.1-amd64-unsigned: extreme load averages and over 2000 kworker threads

2016-10-14 Thread Markus Koeberl
On Monday 10 October 2016 16:16:30 Ben Hutchings wrote:
>
> I think this might be fixed by "mm: memcontrol: use special workqueue
> for creating per-memcg caches" included in version 4.7.6-1.  Let us
> know whether that does it.

It did not happen again with linux-image-4.7.0-1-amd64-unsigned (4.7.6-1) 
within the last day.
I guess you can close the BUG.
Thanks!


regards
Markus Köberl
-- 
Markus Koeberl
Graz University of Technology
Signal Processing and Speech Communication Laboratory
E-mail: markus.koeb...@tugraz.at



Bug#838491: linux-image-4.7.0-0.bpo.1-amd64-unsigned: extreme load averages and over 2000 kworker threads

2016-10-10 Thread Markus Koeberl
Package: src:linux
Followup-For: Bug #838491

Dear Maintainer,

   * What led up to the situation?

upgrade kernel and systemd to the version proveded in jessie-backports

   * What exactly did you do (or not do) that was effective (or
 ineffective)?

during normal usage (slurm cluster node):

load average: 1290.54, 513.19, 466.29

the load 5 peaks reache 2000

ps aux | grep kworker | wc -l
4188

I followed the Debugging instruction of
https://raw.githubusercontent.com/torvalds/linux/master/Documentation/workqueue.txt

echo workqueue:workqueue_queue_work > /sys/kernel/debug/tracing/set_event
cat /sys/kernel/debug/tracing/trace_pipe > out.txt
after a vew seconds:
cat out.txt | awk '{print $8}' | sort | uniq -c | sort -n
  1 function=do_cache_clean
  1 function=pcpu_balance_workfn
  1 function=xfs_eofblocks_worker
  2 function=neigh_periodic_work
  6 function=xfs_reclaim_worker
  6 function=xlog_cil_push_work
  8 function=disk_events_workfn
  8 function=igb_watchdog_task
 12 function=push_to_pool
 13 function=blk_timeout_work
 15 function=vmstat_shepherd
 22 function=xfs_end_io
 27 function=key_garbage_collector
 27 function=lru_add_drain_per_cpu
 34 function=delayed_fput
 38 function=scsi_requeue_run_queue
 39 function=blk_delay_work
 40 function=cgroup_pidlist_destroy_work_fn
 56 function=flush_to_ldisc
 64 function=cache_reap
 77 function=wb_workfn
101 function=os_execute_work_item
131 function=css_killed_work_fn
142 function=xfs_buf_ioend_work
156 function=vmstat_update
162 function=call_usermodehelper_exec_work
162 function=cgroup_release_agent
409 function=vmpressure_work_fn
497 function=css_release_work_fn
500 function=css_free_work_fn
  47931 function=memcg_kmem_cache_create_func


I found https://bugzilla.kernel.org/show_bug.cgi?id=172981 which seams to be 
the same problem.



-- Package-specific info:
** Version:
Linux version 4.7.0-0.bpo.1-amd64 (debian-ker...@lists.debian.org) (gcc version 
4.9.2 (Debian 4.9.2-10) ) #1 SMP Debian 4.7.5-1~bpo8+2 (2016-10-01)

** Command line:
BOOT_IMAGE=/vmlinuz-4.7.0-0.bpo.1-amd64 
root=UUID=d3b74f44-0f5e-4ba1-9606-ad42b76e5918 ro cgroup_enable=memory 
swapaccount=1 elevator=deadline quiet nomodeset nouveau.modeset=0

** Tainted: POE (12289)
 * Proprietary module has been loaded.
 * Out-of-tree module has been loaded.
 * Unsigned module has been loaded.


** Model information
sys_vendor: Supermicro
product_name: X10SRA
product_version: 0123456789
chassis_vendor: Supermicro
chassis_version: 0123456789
bios_vendor: American Megatrends Inc.
bios_version: 2.0
board_vendor: Supermicro
board_name: X10SRA
board_version: 1.01

** Loaded modules:
8021q(E)
garp(E)
mrp(E)
stp(E)
llc(E)
nvidia_drm(POE)
nvidia_modeset(POE)
nvidia(POE)
drm_kms_helper(E)
drm(E)
openafs(POE)
nfsd(E)
auth_rpcgss(E)
nfs_acl(E)
nfs(E)
lockd(E)
grace(E)
fscache(E)
sunrpc(E)
intel_rapl(E)
sb_edac(E)
edac_core(E)
x86_pkg_temp_thermal(E)
intel_powerclamp(E)
coretemp(E)
xfs(E)
libcrc32c(E)
snd_hda_codec_hdmi(E)
iTCO_wdt(E)
iTCO_vendor_support(E)
mxm_wmi(E)
evdev(E)
kvm_intel(E)
kvm(E)
irqbypass(E)
crct10dif_pclmul(E)
crc32_pclmul(E)
ghash_clmulni_intel(E)
hmac(E)
drbg(E)
ansi_cprng(E)
aesni_intel(E)
aes_x86_64(E)
lrw(E)
gf128mul(E)
glue_helper(E)
ablk_helper(E)
cryptd(E)
pcspkr(E)
serio_raw(E)
snd_hda_codec_realtek(E)
snd_hda_codec_generic(E)
snd_hda_intel(E)
snd_hda_codec(E)
snd_hda_core(E)
snd_hwdep(E)
snd_pcm(E)
snd_timer(E)
snd(E)
soundcore(E)
lpc_ich(E)
mfd_core(E)
sg(E)
i2c_i801(E)
shpchp(E)
ipmi_msghandler(E)
wmi(E)
acpi_power_meter(E)
tpm_tis(E)
tpm(E)
button(E)
usbhid(E)
hid(E)
fuse(E)
autofs4(E)
ext4(E)
crc16(E)
jbd2(E)
crc32c_generic(E)
mbcache(E)
dm_mod(E)
sr_mod(E)
cdrom(E)
sd_mod(E)
crc32c_intel(E)
psmouse(E)
ahci(E)
igb(E)
libahci(E)
ehci_pci(E)
i2c_algo_bit(E)
ehci_hcd(E)
dca(E)
ptp(E)
pps_core(E)
xhci_pci(E)
libata(E)
xhci_hcd(E)
usbcore(E)
scsi_mod(E)
usb_common(E)
fjes(E)

** PCI devices:
00:00.0 Host bridge [0600]: Intel Corporation Haswell-E DMI2 [8086:2f00] (rev 
02)
Subsystem: Super Micro Computer Inc Device [15d9:0857]
Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- DisINTx+
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- SERR- 

00:01.0 PCI bridge [0604]: Intel Corporation Haswell-E PCI Express Root Port 1 
[8086:2f02] (rev 02) (prog-if 00 [Normal decode])
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- DisINTx+
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- SERR- TAbort- Reset- FastB2B-
PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
Capabilities: 
Kernel driver in use: pcieport

00:03.0 PCI bridge [0604]: Intel Corporation Haswell-E PCI Express Root Port 3 
[8086:2f08] (rev 02) (prog-if 00 [Normal decode])
Control: I/O+ Mem+ BusMaster+ SpecCycle- 

Bug#737960: ganglia-monitor: gmond segfault

2014-02-07 Thread Markus Koeberl
Package: ganglia-monitor
Version: 3.6.0-2
Severity: normal

Dear Maintainer,
I am replacing an old ganglia setup running on debian lenny which consists of  
4 clusters. Therefore I changed the init.d script to run 4 instances of gmond. 
I also configured it to use rrdcached. The main difference to the old setup is 
that I activated sFlow support and that I changed from tmpfs to rrdcached. I 
changed the configuration of the clients to send to both the old and the new 
server.

During the last Day I got 3 segfaults:

Feb  6 09:53:52 web05 kernel: [232710.661591] gmond[24775]: segfault at 
cc000928 ip 7f6ad56f2f21 sp 7f6ad2a9bfb8 error 4 in 
libc-2.13.so[7f6ad5672000+182000]
Feb  6 10:08:28 web05 kernel: [233589.004172] gmond[24314]: segfault at 
6c0008d8 ip 7f6274ab7f21 sp 7f6271e60fb8 error 4 in 
libc-2.13.so[7f6274a37000+182000]
Feb  7 07:16:52 web05 kernel: [309692.562475] gmond[28971]: segfault at 
84000928 ip 7fcf912e2f21 sp 7fcf8e68bfb8 error 4 in 
libc-2.13.so[7fcf91262000+182000]

All 3 segfaults could be related to sFlow. I configured sFlow on some Windows 
hosts. It is possible that the segfaults occurred when a Windows host is booted 
bat I am not completely sure about that.

-- System Information:
Debian Release: 7.4
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'proposed-updates'), (500, 
'stable')
Architecture: amd64 (x86_64)

Kernel: Linux 3.2.0-4-amd64 (SMP w/8 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages ganglia-monitor depends on:
ii  adduser  3.113+nmu3
ii  libapr1  1.4.6-3+deb7u1
ii  libc62.13-38+deb7u1
ii  libconfuse0  2.7-4
ii  libexpat12.1.0-1+deb7u1
ii  libganglia1  3.6.0-2
ii  libpcre3 1:8.30-5
ii  zlib1g   1:1.2.7.dfsg-13

ganglia-monitor recommends no packages.

ganglia-monitor suggests no packages.

-- Configuration Files:
/etc/ganglia/gmond.conf changed [not included]
/etc/init.d/ganglia-monitor changed [not included]

-- no debconf information


-- 
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#703724: libkio5: Problem with copy files 9 GB into OpenAFS

2013-03-22 Thread Markus Koeberl
Package: libkio5
Version: 4:4.8.4-4
Severity: normal
Tags: lfs

Dear Maintainer,
If I try to copy a file  900KB (df report for /afs) into a subdirectoy of
/afs with enough quota, KDE copy breaks with a disk full message before
starting the copy process, although enough space is available.
I tryed to copy a large file 9GB from the local disk somewhere into /afs/...
using Dolphin.
I also used kioclient copy 'source' 'destination' which does not display or do
anything but returns exit value 1



-- System Information:
Debian Release: 7.0
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'proposed-updates'), (500,
'testing'), (500, 'stable')
Architecture: amd64 (x86_64)

Kernel: Linux 3.2.0-4-amd64 (SMP w/2 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages libkio5 depends on:
ii  libacl1 2.2.51-8
ii  libattr11:2.4.46-8
ii  libc6   2.13-38
ii  libkdecore5 4:4.8.4-4
ii  libkdeui5   4:4.8.4-4
ii  libnepomuk4 4:4.8.4-4
ii  libqt4-dbus 4:4.8.2+dfsg-11
ii  libqt4-network  4:4.8.2+dfsg-11
ii  libqt4-svg  4:4.8.2+dfsg-11
ii  libqt4-xml  4:4.8.2+dfsg-11
ii  libqtcore4  4:4.8.2+dfsg-11
ii  libqtgui4   4:4.8.2+dfsg-11
ii  libsolid4   4:4.8.4-4
ii  libstdc++6  4.6.1-4
ii  libstreamanalyzer0  0.7.7-3
ii  libx11-62:1.5.0-1
ii  libxrender1 1:0.9.7-1

Versions of packages libkio5 recommends:
ii  kdelibs5-plugins  4:4.8.4-4

libkio5 suggests no packages.


-- 
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#659652: udev: ignore virtualbox virtual interfaces

2012-02-12 Thread Markus Koeberl
Package: udev
Version: 175-3
Severity: wishlist
Tags: patch

Dear Maintainer,
Virtualbox uses MAC addresses starting with 08:00:27
Please also ignore this MAC addresses



-- System Information:
Debian Release: wheezy/sid
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'proposed-updates'), (500, 
'testing'), (500, 'stable')
Architecture: amd64 (x86_64)

Kernel: Linux 3.1.0-1-amd64 (SMP w/2 CPU cores)
Locale: LANG=en_US.utf8, LC_CTYPE=en_US.utf8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages udev depends on:
ii  debconf [debconf-2.0]  1.5.41
ii  libc6  2.13-24
ii  libselinux12.1.0-4.1
ii  libudev0   175-3
ii  lsb-base   3.2-28.1
ii  util-linux 2.20.1-1.2

Versions of packages udev recommends:
ii  pciutils  1:3.1.8-2
ii  usbutils  1:005-2

udev suggests no packages.

-- debconf information excluded
--- 75-persistent-net-generator.rules.orig	2012-02-12 20:59:23.638573769 +0100
+++ 75-persistent-net-generator.rules	2012-02-12 21:01:22.235161851 +0100
@@ -64,13 +64,14 @@ ENV{MATCHADDR}==52:54:ab:*, GOTO=glob
 ENV{MATCHADDR}==e2:0c:0f:*, GOTO=globally_administered_whitelist
 
 # ignore interfaces with locally administered or null MAC addresses
-# and VMWare, Hyper-V, KVM and Xen virtual interfaces
+# and VMWare, Hyper-V, KVM, Virtualbox and Xen virtual interfaces
 ENV{MATCHADDR}==?[2367abef]:*,	ENV{MATCHADDR}=
 ENV{MATCHADDR}==00:00:00:00:00:00,	ENV{MATCHADDR}=
 ENV{MATCHADDR}==00:0c:29:*|00:50:56:*|00:05:69:*|00:1C:14:*,
 	ENV{MATCHADDR}=
 ENV{MATCHADDR}==00:15:5d:*,		ENV{MATCHADDR}=
 ENV{MATCHADDR}==52:54:00:*|54:52:00:*, ENV{MATCHADDR}=
+ENV{MATCHADDR}==08:00:27:*,		ENV{MATCHADDR}=
 ENV{MATCHADDR}==00:16:3e:*,		ENV{MATCHADDR}=
 
 LABEL=globally_administered_whitelist


Bug#545139: akonadi-server: not possible to create socket in AFS $HOME

2009-09-05 Thread Markus Koeberl

Package: akonadi-server
Version: 1.2.0-2
Severity: important

It is not possible to run akonadi-server if the home directory is located on a 
AFS server because creating sockets are not sported by AFS.

Please change ~/.local/share/akonadi/akonadiserver.socket somewhere to /tmp or 
make it configurable.


-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (990, 'testing'), (700, 'unstable'), (500, 'oldstable'), (500, 
'stable'), (100, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.26-2-686 (SMP w/2 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages akonadi-server depends on:
ii  libakonadiprivate1 1.2.0-2   libraries for the Akonadi PIM 
stor
ii  libboost-program-optio 1.38.0-7  program options library for C++
ii  libc6  2.9-23GNU C Library: Shared libraries
ii  libgcc11:4.4.1-1 GCC support library
ii  libqt4-dbus4:4.5.2-1 Qt 4 D-Bus module
ii  libqt4-sql-mysql   4:4.5.2-1 Qt 4 MySQL database driver
ii  libqtcore4 4:4.5.2-1 Qt 4 core module
ii  libstdc++6 4.4.1-1   The GNU Standard C++ Library v3
ii  mysql-server   5.0.51a-24+lenny1 MySQL database server 
(metapackage
ii  mysql-server-5.0 [mysq 5.0.51a-24+lenny1 MySQL database server binaries

akonadi-server recommends no packages.

akonadi-server suggests no packages.

-- no debconf information




-- 
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#512670: krb5-kdc: Master - Slave replication is not working

2009-01-23 Thread Markus Koeberl
On Friday 23 January 2009 00:03:44 Russ Allbery wrote:
 Markus Koeberl markus.koeb...@tugraz.at writes:
  On Thursday 22 January 2009 20:36:24 Russ Allbery wrote:
  Does the database already exist on the slave?
 
  No the database does not exist. The directory /var/lib/krb5kdc is
  empty. After running the commands there are the following entries:
  ~# ls -l /var/lib/krb5kdc/
  total 472
  -rw---  1 root root 261582 Jan 22 22:43 from_master
  -rw---  1 root root   8192 Jan 22 22:43 principal
  -rw---  1 root root 188416 Jan 22 22:43 principal~
  -rw---  1 root root   8192 Jan 22 22:43 principal~.kadm5
  -rw---  1 root root  0 Jan 22 22:43 principal~.kadm5.lock
  -rw---  1 root root  0 Jan 22  2009 principal~.ok
 
  I have compared the configuration several times with an kdc running on
  etch.  But I cannot find a difference. On a etch machine deleting the
  entries of /var/lib/krb5kdc and running the commands works fine.

 Yeah, that's the problem.  It's a known bug in MIT Kerberos 1.6.  If you
 create an empty database with kdb5_util, the propagation will then work
 correctly, but it won't cope with not having a database at all.

 It will presumably be fixed in a later release, but as there isn't an
 upstream fix yet (so far as I know) I'm afraid lenny is likely to release
 with that problem.  :/

Yeah, thanks replication works now.
Please can you put this information somewhere into the docs?


Markus
-- 
Markus Köberl
Graz University of Technology
Signal Processing and Speech Communication Laboratory
E-mail: markus.koeb...@tugraz.at



--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#512670: krb5-kdc: Master - Slave replication is not working

2009-01-22 Thread Markus Koeberl
On Thursday 22 January 2009 20:36:24 Russ Allbery wrote:
 Markus Köberl markus.koeb...@tugraz.at writes:
  Master - Slave replication is not working from krb5-admin-server
  1.4.4-7etch6 or 1.3.6-2sarge6
 
  Master:
  /usr/sbin/kdb5_util dump /tmp/slave_datatrans
  /usr/sbin/kprop -f /tmp/slave_datatrans slavehostname
  returns:
  /usr/sbin/kprop: Software caused connection abort while reading response
  from server
 
  Slave:
  /usr/sbin/kpropd: /usr/sbin/kdb5_util returned a bad exit status (1)

 Does the database already exist on the slave?

No the database does not exist. The directory /var/lib/krb5kdc is empty. After 
running the commands there are the following entries:
~# ls -l /var/lib/krb5kdc/
total 472
-rw---  1 root root 261582 Jan 22 22:43 from_master
-rw---  1 root root   8192 Jan 22 22:43 principal
-rw---  1 root root 188416 Jan 22 22:43 principal~
-rw---  1 root root   8192 Jan 22 22:43 principal~.kadm5
-rw---  1 root root  0 Jan 22 22:43 principal~.kadm5.lock
-rw---  1 root root  0 Jan 22  2009 principal~.ok

I have compared the configuration several times with an kdc running on etch. 
But I cannot find a difference. On a etch machine deleting the entries 
of /var/lib/krb5kdc and running the commands works fine.


Markus
-- 
Markus Köberl
Graz University of Technology
Signal Processing and Speech Communication Laboratory
E-mail: markus.koeb...@tugraz.at



--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org