Bug#838491: linux-image-4.7.0-0.bpo.1-amd64-unsigned: extreme load averages and over 2000 kworker threads
On Monday 10 October 2016 16:16:30 Ben Hutchings wrote:
> I think this might be fixed by "mm: memcontrol: use special workqueue
> for creating per-memcg caches" included in version 4.7.6-1. Let us
> know whether that does it.

It did not happen again with linux-image-4.7.0-1-amd64-unsigned (4.7.6-1) within the last day.
I guess you can close the BUG.

Thanks!

regards
Markus Köberl
--
Markus Koeberl
Graz University of Technology
Signal Processing and Speech Communication Laboratory
E-mail: markus.koeb...@tugraz.at
Bug#838491: linux-image-4.7.0-0.bpo.1-amd64-unsigned: extreme load averages and over 2000 kworker threads
Package: src:linux
Followup-For: Bug #838491

Dear Maintainer,

   * What led up to the situation?

Upgraded the kernel and systemd to the versions provided in jessie-backports.

   * What exactly did you do (or not do) that was effective (or ineffective)?

During normal usage (slurm cluster node):

load average: 1290.54, 513.19, 466.29

The load 5 peaks reach 2000.

ps aux | grep kworker | wc -l
4188

I followed the debugging instructions in
https://raw.githubusercontent.com/torvalds/linux/master/Documentation/workqueue.txt

echo workqueue:workqueue_queue_work > /sys/kernel/debug/tracing/set_event
cat /sys/kernel/debug/tracing/trace_pipe > out.txt

After a few seconds:

cat out.txt | awk '{print $8}' | sort | uniq -c | sort -n
      1 function=do_cache_clean
      1 function=pcpu_balance_workfn
      1 function=xfs_eofblocks_worker
      2 function=neigh_periodic_work
      6 function=xfs_reclaim_worker
      6 function=xlog_cil_push_work
      8 function=disk_events_workfn
      8 function=igb_watchdog_task
     12 function=push_to_pool
     13 function=blk_timeout_work
     15 function=vmstat_shepherd
     22 function=xfs_end_io
     27 function=key_garbage_collector
     27 function=lru_add_drain_per_cpu
     34 function=delayed_fput
     38 function=scsi_requeue_run_queue
     39 function=blk_delay_work
     40 function=cgroup_pidlist_destroy_work_fn
     56 function=flush_to_ldisc
     64 function=cache_reap
     77 function=wb_workfn
    101 function=os_execute_work_item
    131 function=css_killed_work_fn
    142 function=xfs_buf_ioend_work
    156 function=vmstat_update
    162 function=call_usermodehelper_exec_work
    162 function=cgroup_release_agent
    409 function=vmpressure_work_fn
    497 function=css_release_work_fn
    500 function=css_free_work_fn
  47931 function=memcg_kmem_cache_create_func

I found https://bugzilla.kernel.org/show_bug.cgi?id=172981 which seems to be the same problem.
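For readers who want to repeat the tally without relying on the position of the eighth whitespace-separated field, here is a minimal Python sketch of the same count. It matches the `function=` field by name instead. The sample trace lines are hypothetical (made-up PIDs and addresses in the style of the workqueue_queue_work event), and `count_work_functions` is a name introduced only for this illustration.

```python
import re
from collections import Counter

# Hypothetical sample lines in the style of trace_pipe output for the
# workqueue:workqueue_queue_work event. PIDs and addresses are made up.
SAMPLE_TRACE = """\
kworker/3:1-412 [003] d... 101.000001: workqueue_queue_work: work struct=ffff880000000001 function=memcg_kmem_cache_create_func workqueue=ffff880000000100 req_cpu=3 cpu=3
kworker/3:1-412 [003] d... 101.000002: workqueue_queue_work: work struct=ffff880000000002 function=memcg_kmem_cache_create_func workqueue=ffff880000000100 req_cpu=3 cpu=3
kworker/0:2-87 [000] d... 101.000003: workqueue_queue_work: work struct=ffff880000000003 function=vmstat_update workqueue=ffff880000000200 req_cpu=0 cpu=0
"""

# Match the field by name so the count does not depend on column position.
FUNCTION_FIELD = re.compile(r"\bfunction=(\S+)")


def count_work_functions(lines):
    """Count how often each work function was queued."""
    counts = Counter()
    for line in lines:
        match = FUNCTION_FIELD.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


if __name__ == "__main__":
    counts = count_work_functions(SAMPLE_TRACE.splitlines())
    # Ascending order, like "sort | uniq -c | sort -n".
    for name, n in sorted(counts.items(), key=lambda item: item[1]):
        print(f"{n:7d} function={name}")
```

To run it on a real capture, pass the lines of out.txt to `count_work_functions` instead of the sample.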
-- Package-specific info: ** Version: Linux version 4.7.0-0.bpo.1-amd64 (debian-ker...@lists.debian.org) (gcc version 4.9.2 (Debian 4.9.2-10) ) #1 SMP Debian 4.7.5-1~bpo8+2 (2016-10-01) ** Command line: BOOT_IMAGE=/vmlinuz-4.7.0-0.bpo.1-amd64 root=UUID=d3b74f44-0f5e-4ba1-9606-ad42b76e5918 ro cgroup_enable=memory swapaccount=1 elevator=deadline quiet nomodeset nouveau.modeset=0 ** Tainted: POE (12289) * Proprietary module has been loaded. * Out-of-tree module has been loaded. * Unsigned module has been loaded. ** Model information sys_vendor: Supermicro product_name: X10SRA product_version: 0123456789 chassis_vendor: Supermicro chassis_version: 0123456789 bios_vendor: American Megatrends Inc. bios_version: 2.0 board_vendor: Supermicro board_name: X10SRA board_version: 1.01 ** Loaded modules: 8021q(E) garp(E) mrp(E) stp(E) llc(E) nvidia_drm(POE) nvidia_modeset(POE) nvidia(POE) drm_kms_helper(E) drm(E) openafs(POE) nfsd(E) auth_rpcgss(E) nfs_acl(E) nfs(E) lockd(E) grace(E) fscache(E) sunrpc(E) intel_rapl(E) sb_edac(E) edac_core(E) x86_pkg_temp_thermal(E) intel_powerclamp(E) coretemp(E) xfs(E) libcrc32c(E) snd_hda_codec_hdmi(E) iTCO_wdt(E) iTCO_vendor_support(E) mxm_wmi(E) evdev(E) kvm_intel(E) kvm(E) irqbypass(E) crct10dif_pclmul(E) crc32_pclmul(E) ghash_clmulni_intel(E) hmac(E) drbg(E) ansi_cprng(E) aesni_intel(E) aes_x86_64(E) lrw(E) gf128mul(E) glue_helper(E) ablk_helper(E) cryptd(E) pcspkr(E) serio_raw(E) snd_hda_codec_realtek(E) snd_hda_codec_generic(E) snd_hda_intel(E) snd_hda_codec(E) snd_hda_core(E) snd_hwdep(E) snd_pcm(E) snd_timer(E) snd(E) soundcore(E) lpc_ich(E) mfd_core(E) sg(E) i2c_i801(E) shpchp(E) ipmi_msghandler(E) wmi(E) acpi_power_meter(E) tpm_tis(E) tpm(E) button(E) usbhid(E) hid(E) fuse(E) autofs4(E) ext4(E) crc16(E) jbd2(E) crc32c_generic(E) mbcache(E) dm_mod(E) sr_mod(E) cdrom(E) sd_mod(E) crc32c_intel(E) psmouse(E) ahci(E) igb(E) libahci(E) ehci_pci(E) i2c_algo_bit(E) ehci_hcd(E) dca(E) ptp(E) pps_core(E) xhci_pci(E) libata(E) xhci_hcd(E) 
usbcore(E) scsi_mod(E) usb_common(E) fjes(E) ** PCI devices: 00:00.0 Host bridge [0600]: Intel Corporation Haswell-E DMI2 [8086:2f00] (rev 02) Subsystem: Super Micro Computer Inc Device [15d9:0857] Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- SERR- 00:01.0 PCI bridge [0604]: Intel Corporation Haswell-E PCI Express Root Port 1 [8086:2f02] (rev 02) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- SERR- TAbort- Reset- FastB2B- PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn- Capabilities: Kernel driver in use: pcieport 00:03.0 PCI bridge [0604]: Intel Corporation Haswell-E PCI Express Root Port 3 [8086:2f08] (rev 02) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle-
Bug#737960: ganglia-monitor: gmond segfault
Package: ganglia-monitor
Version: 3.6.0-2
Severity: normal

Dear Maintainer,

I am replacing an old ganglia setup running on Debian lenny which consists of 4 clusters. Therefore I changed the init.d script to run 4 instances of gmond. I also configured it to use rrdcached.
The main difference from the old setup is that I activated sFlow support and that I changed from tmpfs to rrdcached.
I changed the configuration of the clients to send to both the old and the new server.

During the last day I got 3 segfaults:

Feb 6 09:53:52 web05 kernel: [232710.661591] gmond[24775]: segfault at cc000928 ip 7f6ad56f2f21 sp 7f6ad2a9bfb8 error 4 in libc-2.13.so[7f6ad5672000+182000]
Feb 6 10:08:28 web05 kernel: [233589.004172] gmond[24314]: segfault at 6c0008d8 ip 7f6274ab7f21 sp 7f6271e60fb8 error 4 in libc-2.13.so[7f6274a37000+182000]
Feb 7 07:16:52 web05 kernel: [309692.562475] gmond[28971]: segfault at 84000928 ip 7fcf912e2f21 sp 7fcf8e68bfb8 error 4 in libc-2.13.so[7fcf91262000+182000]

All 3 segfaults could be related to sFlow. I configured sFlow on some Windows hosts. It is possible that the segfaults occurred when a Windows host was booted, but I am not completely sure about that.

-- System Information:
Debian Release: 7.4
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'proposed-updates'), (500, 'stable')
Architecture: amd64 (x86_64)

Kernel: Linux 3.2.0-4-amd64 (SMP w/8 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages ganglia-monitor depends on:
ii  adduser      3.113+nmu3
ii  libapr1      1.4.6-3+deb7u1
ii  libc6        2.13-38+deb7u1
ii  libconfuse0  2.7-4
ii  libexpat1    2.1.0-1+deb7u1
ii  libganglia1  3.6.0-2
ii  libpcre3     1:8.30-5
ii  zlib1g       1:1.2.7.dfsg-13

ganglia-monitor recommends no packages.

ganglia-monitor suggests no packages.
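The three kernel log lines can be compared more easily by subtracting the libc load address (the value before the `+` in the brackets) from the instruction pointer. Below is a minimal Python sketch of that arithmetic using the log lines quoted in this report. The helper names are introduced only for this illustration, and the error-code decoding assumes the standard x86 page-fault bit layout (bit 0: protection violation, bit 1: write, bit 2: user mode).

```python
import re

# The three kernel log lines quoted in the report.
KERNEL_LOG = """\
Feb 6 09:53:52 web05 kernel: [232710.661591] gmond[24775]: segfault at cc000928 ip 7f6ad56f2f21 sp 7f6ad2a9bfb8 error 4 in libc-2.13.so[7f6ad5672000+182000]
Feb 6 10:08:28 web05 kernel: [233589.004172] gmond[24314]: segfault at 6c0008d8 ip 7f6274ab7f21 sp 7f6271e60fb8 error 4 in libc-2.13.so[7f6274a37000+182000]
Feb 7 07:16:52 web05 kernel: [309692.562475] gmond[28971]: segfault at 84000928 ip 7fcf912e2f21 sp 7fcf8e68bfb8 error 4 in libc-2.13.so[7fcf91262000+182000]
"""

SEGFAULT = re.compile(
    r"segfault at (?P<addr>[0-9a-f]+) ip (?P<ip>[0-9a-f]+) sp (?P<sp>[0-9a-f]+) "
    r"error (?P<error>\d+) in (?P<lib>[^\[\s]+)\[(?P<base>[0-9a-f]+)\+(?P<size>[0-9a-f]+)\]"
)


def library_offsets(log_text):
    """Return (library, offset) pairs, where offset = ip - load address."""
    pairs = []
    for match in SEGFAULT.finditer(log_text):
        offset = int(match.group("ip"), 16) - int(match.group("base"), 16)
        pairs.append((match.group("lib"), offset))
    return pairs


def decode_error(code):
    """Decode the low three bits of an x86 page-fault error code."""
    return {
        "protection_violation": bool(code & 1),  # 0 means page not present
        "write": bool(code & 2),                 # 0 means read access
        "user_mode": bool(code & 4),
    }


if __name__ == "__main__":
    for lib, offset in library_offsets(KERNEL_LOG):
        print(f"{lib} +{offset:#x}")
    print(decode_error(4))
```

All three faults resolve to the same offset, 0x80f21, inside libc-2.13.so, and error 4 decodes to a user-mode read of a page that is not present. This suggests one recurring crash site, not three unrelated ones. The offset alone does not identify the libc function without the matching debug symbols.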
-- Configuration Files: /etc/ganglia/gmond.conf changed [not included] /etc/init.d/ganglia-monitor changed [not included] -- no debconf information -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#703724: libkio5: Problem with copy files 9 GB into OpenAFS
Package: libkio5
Version: 4:4.8.4-4
Severity: normal
Tags: lfs

Dear Maintainer,

If I try to copy a file 900KB (df report for /afs) into a subdirectory of /afs with enough quota, KDE copy breaks with a disk full message before starting the copy process, although enough space is available.

I tried to copy a large file (9 GB) from the local disk to somewhere in /afs/... using Dolphin.

I also used
kioclient copy 'source' 'destination'
which does not display or do anything but returns exit value 1.

-- System Information:
Debian Release: 7.0
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'proposed-updates'), (500, 'testing'), (500, 'stable')
Architecture: amd64 (x86_64)

Kernel: Linux 3.2.0-4-amd64 (SMP w/2 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages libkio5 depends on:
ii  libacl1             2.2.51-8
ii  libattr1            1:2.4.46-8
ii  libc6               2.13-38
ii  libkdecore5         4:4.8.4-4
ii  libkdeui5           4:4.8.4-4
ii  libnepomuk4         4:4.8.4-4
ii  libqt4-dbus         4:4.8.2+dfsg-11
ii  libqt4-network      4:4.8.2+dfsg-11
ii  libqt4-svg          4:4.8.2+dfsg-11
ii  libqt4-xml          4:4.8.2+dfsg-11
ii  libqtcore4          4:4.8.2+dfsg-11
ii  libqtgui4           4:4.8.2+dfsg-11
ii  libsolid4           4:4.8.4-4
ii  libstdc++6          4.6.1-4
ii  libstreamanalyzer0  0.7.7-3
ii  libx11-6            2:1.5.0-1
ii  libxrender1         1:0.9.7-1

Versions of packages libkio5 recommends:
ii  kdelibs5-plugins  4:4.8.4-4

libkio5 suggests no packages.
Bug#659652: udev: ignore virtualbox virtual interfaces
Package: udev
Version: 175-3
Severity: wishlist
Tags: patch

Dear Maintainer,

VirtualBox uses MAC addresses starting with 08:00:27.
Please also ignore these MAC addresses.

-- System Information:
Debian Release: wheezy/sid
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'proposed-updates'), (500, 'testing'), (500, 'stable')
Architecture: amd64 (x86_64)

Kernel: Linux 3.1.0-1-amd64 (SMP w/2 CPU cores)
Locale: LANG=en_US.utf8, LC_CTYPE=en_US.utf8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages udev depends on:
ii  debconf [debconf-2.0]  1.5.41
ii  libc6                  2.13-24
ii  libselinux1            2.1.0-4.1
ii  libudev0               175-3
ii  lsb-base               3.2-28.1
ii  util-linux             2.20.1-1.2

Versions of packages udev recommends:
ii  pciutils  1:3.1.8-2
ii  usbutils  1:005-2

udev suggests no packages.

-- debconf information excluded

--- 75-persistent-net-generator.rules.orig  2012-02-12 20:59:23.638573769 +0100
+++ 75-persistent-net-generator.rules  2012-02-12 21:01:22.235161851 +0100
@@ -64,13 +64,14 @@
 ENV{MATCHADDR}==52:54:ab:*, GOTO=glob
 ENV{MATCHADDR}==e2:0c:0f:*, GOTO=globally_administered_whitelist
 # ignore interfaces with locally administered or null MAC addresses
-# and VMWare, Hyper-V, KVM and Xen virtual interfaces
+# and VMWare, Hyper-V, KVM, Virtualbox and Xen virtual interfaces
 ENV{MATCHADDR}==?[2367abef]:*, ENV{MATCHADDR}=
 ENV{MATCHADDR}==00:00:00:00:00:00, ENV{MATCHADDR}=
 ENV{MATCHADDR}==00:0c:29:*|00:50:56:*|00:05:69:*|00:1C:14:*, ENV{MATCHADDR}=
 ENV{MATCHADDR}==00:15:5d:*, ENV{MATCHADDR}=
 ENV{MATCHADDR}==52:54:00:*|54:52:00:*, ENV{MATCHADDR}=
+ENV{MATCHADDR}==08:00:27:*, ENV{MATCHADDR}=
 ENV{MATCHADDR}==00:16:3e:*, ENV{MATCHADDR}=
 LABEL=globally_administered_whitelist
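To illustrate which addresses the patched rule set would blank out, here is a minimal Python sketch that approximates the glob matching with the standard `fnmatch` module. It is a sketch under assumptions, not udev's own matcher: `|` alternatives are split by hand, the comparison is made case-insensitive, and the two whitelist rules that jump to `globally_administered_whitelist` are left out. `IGNORED_PATTERNS` and `is_ignored` are names introduced only for this illustration.

```python
from fnmatch import fnmatchcase

# MATCHADDR patterns from the rules file, including the proposed 08:00:27:*.
# The whitelist rules (52:54:ab:*, e2:0c:0f:*) are deliberately omitted.
IGNORED_PATTERNS = [
    "?[2367abef]:*",                                # locally administered
    "00:00:00:00:00:00",                            # null address
    "00:0c:29:*|00:50:56:*|00:05:69:*|00:1C:14:*",  # VMware
    "00:15:5d:*",                                   # Hyper-V
    "52:54:00:*|54:52:00:*",                        # KVM
    "08:00:27:*",                                   # VirtualBox (this patch)
    "00:16:3e:*",                                   # Xen
]


def is_ignored(mac):
    """Return True if the MAC address matches one of the ignore patterns."""
    mac = mac.lower()
    for rule in IGNORED_PATTERNS:
        # A rule may hold several alternatives separated by "|".
        for pattern in rule.split("|"):
            if fnmatchcase(mac, pattern.lower()):
                return True
    return False


if __name__ == "__main__":
    for address in ("08:00:27:12:34:56", "00:1b:21:aa:bb:cc", "02:00:00:00:00:01"):
        print(address, is_ignored(address))
```

With the added pattern, a VirtualBox address such as 08:00:27:12:34:56 is ignored, while an address like 00:1b:21:aa:bb:cc matches none of the patterns and would still get a persistent name.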
Bug#545139: akonadi-server: not possible to create socket in AFS $HOME
Package: akonadi-server
Version: 1.2.0-2
Severity: important

It is not possible to run akonadi-server if the home directory is located on an AFS server, because creating sockets is not supported by AFS.

Please move ~/.local/share/akonadi/akonadiserver.socket to somewhere under /tmp or make its location configurable.

-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (990, 'testing'), (700, 'unstable'), (500, 'oldstable'), (500, 'stable'), (100, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.26-2-686 (SMP w/2 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages akonadi-server depends on:
ii  libakonadiprivate1      1.2.0-2            libraries for the Akonadi PIM stor
ii  libboost-program-optio  1.38.0-7           program options library for C++
ii  libc6                   2.9-23             GNU C Library: Shared libraries
ii  libgcc1                 1:4.4.1-1          GCC support library
ii  libqt4-dbus             4:4.5.2-1          Qt 4 D-Bus module
ii  libqt4-sql-mysql        4:4.5.2-1          Qt 4 MySQL database driver
ii  libqtcore4              4:4.5.2-1          Qt 4 core module
ii  libstdc++6              4.4.1-1            The GNU Standard C++ Library v3
ii  mysql-server            5.0.51a-24+lenny1  MySQL database server (metapackage
ii  mysql-server-5.0 [mysq  5.0.51a-24+lenny1  MySQL database server binaries

akonadi-server recommends no packages.

akonadi-server suggests no packages.

-- no debconf information
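The request above amounts to choosing the socket directory at run time so that it need not live in $HOME. Here is a minimal Python sketch of that idea, not Akonadi's actual code. The environment variable `EXAMPLE_SOCKET_DIR`, the file name `server.socket`, and both function names are hypothetical.

```python
import os
import socket
import stat
import tempfile


def socket_dir(env_var="EXAMPLE_SOCKET_DIR"):
    """Pick the socket directory: a configured one, else a private temp dir.

    EXAMPLE_SOCKET_DIR is a hypothetical variable used only in this sketch.
    """
    configured = os.environ.get(env_var)
    if configured:
        return configured
    # mkdtemp creates a directory readable and writable only by its owner.
    return tempfile.mkdtemp(prefix="example-sock-")


def open_listening_socket(directory, name="server.socket"):
    """Bind a Unix domain socket inside the given local directory."""
    path = os.path.join(directory, name)
    server = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    server.bind(path)  # this is the step reported as failing on AFS
    server.listen(1)
    return server, path


if __name__ == "__main__":
    directory = socket_dir()
    server, path = open_listening_socket(directory)
    print(path, stat.S_ISSOCK(os.stat(path).st_mode))
    server.close()
    os.unlink(path)
```

A per-user temporary directory keeps the socket on a local file system while other users still cannot reach it. A fixed, predictable name directly under /tmp would not give that protection.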
Bug#512670: krb5-kdc: Master - Slave replication is not working
On Friday 23 January 2009 00:03:44 Russ Allbery wrote:
> Markus Koeberl markus.koeb...@tugraz.at writes:
> > On Thursday 22 January 2009 20:36:24 Russ Allbery wrote:
> > > Does the database already exist on the slave?
> >
> > No, the database does not exist. The directory /var/lib/krb5kdc is empty.
> > After running the commands there are the following entries:
> >
> > ~# ls -l /var/lib/krb5kdc/
> > total 472
> > -rw--- 1 root root 261582 Jan 22 22:43 from_master
> > -rw--- 1 root root 8192 Jan 22 22:43 principal
> > -rw--- 1 root root 188416 Jan 22 22:43 principal~
> > -rw--- 1 root root 8192 Jan 22 22:43 principal~.kadm5
> > -rw--- 1 root root 0 Jan 22 22:43 principal~.kadm5.lock
> > -rw--- 1 root root 0 Jan 22 2009 principal~.ok
> >
> > I have compared the configuration several times with a KDC running on
> > etch, but I cannot find a difference. On an etch machine, deleting the
> > entries of /var/lib/krb5kdc and running the commands works fine.
>
> Yeah, that's the problem. It's a known bug in MIT Kerberos 1.6. If you
> create an empty database with kdb5_util, the propagation will then work
> correctly, but it won't cope with not having a database at all.
>
> It will presumably be fixed in a later release, but as there isn't an
> upstream fix yet (so far as I know) I'm afraid lenny is likely to release
> with that problem. :/

Yeah, thanks, replication works now.
Please can you put this information somewhere into the docs?

Markus
--
Markus Köberl
Graz University of Technology
Signal Processing and Speech Communication Laboratory
E-mail: markus.koeb...@tugraz.at
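The workaround described in this message can be written as a small pre-flight check on the slave: if /var/lib/krb5kdc holds no `principal` file yet, create an empty database before the first propagation. The following is a minimal Python sketch that only builds the command list and runs nothing. The thread says just "create an empty database with kdb5_util", so the exact invocation `kdb5_util create -s` is an assumption, as is the function name.

```python
import os

# Database directory and file name as shown in the ls output in this thread.
DEFAULT_DB_DIR = "/var/lib/krb5kdc"


def slave_prep_commands(db_dir=DEFAULT_DB_DIR):
    """Return the commands to run on the slave before the first kprop.

    Nothing is executed here. The "create -s" arguments are an assumption;
    the thread only says to create an empty database with kdb5_util.
    """
    principal = os.path.join(db_dir, "principal")
    if os.path.exists(principal):
        # A database already exists, so propagation should work as it is.
        return []
    return [["/usr/sbin/kdb5_util", "create", "-s"]]


if __name__ == "__main__":
    for command in slave_prep_commands():
        print(" ".join(command))
```

After the slave has a database, the master-side commands quoted in the original report (`kdb5_util dump` followed by `kprop -f`) stay unchanged.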
Bug#512670: krb5-kdc: Master - Slave replication is not working
On Thursday 22 January 2009 20:36:24 Russ Allbery wrote:
> Markus Köberl markus.koeb...@tugraz.at writes:
> > Master - Slave replication is not working from krb5-admin-server
> > 1.4.4-7etch6 or 1.3.6-2sarge6
> >
> > Master:
> > /usr/sbin/kdb5_util dump /tmp/slave_datatrans
> > /usr/sbin/kprop -f /tmp/slave_datatrans slavehostname
> > returns:
> > /usr/sbin/kprop: Software caused connection abort while reading response from server
> >
> > Slave:
> > /usr/sbin/kpropd: /usr/sbin/kdb5_util returned a bad exit status (1)
>
> Does the database already exist on the slave?

No, the database does not exist. The directory /var/lib/krb5kdc is empty.
After running the commands there are the following entries:

~# ls -l /var/lib/krb5kdc/
total 472
-rw--- 1 root root 261582 Jan 22 22:43 from_master
-rw--- 1 root root 8192 Jan 22 22:43 principal
-rw--- 1 root root 188416 Jan 22 22:43 principal~
-rw--- 1 root root 8192 Jan 22 22:43 principal~.kadm5
-rw--- 1 root root 0 Jan 22 22:43 principal~.kadm5.lock
-rw--- 1 root root 0 Jan 22 2009 principal~.ok

I have compared the configuration several times with a KDC running on etch, but I cannot find a difference. On an etch machine, deleting the entries of /var/lib/krb5kdc and running the commands works fine.

Markus
--
Markus Köberl
Graz University of Technology
Signal Processing and Speech Communication Laboratory
E-mail: markus.koeb...@tugraz.at