Re: transparent redirection with pf and squid

2015-10-04 Thread Konstantin Belousov
On Sun, Oct 04, 2015 at 07:31:43PM +0100, krad wrote: > Is anyone else having problems with squid core dumping on Freebsd 10-stable > when using the transparent caching feature. It started happening recently > after I re enabled ipv6 on my network. It may just be coincidence though. > It has even

Re: pthread_cancel / sleep change in behaviour

2015-09-04 Thread Konstantin Belousov
On Fri, Sep 04, 2015 at 04:35:24PM +0800, Erich Dollansky wrote: > Hi,. > > On Fri, 4 Sep 2015 11:18:57 +0300 > Konstantin Belousov <kostik...@gmail.com> wrote: > > > On Fri, Sep 04, 2015 at 08:37:40AM +0800, Erich Dollansky wrote: > > &g

Re: pthread_cancel / sleep change in behaviour

2015-09-04 Thread Konstantin Belousov
On Fri, Sep 04, 2015 at 08:37:40AM +0800, Erich Dollansky wrote: > Hi, > > On Thu, 3 Sep 2015 13:43:51 +0300 > Konstantin Belousov <kostik...@gmail.com> wrote: > > On Thu, Sep 03, 2015 at 06:33:53PM +0800, Erich Dollansky wrote: > > > On Thu, 3 Sep 2015 11:19:4

Re: pthread_cancel / sleep change in behaviour

2015-09-03 Thread Konstantin Belousov
On Thu, Sep 03, 2015 at 08:00:47AM +0800, Erich Dollansky wrote: > Hi, > > I noticed a change in behaviour when calling pthread_cancel for a > thread which is sleeping using i.e. sleep (). > > How I understand pthread_cancel, it should cancel the thread and call > the clean up handler soon after

Re: pthread_cancel / sleep change in behaviour

2015-09-03 Thread Konstantin Belousov
On Thu, Sep 03, 2015 at 06:33:53PM +0800, Erich Dollansky wrote: > Hi, > > On Thu, 3 Sep 2015 11:19:47 +0300 > Konstantin Belousov <kostik...@gmail.com> wrote: > > > On Thu, Sep 03, 2015 at 08:00:47AM +0800, Erich Dollansky wrote: > > > > > > I

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-29 Thread Konstantin Belousov
On Sat, Aug 29, 2015 at 01:43:36PM +0200, Michiel Boland wrote: Do I understand correctly that the problem is that if you install a signal handler with signal() (which is what xterm does) and pull in libthr.so somehow, then there is no thr_sighandler inserted? Yes. The problem does not

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-29 Thread Konstantin Belousov
On Sat, Aug 29, 2015 at 03:01:38PM +0200, Jilles Tjoelker wrote: Looks good to me, except that I think a vforked child (in system() and posix_spawn*()) should use the system calls and not libthr's wrappers. This reduces the probability of weird things happening between vfork and exec, and also

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-29 Thread Konstantin Belousov
On Sat, Aug 29, 2015 at 04:02:43PM +0200, Michiel Boland wrote: I verified the patch. The getumask part of lib/libc/gen/setmode.c part was rejected on stable/10 (probably due to other changes in ^/head.) Thank you. The setmode bits are from the Jilles' r280713. I will merge this revision

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-28 Thread Konstantin Belousov
On Fri, Aug 28, 2015 at 08:08:27AM +0200, Michiel Boland wrote: On 08/27/2015 22:16, Konstantin Belousov wrote: [...] I just verified that the signal handler is correctly wrapped for me, on the latest stable/10. Both with the pre-linked libthr.so and with the library loaded dynamically

Re: ia64 stable/10 r286316: hang at Entering /boot/kernel/kernel

2015-08-28 Thread Konstantin Belousov
On Fri, Aug 28, 2015 at 11:30:18AM +0100, Anton Shterenlikht wrote: From kostik...@gmail.com Thu Aug 27 18:22:37 2015 On Thu, Aug 27, 2015 at 01:12:16PM +0100, Anton Shterenlikht wrote: ia64 stable/10 r286315 boots, but r286316 hangs at Entering /boot/kernel/kernel. Please advise

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-28 Thread Konstantin Belousov
On Fri, Aug 28, 2015 at 05:52:42PM +0200, Michiel Boland wrote: set -e for a in `seq 1000` do echo -n $a xterm -e ssh nonexisting done echo (The idea here is that 'ssh nonexisting' should do some work and then exit, xterm -e false, etc. don't appear to trigger the bug.) Prior to

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-27 Thread Konstantin Belousov
On Thu, Aug 27, 2015 at 06:06:26PM +0100, Pete French wrote: Try to obtain the backtrace from the hung xterm. Ideally, you would rebuild xterm and the system libraries (rtld+libc+libthr) with debug symbols and get the backtraces after that. I can try this tomorrow - what do I need to set

Re: ia64 stable/10 r286316: hang at Entering /boot/kernel/kernel

2015-08-27 Thread Konstantin Belousov
On Thu, Aug 27, 2015 at 01:12:16PM +0100, Anton Shterenlikht wrote: ia64 stable/10 r286315 boots, but r286316 hangs at Entering /boot/kernel/kernel. Please advise To state an obvious thing. The commit which you pointed to, changes the code which is not executed at that early kernel boot

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-27 Thread Konstantin Belousov
On Thu, Aug 27, 2015 at 02:04:05PM +0200, Mark Martinec wrote: Pete French wrote: I updated to stable yesterday, plus updated all my porst to the latest pecompiled packages, but I am now seeing odd problems with bash on exit. Sometimes it quits, but leaves a zombie process... e.g

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-27 Thread Konstantin Belousov
On Thu, Aug 27, 2015 at 08:53:09PM +0200, Michiel Boland wrote: The xterm program has a SIGCHLD signal handler that calls wait(). If the handler is invoked while xterm is exiting, a deadlock occurs in rtld. Cheers Michiel #0 _umtx_op_err () at

Re: ia64 regression from r285809 to 286952: kernel does not boot

2015-08-23 Thread Konstantin Belousov
On Sun, Aug 23, 2015 at 03:51:58PM +0100, Anton Shterenlikht wrote: I'm not sure if ia64 list is still operational, so post here. ia64 r285809 10-STABLE works fine. I updated to r286952, buildworld, buildkernel, installkernel, reboot, boot -s, and I get stuck with: FreeBSD/ia64 EFI boot,

Re: 10.2: ntp update breaks DCF77 clock

2015-08-17 Thread Konstantin Belousov
On Sun, Aug 16, 2015 at 10:49:57PM -0700, Cy Schubert wrote: qemu-sbruno) doesn't support all our supported platforms, especially the multitude of ARM platforms, so holes in our auto-generated config.h support will exist. I believe that the userspace arm ABI is not that variable. There

Re: Many core dumps in pthread_getspecific.

2015-07-22 Thread Konstantin Belousov
On Wed, Jul 22, 2015 at 11:46:35AM +0200, Andre Meiser wrote: On Sun, Jul 19, 2015 at 22:57 +0200, Konstantin Belousov wrote: It seems that besides sigreturn(), ucontext symbols must be pre-resolved as well. Try this update (it includes the previous change). thanks for looking

Re: Many core dumps in pthread_getspecific.

2015-07-19 Thread Konstantin Belousov
On Wed, Jul 15, 2015 at 06:32:13PM +0200, Andre Meiser wrote: Hi, no crash from vim or Xorg but from xterm and again at getcontext(uc) after alloca: % readelf -d xterm | grep NEEDED 0x0001 (NEEDED) Shared library: [libXinerama.so.1] 0x0001 (NEEDED)

Re: amd64 kernel dynamic linking allows extern references to statics

2015-07-16 Thread Konstantin Belousov
On Thu, Jul 16, 2015 at 09:18:15AM +1000, Jan Mikkelsen wrote: On 15 Jul 2015, at 11:27 pm, Konstantin Belousov kostik...@gmail.com wrote: On Wed, Jul 15, 2015 at 06:17:20PM +1000, Jan Mikkelsen wrote: Hi, (All on 10.2-BETA1.) I noticed that the latest patch in the bug

Re: amd64 kernel dynamic linking allows extern references to statics

2015-07-15 Thread Konstantin Belousov
On Wed, Jul 15, 2015 at 06:17:20PM +1000, Jan Mikkelsen wrote: Hi, (All on 10.2-BETA1.) I noticed that the latest patch in the bug https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=187594 https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=187594 works on amd64 but fails to load zfs.ko

Re: Many core dumps in pthread_getspecific.

2015-07-03 Thread Konstantin Belousov
On Fri, Jul 03, 2015 at 05:21:50PM +0200, Andre Meiser wrote: Hi, back again. Sorry, I accidently deleted the core file and I'd to wait two weeks until vim crashed again. Xorg didn't crashed so far with the debug libs. On Tue, Jun 16, 2015 at 09:36 +0200, Konstantin Belousov wrote: Ok

Re: panic: wm_page_unwire

2015-06-20 Thread Konstantin Belousov
On Sat, Jun 20, 2015 at 10:23:39AM +0300, Dmitry Sivachenko wrote: On 19 июня 2015 г., at 22:57, Dmitry Sivachenko trtrmi...@gmail.com wrote: Hello, got this panic today on my 10.1-STABLE #0 r279956 box: Screen Shot 2015-06-19 at 22.52.57.png Well, I tracked this down a

Re: Many core dumps in pthread_getspecific.

2015-06-16 Thread Konstantin Belousov
On Tue, Jun 16, 2015 at 09:09:25AM +0200, Andre Meiser wrote: On Sun, Jun 14, 2015 at 21:05 +0200, Konstantin Belousov wrote: Show me the 'ldd vim' output and output from the readelf -d vim | grep NEEDED. % ldd /usr/local/bin/vim /usr/local/bin/vim: libm.so.5 = /lib/libm.so.5

Re: Re: Many core dumps in pthread_getspecific.

2015-06-14 Thread Konstantin Belousov
On Fri, Jun 12, 2015 at 12:03:16PM +0200, Andre Meiser wrote: Hi, On Wed, Jun 03, 2015 at 16:58 +0200, Konstantin Belousov wrote: You should recompile both libc and libthr with debugging symbols, like cd /usr/src (cd lib/libc make all install DEBUG_FLAGS=-g) (cd lib/libthr make all

Re: Many core dumps in pthread_getspecific.

2015-06-06 Thread Konstantin Belousov
On Sat, Jun 06, 2015 at 09:50:22AM +0200, Andre Meiser wrote: On Wed, Jun 03, 2015 at 16:58 +0200, Konstantin Belousov wrote: You should recompile both libc and libthr with debugging symbols, like cd /usr/src (cd lib/libc make all install DEBUG_FLAGS=-g) (cd lib/libthr make all install

Re: Many core dumps in pthread_getspecific.

2015-06-03 Thread Konstantin Belousov
On Wed, Jun 03, 2015 at 01:34:04PM +0200, Andre Meiser wrote: Hi, I'm new to FreeBSD and I'm running into a lot of core dumps (mainly from Xorg, but also from vim, firefox, ...). That's why I asked on freebsd-questions:

Re: protecting some processes from out-of-swap killer

2015-04-25 Thread Konstantin Belousov
On Sat, Apr 25, 2015 at 01:31:14PM +0300, Dmitry Morozovsky wrote: Hi there colleagues, I have stable/10 on a rather big machine (2*8*2 e5 Xeon, 64G RAM, SAS+SSD ZFS raid10+ZIL+L2ARC) acting as a PostgreSQL server. 64G RAM is not big by modern standards, normally populated desktop can take

Re: Significant memory leak in 9.3p10?

2015-03-27 Thread Konstantin Belousov
On Thu, Mar 26, 2015 at 03:46:05PM -0400, J David wrote: On Mon, Mar 16, 2015 at 7:52 PM, J David j.david.li...@gmail.com wrote: On Mon, Mar 16, 2015 at 7:24 PM, Konstantin Belousov kostik...@gmail.com wrote: There are a lot of possibilities to create persistent anonymous shared memory

Re: RELENG_10 performance regression (was Re: 35-40% performance drop releng9 vs releng10 openvpn

2015-03-21 Thread Konstantin Belousov
On Sat, Mar 21, 2015 at 02:13:06PM -0400, Mike Tancsa wrote: On 3/21/2015 11:52 AM, John Baldwin wrote: http://tancsa.com/time/ Do you know why you are using the HPET instead of TSC for timestamping? Hi, I am not consciously making any time keep decisions. kern.eventtimer.choice:

Re: RELENG_10 performance regression (was Re: 35-40% performance drop releng9 vs releng10 openvpn

2015-03-20 Thread Konstantin Belousov
On Fri, Mar 20, 2015 at 04:53:42PM -0700, Xin Li wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA512 On 03/20/15 14:02, Mike Tancsa wrote: OK, I think I found where the RELENG_10 performance loss happened. It seems

Re: Significant memory leak in 9.3p10?

2015-03-16 Thread Konstantin Belousov
On Mon, Mar 16, 2015 at 06:59:33PM -0400, J David wrote: Recently we have seen a large-scale memory leak on amd64 machines running FreeBSD 9.3-RELEASE-p10. This was first observed on 9.3p2 but has since shown up all the way through p10. Here's what the header of top shows: last pid:

Re: Suspected libkvm infinite loop

2015-03-12 Thread Konstantin Belousov
On Wed, Mar 11, 2015 at 09:34:07PM -0700, Mark Johnston wrote: On Thu, Mar 12, 2015 at 02:05:32PM +1000, Nick Frampton wrote: On 12/03/15 00:38, John Baldwin wrote: It sounds like this issue might be the one fixed in r272566: if the KERN_PROC_ALL sysctl is read with an insufficiently

Re: 9.2-RC4 amd64 panic: vm_page_unwire

2013-10-05 Thread Konstantin Belousov
On Sat, Oct 05, 2013 at 05:16:08PM +1000, John Marshall wrote: Since you have a will to play with this, please try to bisect the stable to see which commit introduced the regression (you said that 9.1 does not panic). pgpCeXKhcpYky.pgp Description: PGP signature

Re: Device timeout from mfi(9) while booting 9.2-RELEASE

2013-10-04 Thread Konstantin Belousov
On Fri, Oct 04, 2013 at 03:53:10PM +1000, Jan Mikkelsen wrote: We have Drake Skinny (9240-4i and 9240-8i) cards running, most recently with 9.2-RC4. We'll be testing 9.2-RELEASE in the next week or two. We have seen problems with older versions of the firmware, and interactions with

Re: Device timeout from mfi(9) while booting 9.2-RELEASE

2013-10-03 Thread Konstantin Belousov
On Thu, Oct 03, 2013 at 11:50:15AM -0400, Ryan Stone wrote: We're having trouble booting 9.2-RELEASE on a system with an mfi (Drake Skinny) controller. When it tries to mount root we get: mfi0: COMMAND 0xff8000cb7220 TIMEOUT AFTER 59 SECONDS mfi0: COMMAND 0xff8000cb72a8 TIMEOUT

Re: Device timeout from mfi(9) while booting 9.2-RELEASE

2013-10-03 Thread Konstantin Belousov
On Fri, Oct 04, 2013 at 11:23:45AM +1000, Jan Mikkelsen wrote: Hi, We have Drake Skinny (9240-4i and 9240-8i) cards running, most recently with 9.2-RC4. We'll be testing 9.2-RELEASE in the next week or two. We have seen problems with older versions of the firmware, and interactions with

Re: 9.2-RC4 amd64 panic: vm_page_unwire

2013-09-27 Thread Konstantin Belousov
On Fri, Sep 27, 2013 at 10:07:28AM +1000, John Marshall wrote: I'm running 9.2-RC4 on a handful of desktop and server machines (both i386 and amd64). I have seen three panics (all vm_page_unwire) on one of those systems only (amd64 server) during the past week. The first two panics were

Re: Possible kqueue related issue on STABLE/RC.

2013-09-25 Thread Konstantin Belousov
On Wed, Sep 25, 2013 at 09:58:05AM +0200, Patrick Lamaiziere wrote: Le Wed, 25 Sep 2013 00:21:27 +0300, Konstantin Belousov kostik...@gmail.com a ?crit : Hello, On Tue, Sep 24, 2013 at 10:45:17AM -0700, John-Mark Gurney wrote: I'd like to understand why you think protecting

Re: Possible kqueue related issue on STABLE/RC.

2013-09-25 Thread Konstantin Belousov
On Wed, Sep 25, 2013 at 09:19:54AM -0700, John-Mark Gurney wrote: Konstantin Belousov wrote this message on Wed, Sep 25, 2013 at 00:21 +0300: On Tue, Sep 24, 2013 at 10:45:17AM -0700, John-Mark Gurney wrote: I'd like to understand why you think protecting these functions w/ the _DETACHED

Re: Possible kqueue related issue on STABLE/RC.

2013-09-24 Thread Konstantin Belousov
On Tue, Sep 24, 2013 at 09:44:27AM +0200, Patrick Lamaiziere wrote: Le Mon, 23 Sep 2013 23:31:41 +0300, Konstantin Belousov kostik...@gmail.com a ?crit : Hello, ... Ok This has been mfced to 9.2-STABLE. But I still see this panic with 9-2/STABLE of today (Revision : 255811

Re: Possible kqueue related issue on STABLE/RC.

2013-09-24 Thread Konstantin Belousov
On Tue, Sep 24, 2013 at 11:47:38AM +0200, Patrick Lamaiziere wrote: Le Tue, 24 Sep 2013 11:29:09 +0300, Konstantin Belousov kostik...@gmail.com a ?crit : Hello, ... Ok This has been mfced to 9.2-STABLE. But I still see this panic with 9-2/STABLE of today (Revision : 255811

Re: Possible kqueue related issue on STABLE/RC.

2013-09-24 Thread Konstantin Belousov
On Tue, Sep 24, 2013 at 10:45:17AM -0700, John-Mark Gurney wrote: I'd like to understand why you think protecting these functions w/ the _DETACHED check is correct... In kern_event.c, all calls to f_detach are followed by knote_drop which will ensure that the knote is removed and free, so no

Re: Possible kqueue related issue on STABLE/RC.

2013-09-23 Thread Konstantin Belousov
On Mon, Sep 23, 2013 at 03:37:08PM +0200, Patrick Lamaiziere wrote: Le Fri, 20 Sep 2013 15:17:05 +0200, Patrick Lamaiziere patf...@davenulle.org a ?crit : Le Thu, 12 Sep 2013 10:36:43 +0300, Konstantin Belousov kostik...@gmail.com a ?crit : Hello, Might be, your issue

Re: Possible kqueue related issue on STABLE/RC.

2013-09-12 Thread Konstantin Belousov
On Wed, Sep 11, 2013 at 11:18:34PM +0200, Jimmy Olgeni wrote: Hi, On Wed, 11 Sep 2013, Konstantin Belousov wrote: Also, do you have all options listed at http://www.freebsd.org/doc/en_US.ISO8859-1/books/developers-handbook/kerneldebug-deadlocks.html enabled ? This time I tried

Re: Possible kqueue related issue on STABLE/RC.

2013-09-12 Thread Konstantin Belousov
On Wed, Sep 11, 2013 at 10:32:31PM +0200, Jimmy Olgeni wrote: Hi, On Wed, 11 Sep 2013, Konstantin Belousov wrote: Could you list the lines around the the vfs_subr.c:4591 in your kernel ? 4591 is the VI_LOCK(vp) in filt_vfsvnode: static int filt_vfsvnode(struct knote *kn, long hint

Re: Possible kqueue related issue on STABLE/RC.

2013-09-12 Thread Konstantin Belousov
On Thu, Sep 12, 2013 at 08:28:48PM +0200, Jimmy Olgeni wrote: On Thu, 12 Sep 2013, Konstantin Belousov wrote: Might be, your issue is that some filesystems do not care about proper locking mode for the fifos. UFS carefully disables shared locking for VFIFO, but it seems ZFS is not. I

Re: Possible kqueue related issue on STABLE/RC.

2013-09-12 Thread Konstantin Belousov
On Fri, Sep 13, 2013 at 12:40:28AM +0300, Andriy Gapon wrote: on 12/09/2013 21:49 Konstantin Belousov said the following: Ok, so it is ZFS indeed. I think I will commit the band-aid to head shortly. I am not sure if my message 5231a016.7060...@freebsd.org was intercepted by NSA

Re: Possible kqueue related issue on STABLE/RC.

2013-09-11 Thread Konstantin Belousov
On Wed, Sep 11, 2013 at 05:07:10PM +0200, Jimmy Olgeni wrote: - However, this time I managed to get a minidump from the old -STABLE. I saved it here: http://olgeni.olgeni.com/~olgeni/core.txt.0 Could you list the lines around the the vfs_subr.c:4591 in your kernel ? Also, do you have

Re: 9.2-STABLE: supervisor read data, page not present

2013-09-09 Thread Konstantin Belousov
On Mon, Sep 09, 2013 at 08:45:21AM +0200, Oliver Pinter wrote: On 9/9/13, Konstantin Belousov kostik...@gmail.com wrote: On Sun, Sep 08, 2013 at 11:40:01PM +0200, Oliver Pinter wrote: #6 0x806a2ab3 in cpuctl_ioctl (dev=value optimized out, cmd=value optimized out, data=value

Re: 9.2-STABLE: supervisor read data, page not present

2013-09-09 Thread Konstantin Belousov
On Mon, Sep 09, 2013 at 05:39:22PM +0200, Oliver Pinter wrote: Thanks, this patch fixed the issue. Which issue among the two problems you reported ? I assume that cpuctl panic is gone, but what about proc_reap issue ? pgp_2jaDyVSvF.pgp Description: PGP signature

Re: 9.2-STABLE: supervisor read data, page not present

2013-09-08 Thread Konstantin Belousov
On Sun, Sep 08, 2013 at 11:40:01PM +0200, Oliver Pinter wrote: #6 0x806a2ab3 in cpuctl_ioctl (dev=value optimized out, cmd=value optimized out, data=value optimized out, flags=0, td=value optimized out) at /usr/src/sys/dev/cpuctl/cpuctl.c:478 cpu = value optimized out

Re: 9-STABLE panic on intensive fork

2013-08-29 Thread Konstantin Belousov
On Wed, Aug 28, 2013 at 06:20:29PM +0400, Dmitry Sivachenko wrote: Hello! I am using very recent FreeBSD-9-STABLE snapshot: 9.2-PRERELEASE FreeBSD 9.2-PRERELEASE #0 r254986: Wed Aug 28 17:18:57 MSK 2013 I run uwsgi program (ports/www/uwsgi) on that machine. When uwsgi starts, it forks

Re: another? NFS deadlock on 9.2-PRERELEASE

2013-08-27 Thread Konstantin Belousov
On Tue, Aug 27, 2013 at 05:00:14PM +0300, Daniel Braniss wrote: Daniel Braniss wrote: Daniel Braniss wrote: I upgraded our web server, and only after 3 hours it hung :-( (as a side note, I have 2 other web servers, also running 9.2 doing great :-) go figure.

Re: Stack overflow with kernel r254683

2013-08-26 Thread Konstantin Belousov
On Mon, Aug 26, 2013 at 07:11:48PM -0400, Rick Macklem wrote: Matthias Schuendehuette wrote: Hello, yesterday I got a kernel crash on my server (a ProLiant DL380 G5): panic: stack overflow detected; backtrace may be corrupted Kernel is 9.2-PRERELEASE FreeBSD 9.2-PRERELEASE #7

Re: NFS deadlock on 9.2-Beta1

2013-08-24 Thread Konstantin Belousov
On Sat, Aug 24, 2013 at 01:08:05PM -0400, J David wrote: The requested information about the deadlock was finally obtained and provided off-list to the requested parties due to size. Thank you, the problem is clear now. The problematic process backtrace is Tracing command httpd pid 86383 tid

Re: NFS deadlock on 9.2-Beta1

2013-08-24 Thread Konstantin Belousov
On Sat, Aug 24, 2013 at 02:03:50PM -0400, J David wrote: On Sat, Aug 24, 2013 at 1:41 PM, Konstantin Belousov kostik...@gmail.com wrote: I think the easiest route is to a partial merge of the r253927 from HEAD. Is it helpful if we restart testing releng/9.2 using your suggested fix

Re: NFS deadlock on 9.2-Beta1

2013-08-24 Thread Konstantin Belousov
On Sat, Aug 24, 2013 at 04:11:09PM -0400, J David wrote: On Sat, Aug 24, 2013 at 3:38 PM, Konstantin Belousov kostik...@gmail.com wrote: No, at least not without reverting the r254754 first. The IGN_SBUSY patch is not critical there. There is lots of other stuff in r250907 / reverted

Re: NFS deadlock on 9.2-Beta1

2013-08-22 Thread Konstantin Belousov
On Wed, Aug 21, 2013 at 09:08:10PM -0400, Rick Macklem wrote: Kostik wrote: On Tue, Aug 20, 2013 at 06:18:16PM -0400, Rick Macklem wrote: J David wrote: On Thu, Aug 15, 2013 at 5:39 PM, Rick Macklem rmack...@uoguelph.ca wrote: Have you been able to pass the debugging info on

Re: NFS deadlock on 9.2-Beta1

2013-08-21 Thread Konstantin Belousov
On Tue, Aug 20, 2013 at 06:18:16PM -0400, Rick Macklem wrote: J David wrote: On Thu, Aug 15, 2013 at 5:39 PM, Rick Macklem rmack...@uoguelph.ca wrote: Have you been able to pass the debugging info on to Kostik? It would be really nice to get this fixed for FreeBSD9.2. You're

Re: NFS deadlock on 9.2-Beta1

2013-08-21 Thread Konstantin Belousov
On Wed, Aug 21, 2013 at 08:03:35PM +0200, Yamagi Burmeister wrote: Could the problem be related to this deadlock / LOR? - http://lists.freebsd.org/pipermail/freebsd-fs/2013-August/018052.html This is not related. My test setup is still in place. Will test with r250907 reverted tomorrow

Re: [releng_9 tinderbox] failure on amd64/amd64

2013-08-07 Thread Konstantin Belousov
On Wed, Aug 07, 2013 at 01:09:08PM +, FreeBSD Tinderbox wrote: /src/sys/amd64/amd64/machdep.c: In function 'db_show_sysregs': /src/sys/amd64/amd64/machdep.c:1226: error: 'MSR_IA32_FEATURE_CONTROL' undeclared (first use in this function) Should be fixed with the r254066, sorry for the

Re: stopping amd causes a freeze

2013-07-28 Thread Konstantin Belousov
On Sat, Jul 27, 2013 at 10:33:18AM +0200, Dominic Fandrey wrote: On 26/07/2013 19:10, Dominic Fandrey wrote: On 25/07/2013 12:00, Konstantin Belousov wrote: On Thu, Jul 25, 2013 at 09:56:59AM +0200, Dominic Fandrey wrote: On 22/07/2013 12:07, Konstantin Belousov wrote: On Mon, Jul 22

Re: NFS deadlock on 9.2-Beta1

2013-07-28 Thread Konstantin Belousov
On Sat, Jul 27, 2013 at 03:13:05PM -0700, Michael Tratz wrote: Let's assume the pid which started the deadlock is 14001 (it will be a different pid when we get the results, because the machine has been restarted) I type: show proc 14001 I get the thread numbers from that output and

Re: NFS deadlock on 9.2-Beta1

2013-07-27 Thread Konstantin Belousov
On Sat, Jul 27, 2013 at 04:20:49PM -0400, Rick Macklem wrote: Michael Tratz wrote: On Jul 24, 2013, at 5:25 PM, Rick Macklem rmack...@uoguelph.ca wrote: Michael Tratz wrote: Two machines (NFS Server: running ZFS / Client: disk-less), both are running FreeBSD r253506. The NFS

Re: stopping amd causes a freeze

2013-07-25 Thread Konstantin Belousov
On Thu, Jul 25, 2013 at 09:56:59AM +0200, Dominic Fandrey wrote: On 22/07/2013 12:07, Konstantin Belousov wrote: On Mon, Jul 22, 2013 at 11:50:24AM +0200, Dominic Fandrey wrote: ... I run amd through sysutils/automounter, which is a scripting solution that generates an amd.map file

Re: stopping amd causes a freeze

2013-07-22 Thread Konstantin Belousov
On Mon, Jul 22, 2013 at 11:50:24AM +0200, Dominic Fandrey wrote: Occasionally stopping amd freezes my system. It's a rare occurrence, and I haven't found a reliable way to reproduce it. It's also a real freeze, so there's no way to get into the debugger or grab a core dump. I only can

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-19 Thread Konstantin Belousov
On Fri, Jul 19, 2013 at 07:32:47AM +0200, Andre Albsmeier wrote: On Thu, 04-Jul-2013 at 19:25:28 +0200, Konstantin Belousov wrote: Patch is to improve debugging. I probably commit it after the issue is closed. Arguments against the commit is that the change imposes small performance

Re: syncer causing latency spikes

2013-07-17 Thread Konstantin Belousov
On Wed, Jul 17, 2013 at 02:07:55PM -0400, Mark Johnston wrote: Hello, I'm trying to investigate and solve some postgres latency spikes that I'm seeing as a result of some behaviour in the syncer. This is with FreeBSD 8.2 (with some local modifications and backports, r231160 in particular).

Re: (9.2) panic under disk load (gam_server / knlist_remove_kq)

2013-07-16 Thread Konstantin Belousov
On Mon, Jul 15, 2013 at 06:50:09PM +0200, Patrick Lamaiziere wrote: Le Mon, 15 Jul 2013 16:26:47 +0200, Mateusz Guzik mjgu...@gmail.com a ?crit : Hello, I'm seeing a panic while trying to build a poudriere repository. As far I can see it always happens when gam_server is

Re: (9.2) panic under disk load (gam_server / knlist_remove_kq)

2013-07-16 Thread Konstantin Belousov
On Tue, Jul 16, 2013 at 10:14:36PM +0200, Patrick Lamaiziere wrote: Le Tue, 16 Jul 2013 09:05:55 +0300, Konstantin Belousov kostik...@gmail.com a ?crit : Hello, Thanks Konstantin. I'm trying your patch and that looks better. poudriere runs since 3 hours now (before the box paniced few

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-12 Thread Konstantin Belousov
On Fri, Jul 12, 2013 at 07:24:40AM +0200, Andre Albsmeier wrote: On Thu, 04-Jul-2013 at 19:25:28 +0200, Konstantin Belousov wrote: On Thu, Jul 04, 2013 at 04:29:19PM +0200, Andre Albsmeier wrote: OK, patch is applied. I will reboot the machine later and see what happens tomorrow

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-12 Thread Konstantin Belousov
On Fri, Jul 12, 2013 at 08:05:27AM +0200, Andre Albsmeier wrote: On Fri, 12-Jul-2013 at 08:01:12 +0200, Konstantin Belousov wrote: On Fri, Jul 12, 2013 at 07:24:40AM +0200, Andre Albsmeier wrote: On Thu, 04-Jul-2013 at 19:25:28 +0200, Konstantin Belousov wrote: On Thu, Jul 04, 2013 at 04

Re: Weird regex behavior on 9.1-RELEASE on amd64 in 32-bit mode

2013-07-09 Thread Konstantin Belousov
On Tue, Jul 09, 2013 at 08:08:22PM +0200, Michael Sperber wrote: I noticed that scsh (which only runs in 32-bit mode) fails on amd64. I narrowed it down to a regex malfunction (I think). This program: snip #include regex.h #include stdio.h int main(void) { regex_t r;

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-07 Thread Konstantin Belousov
On Sun, Jul 07, 2013 at 09:25:53AM +0200, Andre Albsmeier wrote: OK, here we go (looks better now): GNU gdb 6.1.1 [FreeBSD] Copyright 2004 Free Software Foundation, Inc. GDB is free software, covered by the GNU General Public License, and you are welcome to change it and/or distribute

Re: Shutdown hangs on unmount of a gjournaled file system in 8-Stable

2013-07-07 Thread Konstantin Belousov
On Mon, Jul 08, 2013 at 12:26:43AM +0200, Andreas Longwitz wrote: The deadlock can be explained now: pid 1 (init) sleeps on mount drain because mp-mnt_lockref was 1. This setting was done by pid 18 (gjournal switcher) by calling vfs_busy(). pid 18 now sleeps on suspwt because

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-04 Thread Konstantin Belousov
On Thu, Jul 04, 2013 at 07:27:00AM +0200, Andre Albsmeier wrote: On Thu, 04-Jul-2013 at 07:24:40 +0200, Konstantin Belousov wrote: On Thu, Jul 04, 2013 at 07:14:09AM +0200, Andre Albsmeier wrote: On Mon, 17-Jun-2013 at 21:30:31 +0200, John Baldwin wrote: On Sunday, June 16, 2013 2:39:42

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-04 Thread Konstantin Belousov
On Thu, Jul 04, 2013 at 04:29:19PM +0200, Andre Albsmeier wrote: OK, patch is applied. I will reboot the machine later and see what happens tomorrow in the morning. However, it might take a few days since the last 2 weeks all was fine. BTW, should this patch be used in general or is it just

Re: Phantom nulls in usbdevs.h during 9-STABLE kernel build

2013-07-04 Thread Konstantin Belousov
On Thu, Jul 04, 2013 at 07:38:24PM -0400, J David wrote: On Thu, Jul 4, 2013 at 12:15 PM, Dimitry Andric d...@freebsd.org wrote: One other thing: which type of file system are you using for /usr/obj, or wherever you pointed $MAKEOBJDIRPREFIX? Also ZFS-over-NFS. The goal is to build on

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-03 Thread Konstantin Belousov
On Thu, Jul 04, 2013 at 07:14:09AM +0200, Andre Albsmeier wrote: On Mon, 17-Jun-2013 at 21:30:31 +0200, John Baldwin wrote: On Sunday, June 16, 2013 2:39:42 am Andre Albsmeier wrote: On Fri, 31-May-2013 at 16:51:03 +0200, John Baldwin wrote: On Friday, May 31, 2013 8:26:11 am Andre

Re: system sporadically hangs on shutdown after switching to WITH_NEW_XORG

2013-06-17 Thread Konstantin Belousov
On Mon, Jun 17, 2013 at 09:16:56PM +0200, Michiel Boland wrote: On 06/16/2013 17:11, Michiel Boland wrote: Hi. Recently I switched to WITH_NEW_XORG, primarily because the stock X server with Intel driver has some issues that make it unusable for me. The new X server and Intel driver

Re: system sporadically hangs on shutdown after switching to WITH_NEW_XORG

2013-06-16 Thread Konstantin Belousov
On Sun, Jun 16, 2013 at 05:11:15PM +0200, Michiel Boland wrote: Hi. Recently I switched to WITH_NEW_XORG, primarily because the stock X server with Intel driver has some issues that make it unusable for me. The new X server and Intel driver works extremely well, so kudos to whoever made

Re: system sporadically hangs on shutdown after switching to WITH_NEW_XORG

2013-06-16 Thread Konstantin Belousov
On Sun, Jun 16, 2013 at 07:12:33PM +0200, Michiel Boland wrote: On 06/16/2013 17:37, Konstantin Belousov wrote: [...] I do not see anything related to i915 in the core.txt you provided. Next time the machine hangs, start with the output of ps command from ddb and 'show allpcpu', together

Re: system sporadically hangs on shutdown after switching to WITH_NEW_XORG

2013-06-16 Thread Konstantin Belousov
On Sun, Jun 16, 2013 at 08:06:21PM +0200, Michiel Boland wrote: On 06/16/2013 19:46, Konstantin Belousov wrote: On Sun, Jun 16, 2013 at 07:12:33PM +0200, Michiel Boland wrote: On 06/16/2013 17:37, Konstantin Belousov wrote: [...] I do not see anything related to i915 in the core.txt you

Re: Does quota still require a custom kernel in FreeBSD 9?

2013-06-13 Thread Konstantin Belousov
On Thu, Jun 13, 2013 at 09:44:15PM +0200, Peter Olsson wrote: Just wondering if a custom kernel with options QUOTA is still needed in FreeBSD 9. I have no test server available right now to test this. According to the handbook it is needed, but I found in a post from 2011 that it would be

Re: Nullfs leaks i-nodes

2013-05-09 Thread Konstantin Belousov
On Thu, May 09, 2013 at 09:02:56AM +0200, Peter Holm wrote: On Wed, May 08, 2013 at 12:13:17PM +0300, Konstantin Belousov wrote: On Tue, May 07, 2013 at 08:30:06AM +0200, G??ran L??wkrantz wrote: I created a PR, kern/178238, on this but would like to know if anyone has any ideas

Re: Nullfs leaks i-nodes

2013-05-09 Thread Konstantin Belousov
On Thu, May 09, 2013 at 02:11:44PM +0200, Goran Lowkrantz wrote: I assume this is CURRENT? Tried on STABLE but got this: cc1: warnings being treated as errors /usr/src/sys/kern/vfs_subr.c: In function 'vfs_notify_upper': /usr/src/sys/kern/vfs_subr.c:2801: warning: implicit declaration of

Re: Apparent regression in r250359

2013-05-09 Thread Konstantin Belousov
On Thu, May 09, 2013 at 10:13:15AM -0400, Jim Ohlstein wrote: # sysctl hw.model hw.model: AMD FX(tm)-8350 Eight-Core Processor Ahh, so it seems that this is a CPU with the LWP. Please try the patch at the end of message. As another workaround, which does not disable AVX support, you could try

Re: Apparent regression in r250359

2013-05-09 Thread Konstantin Belousov
On Thu, May 09, 2013 at 11:42:28AM -0400, Jim Ohlstein wrote: On 05/09/13 10:30, Konstantin Belousov wrote: On Thu, May 09, 2013 at 10:13:15AM -0400, Jim Ohlstein wrote: # sysctl hw.model hw.model: AMD FX(tm)-8350 Eight-Core Processor Ahh, so it seems that this is a CPU with the LWP

Re: Apparent regression in r250359

2013-05-09 Thread Konstantin Belousov
On Thu, May 09, 2013 at 01:17:28PM -0400, Jim Ohlstein wrote: On 05/09/13 12:04, Konstantin Belousov wrote: Hm, I see another bug in the next line as well. Could you try this updated patch ? This does work. Committed to head, should be merged back to stable/9 in three days

Re: Nullfs leaks i-nodes

2013-05-08 Thread Konstantin Belousov
On Tue, May 07, 2013 at 08:30:06AM +0200, G??ran L??wkrantz wrote: I created a PR, kern/178238, on this but would like to know if anyone has any ideas or patches? Have updated the system where I see this to FreeBSD 9.1-STABLE #0 r250229 and still have the problem. The patch below should

Re: Apparent regression in r250359

2013-05-08 Thread Konstantin Belousov
On Wed, May 08, 2013 at 01:14:52PM -0400, Jim Ohlstein wrote: Hello, I upgraded my (custom) kernel earlier and found that multiple daemons (cups, hal, syslog, ntpd, csh) crashed and dumped cores at or shortly after boot. The error I saw several times on the console was: set_fpcontext

Re: Core Dump / panic sleeping thread

2013-03-21 Thread Konstantin Belousov
On Wed, Mar 20, 2013 at 09:14:37PM -0400, Rick Macklem wrote: Well, read/write sharing of files over NFS is pretty rare, so I suspect a truncation of a file by another client (or locally in the NFS server) is a rare event. As such, not invalidating the buffers here doesn't seem like a big

Re: Core Dump / panic sleeping thread

2013-03-21 Thread Konstantin Belousov
On Thu, Mar 21, 2013 at 07:59:25PM +0100, Michael Landin Hostbaek wrote: On Mar 21, 2013, at 8:58 AM, Konstantin Belousov kostik...@gmail.com wrote: On Wed, Mar 20, 2013 at 09:14:37PM -0400, Rick Macklem wrote: Well, read/write sharing of files over NFS is pretty rare, so I suspect

Re: Core Dump / panic sleeping thread

2013-03-20 Thread Konstantin Belousov
On Tue, Mar 19, 2013 at 07:37:43PM -0400, Rick Macklem wrote: Andriy Gapon wrote: on 19/03/2013 19:35 Jeremy Chadwick said the following: On Tue, Mar 19, 2013 at 06:18:06PM +0100, Michael Landin Hostbaek wrote: [snip] Unread portion of the kernel message buffer: Sleeping thread

Re: Core Dump / panic sleeping thread

2013-03-20 Thread Konstantin Belousov
On Wed, Mar 20, 2013 at 12:13:05PM +0100, Michael Landin Hostbaek wrote: On Mar 20, 2013, at 10:49 AM, Konstantin Belousov kostik...@gmail.com wrote: I do not like it. As I said in the previous response to Andrey, I think that moving the vnode_pager_setsize() after the unlock is better

Re: Core Dump / panic sleeping thread

2013-03-20 Thread Konstantin Belousov
On Wed, Mar 20, 2013 at 11:37:56AM -0400, Rick Macklem wrote: Konstantin Belousov wrote: On Wed, Mar 20, 2013 at 12:13:05PM +0100, Michael Landin Hostbaek wrote: On Mar 20, 2013, at 10:49 AM, Konstantin Belousov kostik...@gmail.com wrote: I do not like it. As I said

Re: Core Dump / panic sleeping thread

2013-03-20 Thread Konstantin Belousov
On Wed, Mar 20, 2013 at 09:43:20AM -0400, John Baldwin wrote: On Wednesday, March 20, 2013 9:22:22 am Konstantin Belousov wrote: On Wed, Mar 20, 2013 at 12:13:05PM +0100, Michael Landin Hostbaek wrote: On Mar 20, 2013, at 10:49 AM, Konstantin Belousov kostik...@gmail.com wrote

Re: Core Dump / panic sleeping thread

2013-03-20 Thread Konstantin Belousov
On Wed, Mar 20, 2013 at 08:58:08PM +0200, Konstantin Belousov wrote: On Wed, Mar 20, 2013 at 09:43:20AM -0400, John Baldwin wrote: On Wednesday, March 20, 2013 9:22:22 am Konstantin Belousov wrote: On Wed, Mar 20, 2013 at 12:13:05PM +0100, Michael Landin Hostbaek wrote: On Mar 20

Re: Core Dump / panic sleeping thread

2013-03-19 Thread Konstantin Belousov
On Tue, Mar 19, 2013 at 07:45:56PM +0200, Andriy Gapon wrote: on 19/03/2013 19:35 Jeremy Chadwick said the following: On Tue, Mar 19, 2013 at 06:18:06PM +0100, Michael Landin Hostbaek wrote: [snip] Unread portion of the kernel message buffer: Sleeping thread (tid 100256, pid 85641) owns a

<    1   2   3   4   >