Re: vmstat 'b' (disk busy?) field keeps climbing ...

2006-06-24 Thread Kostik Belousov
On Sat, Jun 24, 2006 at 11:55:26AM +0400, Dmitry Morozovsky wrote: On Sat, 24 Jun 2006, Marc G. Fournier wrote: MGF 'b' stands for blocked, not busy. Judging by your page fault rate MGF and the high number of frees and pages being scanned, you're probably MGF swapping tasks in and out

Re: vmstat 'b' (disk busy?) field keeps climbing ...

2006-06-24 Thread Kostik Belousov
On Sat, Jun 24, 2006 at 02:57:27PM -0300, Marc G. Fournier wrote: On Sat, 24 Jun 2006, Kostik Belousov wrote: On Sat, Jun 24, 2006 at 11:55:26AM +0400, Dmitry Morozovsky wrote: On Sat, 24 Jun 2006, Marc G. Fournier wrote: MGF 'b' stands for blocked, not busy. Judging by your page fault

Re: vmstat 'b' (disk busy?) field keeps climbing ...

2006-06-24 Thread Kostik Belousov
On Sat, Jun 24, 2006 at 09:52:03PM +0300, Kostik Belousov wrote: On Sat, Jun 24, 2006 at 02:57:27PM -0300, Marc G. Fournier wrote: On Sat, 24 Jun 2006, Kostik Belousov wrote: On Sat, Jun 24, 2006 at 11:55:26AM +0400, Dmitry Morozovsky wrote: On Sat, 24 Jun 2006, Marc G. Fournier wrote

Re: vmstat 'b' (disk busy?) field keeps climbing ...

2006-06-24 Thread Kostik Belousov
On Sat, Jun 24, 2006 at 04:45:49PM -0300, Marc G. Fournier wrote: On Sat, 24 Jun 2006, Kostik Belousov wrote: On Sat, Jun 24, 2006 at 09:52:03PM +0300, Kostik Belousov wrote: On Sat, Jun 24, 2006 at 02:57:27PM -0300, Marc G. Fournier wrote: On Sat, 24 Jun 2006, Kostik Belousov wrote

Re: LOR in vnode interlock and system map

2006-06-25 Thread Kostik Belousov
On Mon, Jun 26, 2006 at 02:17:20AM +0200, V??clav Haisman wrote: V??clav Haisman wrote, On 29.5.2006 0:09: See attached file. The kernel is todays CVS 6.1. -- Vaclav Haisman May 28 19:16:55

Re: vmstat 'b' (disk busy?) field keeps climbing ...

2006-06-25 Thread Kostik Belousov
On Mon, Jun 26, 2006 at 01:47:04AM -0300, Marc G. Fournier wrote: On Mon, 26 Jun 2006, Marc G. Fournier wrote: 3416 1 1004100 ufs ?? DsJ0:13.01 /usr/local/libexec/postfix/master 3418 3416 1004100 ufs ?? DJ 0:04.16 qmgr -l -t fifo -u 33561 3416 1004100 ufs ?? DJ

Re: LOR in vnode interlock and system map

2006-06-26 Thread Kostik Belousov
On Mon, Jun 26, 2006 at 08:15:35AM +0200, V??clav Haisman wrote: Kostik Belousov wrote, On 26.6.2006 6:17: Try manually MFC rev. 1.637 of sys/kern/vfs_subr.c. Please, report the results (at least to me). I have never done it manual patching before. Do you mean this http

Re: vmstat 'b' (disk busy?) field keeps climbing ...

2006-06-26 Thread Kostik Belousov
On Mon, Jun 26, 2006 at 02:20:12AM -0300, Marc G. Fournier wrote: On Mon, 26 Jun 2006, Kostik Belousov wrote: Yes, this looks like a deadlock. As I understand, that's on 6.1-STABLE ? Yes, kernel sources, it seems, from May 25th, according to my /usr/src tree ... BTW, do you use

Re: What denotes a 'blocked' process?

2006-06-26 Thread Kostik Belousov
On Mon, Jun 26, 2006 at 12:44:17PM -0300, Marc G. Fournier wrote: On Mon, 26 Jun 2006, Marc G. Fournier wrote: Just upgraded to June 15th sources, started up all the processes, and am already at 29 blocked processes ... I've checked for states D, E and L ... nothing ... Actually,

Re: 6.1-R ? 6-Stable ? 5.5-R ?

2006-06-28 Thread Kostik Belousov
On Wed, Jun 28, 2006 at 06:23:15AM -0400, Francisco Reyes wrote: Rink Springer writes: FreeBSD src repository Modified files:(Branch: RELENG_6) sys/nfsservernfs_serv.c nfs_srvsubs.c Log: MFC of the temporary fix for nfsd leaking GIANT.

Re: 6.1-R ? 6-Stable ? 5.5-R ?

2006-06-29 Thread Kostik Belousov
On Thu, Jun 29, 2006 at 01:38:54PM -0400, Francisco Reyes wrote: Kostik Belousov writes: Approved by:pjd (mentor) Revision ChangesPath 1.156.2.3 +16 -0 src/sys/nfsserver/nfs_serv.c 1.136.2.3 +4 -0 src/sys/nfsserver/nfs_srvsubs.c The above files are what I

Re: quota and snapshots in 6.1-RELEASE

2006-06-29 Thread Kostik Belousov
On Thu, Jun 29, 2006 at 10:48:06PM -0400, Mike Jakubik wrote: Konstantin Belousov wrote: On Tue, Jun 06, 2006 at 01:49:04PM -0400, Mike Jakubik wrote: Scott Long wrote: Dmitriy Kirhlarov wrote: Hi! On Tue, May 23, 2006 at 04:35:21PM -0400, Kris Kennaway wrote:

Re: quota and snapshots in 6.1-RELEASE

2006-06-29 Thread Kostik Belousov
On Fri, Jun 30, 2006 at 07:05:36AM +0300, Kostik Belousov wrote: OK, please, provide also your fstab, information on md config and dmesg, and kernel config. Also, it would be good to see the output of alltrace in ddb. It seems that your kernel does not contain quota option ? Oh, I see, you

md deadlocks on wdrain. Was: [Re: quota and snapshots in 6.1-RELEASE]

2006-06-30 Thread Kostik Belousov
On Thu, Jun 29, 2006 at 10:48:06PM -0400, Mike Jakubik wrote: Konstantin Belousov wrote: On Tue, Jun 06, 2006 at 01:49:04PM -0400, Mike Jakubik wrote: Scott Long wrote: Dmitriy Kirhlarov wrote: Hi! On Tue, May 23, 2006 at 04:35:21PM -0400, Kris Kennaway wrote:

Re: NFS Locking Issue

2006-07-02 Thread Kostik Belousov
On Sun, Jul 02, 2006 at 05:49:44PM -0400, Francisco Reyes wrote: User Freebsd writes: Since there are several of us experiencing what looks to be the same sort of deadlock issue, I beseech you not to give up I will try to setup the environment, but to be honest no more 6.X for us until

Re: NFS Locking Issue

2006-07-02 Thread Kostik Belousov
On Mon, Jul 03, 2006 at 12:50:11AM -0400, Francisco Reyes wrote: Kostik Belousov writes: Since nobody except you experience that problems (at least, only you notified about the problem existence) Did you miss the part of: User Freebsd writes: Since there are several of us experiencing

Re: NFS Locking Issue

2006-07-03 Thread Kostik Belousov
On Mon, Jul 03, 2006 at 10:06:52AM +0100, Robert Watson wrote: On Mon, 3 Jul 2006, Kostik Belousov wrote: On Mon, Jul 03, 2006 at 12:50:11AM -0400, Francisco Reyes wrote: Kostik Belousov writes: Since nobody except you experience that problems (at least, only you notified about

Re: NFS Locking Issue

2006-07-04 Thread Kostik Belousov
On Mon, Jul 03, 2006 at 03:40:01PM -0700, Michael Collette wrote: User Freebsd wrote: On Sat, 1 Jul 2006, Francisco Reyes wrote: John Hay writes: I only started to see the lockd problems when upgrading the server side to FreeBSD 6.x and later. I had various FreeBSD clients, between 4.x

Re: NFS Locking Issue

2006-07-05 Thread Kostik Belousov
On Wed, Jul 05, 2006 at 10:09:24AM +0100, Robert Watson wrote: The most significant problem working with rpc.lockd is creating easy to reproduce test cases. Not least because they can potentially involve multiple clients. If you can help to produce simple test cases to reproduce the bugs

Re: NFS Locking Issue

2006-07-05 Thread Kostik Belousov
On Wed, Jul 05, 2006 at 02:38:22PM +0300, Kostik Belousov wrote: On Wed, Jul 05, 2006 at 10:09:24AM +0100, Robert Watson wrote: The most significant problem working with rpc.lockd is creating easy to reproduce test cases. Not least because they can potentially involve multiple clients

Re: NFS Locking Issue

2006-07-05 Thread Kostik Belousov
On Wed, Jul 05, 2006 at 02:04:59PM +0100, Robert Watson wrote: On Wed, 5 Jul 2006, Kostik Belousov wrote: Also, the both lockd processes now put identification information in the proctitle (srv and kern). SIGUSR1 shall be sent to srv process. Hmm, after looking at the dump

Re: 6.1 quota issues

2006-07-09 Thread Kostik Belousov
On Mon, Jul 10, 2006 at 12:41:07AM -0400, Charles Sprickman wrote: On Sat, 8 Jul 2006, Matthew D. Fuller wrote: On Fri, Jul 07, 2006 at 10:56:47PM -0400 I heard the voice of Charles Sprickman, and lo! it spake thus: Trying again, it reported the same inconsistencies then sat there for

Re: 6.1 quota issues

2006-07-10 Thread Kostik Belousov
On Mon, Jul 10, 2006 at 01:39:01AM -0400, Charles Sprickman wrote: On Mon, 10 Jul 2006, Kostik Belousov wrote: On Mon, Jul 10, 2006 at 12:41:07AM -0400, Charles Sprickman wrote: On Sat, 8 Jul 2006, Matthew D. Fuller wrote: On Fri, Jul 07, 2006 at 10:56:47PM -0400 I heard the voice

Re: MySQL and default memory limits (mysqld: Out of memory)

2006-07-10 Thread Kostik Belousov
On Mon, Jul 10, 2006 at 05:41:16PM +1000, Peter Jeremy wrote: On Sun, 2006-Jul-09 23:45:44 +0200, Mathieu Arnold wrote: +-Le 09/07/2006 17:36 -0400, Mike Jakubik a dit : | Exactly, its nice being able to see the current values. How else can i | see what the values are set to? As I

Re: vm_map.c lock up (Was: Re: NFS Locking Issue)

2006-07-14 Thread Kostik Belousov
On Sat, Jul 15, 2006 at 12:10:29AM -0300, User Freebsd wrote: On Wed, 5 Jul 2006, Robert Watson wrote: If you can get into DDB when the hang has occurred, output via serial console for the following commands would be very helpful: show pcpu show allpcpu ps trace traceall show

Re: 6.1 panic after approx. 49 days uptime

2006-07-17 Thread Kostik Belousov
On Sun, Jul 16, 2006 at 12:19:25PM +0300, Kostik Belousov wrote: On Sun, Jul 16, 2006 at 09:46:49AM +0100, Mark Knight wrote: Index: mount.h === RCS file: /usr/local/arch/ncvs/src/sys/sys/mount.h,v retrieving revision 1.210

Re: 6.1 panic after approx. 49 days uptime

2006-07-17 Thread Kostik Belousov
On Sun, Jul 16, 2006 at 09:46:49AM +0100, Mark Knight wrote: In message [EMAIL PROTECTED], Kostik Belousov [EMAIL PROTECTED] writes On Sun, Jul 16, 2006 at 09:32:47AM +0100, Mark Knight wrote: Just awoke to fine my home server (6.1-RELEASE) had panicked during its daily update of /usr/ports

Re: 6.1 panic after approx. 49 days uptime

2006-07-17 Thread Kostik Belousov
On Sun, Jul 16, 2006 at 09:32:47AM +0100, Mark Knight wrote: Just awoke to fine my home server (6.1-RELEASE) had panicked during its daily update of /usr/ports with an uptime of 49 days. Stack trace is here: http://www.knigma.org.uk/scratch/crash_160706.txt Looks file system related

Re: file system deadlock - the whole story?

2006-07-18 Thread Kostik Belousov
On Wed, Jul 19, 2006 at 01:31:17AM -0300, User Freebsd wrote: Kostik/Robert ... does this provide enough (any?) information concerning the deadlock situation(s) that are being reported? is there anything else I should do the next time it happens? I tried to submit a GnATs report on this

Re: file system deadlock - the whole story?

2006-07-19 Thread Kostik Belousov
On Tue, Jul 18, 2006 at 07:51:52AM -0300, User Freebsd wrote: 'k, had a bunch of fun tonight, but one of the results is that I was able to achieve file system deadlock, or so it appears ... Using the following from DDB: set $lines=0 show pcpu show allpcpu ps trace alltrace show

Re: file system deadlock - the whole story?

2006-07-19 Thread Kostik Belousov
On Wed, Jul 19, 2006 at 11:23:21AM -0300, User Freebsd wrote: On Wed, 19 Jul 2006, Robert Watson wrote: On Wed, 19 Jul 2006, User Freebsd wrote: Also note that under FreeBSD 4.x, all three of these machines were pretty much my more solid machines, with even more vServers running on them

Re: setlocale howto

2006-08-12 Thread Kostik Belousov
On Sat, Aug 12, 2006 at 05:51:01PM +0400, Andrey V. Elsukov wrote: for(i = 0; i sizeof(buf); i++) buf[i] = (char)toupper(buf[i]); buf[i] = (char)toupper((unsigned char)buf[i]); Standard integer promotion promotes KOI8-R char codes like 0xd4 into 0xffd4. Since such

Re: NFS locking: lockf freezes (rpc.lockd problem?)

2006-08-27 Thread Kostik Belousov
On Sun, Aug 27, 2006 at 11:24:13AM +, Michael Abbott wrote: I've been trying to make some sense of the NFS locking issue. I am trying to run # make installworld DESTDIR=/mnt where /mnt is an NFS mount on a FreeBSD 4.11 server, but I am unable to get past a call to `lockf`. On

Re: NFS locking: lockf freezes (rpc.lockd problem?)

2006-08-27 Thread Kostik Belousov
On Sun, Aug 27, 2006 at 07:17:34PM +, Michael Abbott wrote: On Sun, 27 Aug 2006, Kostik Belousov wrote: For debugging purposes, tcpdump of the corresponding communications would be quite useful. Besides this, output of ps auxww | grep 'rpc\.' may be interesting. Um. How interesting

Re: NFS locking: lockf freezes (rpc.lockd problem?)

2006-08-28 Thread Kostik Belousov
On Mon, Aug 28, 2006 at 09:48:48AM +, Michael Abbott wrote: An alternative would be to update to RELENG_6 (or at least RELENG_6_1) and then try again. This machine is so painfully slow that I'll probably have to do that overnight, and then I'm out of time until next weekend. Just

Re: ffs snapshot lockup

2006-10-03 Thread Kostik Belousov
On Mon, Oct 02, 2006 at 03:23:49PM -0400, Vivek Khera wrote: On Sep 22, 2006, at 4:36 PM, Kris Kennaway wrote: Start by enabling INVARIANTS, INVARIANT_SUPPORT, DEBUG_LOCKS and DEBUG_VFS_LOCKS, then run 'show lockedvnods' and 'alltrace' in DDB (spammy, need that serial console), or at least

Re: ffs snapshot lockup

2006-10-04 Thread Kostik Belousov
On Wed, Oct 04, 2006 at 09:56:54AM -0400, Vivek Khera wrote: On Oct 3, 2006, at 4:43 AM, Kostik Belousov wrote: Details are posted at http://vivek.khera.org/scratch/crashlogs/ I have the crashdumps available to a kernel hacker upon request (i'd rather not make them generally available

Re: ffs snapshot lockup

2006-10-04 Thread Kostik Belousov
On Wed, Oct 04, 2006 at 01:06:37PM -0400, Vivek Khera wrote: On Oct 4, 2006, at 12:39 PM, Kris Kennaway wrote: The only thing I think was running at the time would be a large file copy from a remote system to this one using rsync. As I understand, you got the panic. Then, you shall

Re: ffs snapshot lockup

2006-10-05 Thread Kostik Belousov
On Wed, Oct 04, 2006 at 05:16:53PM -0400, Vivek Khera wrote: On Oct 3, 2006, at 4:43 AM, Kostik Belousov wrote: Details are posted at http://vivek.khera.org/scratch/crashlogs/ I have the crashdumps available to a kernel hacker upon request (i'd rather not make them generally available

Re: ffs snapshot lockup

2006-10-06 Thread Kostik Belousov
On Thu, Oct 05, 2006 at 10:01:07AM -0400, Vivek Khera wrote: On Oct 5, 2006, at 4:30 AM, Kostik Belousov wrote: The network load was minimal at the time. I had everyone log out and close mail etc. What were the symptoms of locked system ? Could you log in on console, or do

Re: ffs snapshot lockup

2006-10-06 Thread Kostik Belousov
On Fri, Oct 06, 2006 at 02:11:05PM -0400, Vivek Khera wrote: On Oct 6, 2006, at 1:57 PM, Kris Kennaway wrote: This is very strange. You 3 instances of getty where just reading the tty input, and all suspectible processes (like sshd) are waiting on net events. No processes are blocked on

Re: Bug in 6.x' C++ compiler

2006-10-12 Thread Kostik Belousov
On Wed, Oct 11, 2006 at 01:22:57PM -0400, Mikhail Teterin wrote: GCC would not fix the bug described in http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29390 because the compiler is of an unsupported version (they only support 4.x now). Yet, the problem is rather real and hits when the

Re: Bug in 6.x' C++ compiler

2006-10-12 Thread Kostik Belousov
On Thu, Oct 12, 2006 at 12:36:42PM -0400, Mikhail Teterin wrote: http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29390 четвер 12 жовтень 2006 05:54, Kostik Belousov написав: Thread model: posix gcc version 3.4.6 [FreeBSD] 20060305 opt/gcc-3.4.6 is the stock version of the 3.4.6 built

Re: locked vnode / nfs... requires kill -9 in ddb

2006-10-19 Thread Kostik Belousov
On Wed, Oct 18, 2006 at 10:01:45AM -0600, John E Hein wrote: 6.2-PRERELEASE from 20061016 RELENG_6 sources. Locked vnodes 0xc6b7bdd0: tag nfs, type VDIR usecount 2, writecount 0, refcount 8 mountedhere 0 flags (VV_ROOT) v_object 0xc9d84108 ref 0 pages 0 lock type nfs:

HEADS-UP: DEVFS fixes MFC for test [Was: Re: ps locks up on 6.2-PRERELEASE SMP]

2006-10-20 Thread Kostik Belousov
On Fri, Oct 20, 2006 at 09:45:34AM +0900, Kazuaki ODA wrote: Kris Kennaway wrote: Yep, devfs as I suspected. Keep an eye on the commit logs for when kib@ merges his fix. Kris Thanks. I'll test again after the fix is merged into RELENG_6, and post the result. I plan to ask the re@

Re: locked vnode / nfs... requires kill -9 in ddb

2006-10-20 Thread Kostik Belousov
On Sat, Oct 21, 2006 at 08:25:00AM +0800, David Xu wrote: On Thursday 19 October 2006 18:04, Kostik Belousov wrote: The nfs_reply is sleeping with the PCATCH set. The question is why SIGTSTP does not cause msleep to return with EINTR. I have not been tracking the thread

Re: panic: vfs_getopt: caller passed 'opts' as NULL

2006-11-01 Thread Kostik Belousov
On Wed, Nov 01, 2006 at 11:57:48AM +0100, Ulrich Spoerlein wrote: On 10/31/06, Kris Kennaway [EMAIL PROTECTED] wrote: Note that they'll be demand-loaded if requested (e.g. if you try to mount_nullfs). Maybe you or something else tried to mount such a filesystem by accident? But the point

Re: audit and quota don't get along

2006-11-14 Thread Kostik Belousov
On Tue, Nov 14, 2006 at 12:02:43PM -0500, Steve Wills wrote: On Nov 14, 2006, at 4:31 AM, Robert Watson wrote: A backtrace would be helpful. Fatal trap 12: page fault while in kernel mode fault virtual address = 0x0 fault code = supervisor read, page not present

Re: trussing a non existing file causes misbehavior

2006-11-16 Thread Kostik Belousov
On Thu, Nov 16, 2006 at 12:26:32PM +0300, Oleg Dambaev wrote: Nikos Vassiliadis wrote: Hello, In my 6.2-PRERELEASE(one month old approximately) truss gets stuck exiting when trying to truss a non existing file. I think the problem is not in truss itself. But I have not the skills to find

Re: md deadlocks on wdrain. Was: [Re: quota and snapshots in6.1-RELEASE]

2006-11-23 Thread Kostik Belousov
On Wed, Nov 22, 2006 at 07:28:37PM -, Steven Hartland wrote: The patch below fixed this issue for us. We had a jail which when upgrading ( installworld ) from 5.4 to 6.1 would constantly hang the machine with this error. After updating md.c to 1.164 from MAIN and applying the patch

Re: md deadlocks on wdrain. Was: [Re: quota and snapshotsin6.1-RELEASE]

2006-11-24 Thread Kostik Belousov
On Thu, Nov 23, 2006 at 10:36:57PM -, Steven Hartland wrote: Thanks for that, from our point of view its required as without the machine deadlocks without even trying. One real strange thing was that if I created a copy of the vnode file, this particular task ( installworld ) would

Re: kqueue LOR

2006-11-27 Thread Kostik Belousov
On Sun, Nov 26, 2006 at 09:30:39AM +0100, V??clav Haisman wrote: Hi, the attached lor.txt contains LOR I got this yesterday. It is FreeBSD 6.1 with relatively recent kernel, from last week or so. -- VH +lock order reversal: + 1st 0xc537f300 kqueue (kqueue) @

Re: Is there conflicts between gmirror and a quota enabled filesystem?

2006-11-29 Thread Kostik Belousov
On Wed, Nov 29, 2006 at 05:54:40AM -0800, Jason Vance wrote: Posted Monday Nov 27th. Is there a known conflict between gmirror and a quota enabled filesystem? I have a FreeBSD 5.5-STABLE box that is setup with a gmirror RAID 1 using two identical harddrives. I installed quotas

Re: witness_checkorder panic

2006-12-02 Thread Kostik Belousov
On Sat, Dec 02, 2006 at 07:15:37PM +0100, Arno J. Klaassen wrote: Hello, I just got this on a box I'm testing before installation. It has clean RELENG_6 from about two weeks ago with only some small if_bge.c-patches Bruce Evans sent me for testing performance/hang problems. Since I doubt

Re: LOR, 23/Nov/2006 sources, RELENG_6

2006-12-03 Thread Kostik Belousov
On Sun, Dec 03, 2006 at 01:30:41PM -0600, Larry Rosenman wrote: This may already be fixed, known, but... Dec 3 13:14:41 thebighonker kernel: lock order reversal: Dec 3 13:14:41 thebighonker kernel: 1st 0xff0093904c00 kqueue (kqueue) @ /usr/src/sys/kern/kern_event.c:1547 Dec 3

Re: witness_checkorder panic

2006-12-04 Thread Kostik Belousov
On Sun, Dec 03, 2006 at 09:12:08PM +0100, Arno J. Klaassen wrote: Hello, On Sat, Dec 02, 2006 at 07:15:37PM +0100, Arno J. Klaassen wrote: Hello, I just got this on a box I'm testing before installation. It has clean RELENG_6 from about two weeks ago with only some small

Re: panic in nfsd on 6.2-RC1

2006-12-04 Thread Kostik Belousov
On Tue, Dec 05, 2006 at 12:43:23AM +0900, Hiroki Sato wrote: Hi, One of my NFS servers running 6.2-RC1 that are highly-loaded causes a panic repeatedly these days. I am not sure which upgrading this panic starts after precisely, but this was running for almost one year (6.0R and 6.1R)

Re: kqueue LOR

2006-12-12 Thread Kostik Belousov
On Tue, Dec 12, 2006 at 12:44:54AM -0800, Suleiman Souhlal wrote: Kostik Belousov wrote: On Sun, Nov 26, 2006 at 09:30:39AM +0100, V??clav Haisman wrote: Hi, the attached lor.txt contains LOR I got this yesterday. It is FreeBSD 6.1 with relatively recent kernel, from last week or so

Re: kqueue LOR

2006-12-12 Thread Kostik Belousov
On Tue, Dec 12, 2006 at 11:49:42PM +1100, Bruce Evans wrote: On Tue, 12 Dec 2006, Kostik Belousov wrote: On Tue, Dec 12, 2006 at 12:44:54AM -0800, Suleiman Souhlal wrote: Is the mount lock really required, if all we're doing is a single read of a single word (mnt_kern_flags) (v_mount

Re: kqueue LOR

2006-12-13 Thread Kostik Belousov
On Wed, Dec 13, 2006 at 04:12:57AM +, Tor Egge wrote: Hmm, may be, since vnode must be interlocked by ffs_sync() after MNTK_SUSPENDED set, and before MNTK_SUSPEND, mount interlock is not needed in ufs_itimes. Tor ? If neither IN_CHANGE nor IN_UPDATE is set then it might be unsafe

Re: kern/92785: Using exported filesystem on OS/2 NFS client causes filesystem freeze

2006-12-15 Thread Kostik Belousov
On Fri, Dec 15, 2006 at 01:44:23PM +0100, Ulrich Spoerlein wrote: Hi, we too, ran into this problem. OS/2 Clients kill our NFS server. It is running a RELENG_6 snapshot from 2006-11-14. rpc.lockd and rpc.statd are running. I'll conduct a test without those two services shortly. You can

Re: kern/92785: Using exported filesystem on OS/2 NFS client causes filesystem freeze

2006-12-15 Thread Kostik Belousov
On Fri, Dec 15, 2006 at 03:12:58PM +0100, Ulrich Spoerlein wrote: A tcpdump of the session can be found at: http://coyote.dnsalias.net/rpc.pcap (9kB) Am I right that all you did was ls -l root of nfs mount ? Does OS/2 supports the notion of .. directory ? Could you do just ls -l .. from nfs

Re: kern/92785: Using exported filesystem on OS/2 NFS client causes filesystem freeze

2006-12-15 Thread Kostik Belousov
On Fri, Dec 15, 2006 at 04:18:32PM +0100, Ulrich Spoerlein wrote: [Lot of debugging info trimmed to pacify you ] Do you think you can fix it? Any idea why this seems to only happen with OS/2 Clients? It seems that my guess is right. OS/2 nfs client, in contrast to unix client, send LOOKUP ..

Re: Not panic in nfsd (Re: panic in nfsd on 6.2-RC1)

2006-12-15 Thread Kostik Belousov
, Hiroki Sato wrote: Kostik Belousov [EMAIL PROTECTED] wrote in [EMAIL PROTECTED]: ko What version of sys/nfsserver/nfs_serv.c do you use ? If it is older than ko 1.156.2.7, please, update the system. Thanks, I updated it just now and see how it works

Re: running mksnap_ffs

2007-01-03 Thread Kostik Belousov
On Wed, Jan 03, 2007 at 12:05:26AM +0100, Willem Jan Withagen wrote: Gary Palmer wrote: On Tue, Jan 02, 2007 at 09:06:24PM +0100, Willem Jan Withagen wrote: Hi, I got the following Filesystem: FilesystemSizeUsed Avail Capacity iused ifree %iused /dev/da0a 1.3T422G

Re: Filesystem hang on 3ware 6.2 system

2007-01-31 Thread Kostik Belousov
On Wed, Jan 31, 2007 at 02:24:35AM -0500, Michael R. Wayne wrote: We have a nubmer of similar machines that were initiallly formated with 6.2 before it was released and have subsequently been upgraded to 6.2-RELEASE with no issues. So, we upgraded a 6.1 box which has been running fine as long

Re: 6.2-RELEASE - Fatal trap 12 - nvidia driver ?

2007-02-01 Thread Kostik Belousov
On Thu, Feb 01, 2007 at 11:39:33AM +0100, Henri Hennebert wrote: Hello, I experience Fatal trap 12 when I shutdown if I have run the X server (with nvidia driver 1.0.9746). This crash happen 4/5 of the time. It is in devfs_populate_loop() in devfs.c. I don't have the vmcore anymore :-/.

Re: 6.2-RELEASE - Fatal trap 12 - nvidia driver ?

2007-02-01 Thread Kostik Belousov
On Thu, Feb 01, 2007 at 02:41:58PM +0100, Henri Hennebert wrote: Kostik Belousov wrote: Does the nvidia driver don't play right with devfs ? Thanks for your time, See PR/108078 In my case hal is not installed. In PR/108078 the nvidia driver is there too. So nvidia seems more a culprit

Re: 6.2 amd64 panic: lockmgr: thread 0xffffff009f9fd000, not exclusive lock holder 0xffffff003961c000 unlocking

2007-02-01 Thread Kostik Belousov
On Thu, Feb 01, 2007 at 10:02:08AM -0600, Guy Helmer wrote: Guy Helmer wrote: Does this make sense to anyone (it doesn't to me - procfs_doprofile simply locks, calls vn_fullpath, and unlocks)? I was trying to track down a hang by running a system under stress, and instead got this panic

Re: 6.2 amd64 panic: lockmgr: thread 0xffffff009f9fd000, not exclusive lock holder 0xffffff003961c000 unlocking

2007-02-01 Thread Kostik Belousov
On Thu, Feb 01, 2007 at 10:55:18AM -0600, Guy Helmer wrote: Kostik Belousov wrote: On Thu, Feb 01, 2007 at 06:38:26PM +0200, Kostik Belousov wrote: Index: fs/procfs/procfs.c === RCS file: /usr/local/arch/ncvs/src/sys/fs/procfs

Re: kqueue LOR

2007-02-08 Thread Kostik Belousov
On Thu, Feb 08, 2007 at 09:49:23PM +0100, Frode Nordahl wrote: On 27. nov. 2006, at 10.21, Kostik Belousov wrote: On Sun, Nov 26, 2006 at 09:30:39AM +0100, V??clav Haisman wrote: Hi, the attached lor.txt contains LOR I got this yesterday. It is FreeBSD 6.1 with relatively recent kernel

Re: Help debugging non-sleepable lock panic on 6.2-RELEASE

2007-02-28 Thread Kostik Belousov
On Wed, Feb 28, 2007 at 01:54:10PM +, Jason Thomson wrote: Regarding: http://lists.freebsd.org/pipermail/freebsd-stable/2007-February/033241.html Konstantin Belousov wrote: On Thu, Feb 22, 2007 at 06:50:37PM +, Jason Thomson wrote: Try this (already committed to CURRENT as

Re: 'panic: bad pte' error on 6.2-RELEASE (amd64)

2007-03-05 Thread Kostik Belousov
On Mon, Mar 05, 2007 at 12:05:32AM -0800, Peter Losher wrote: We recently updated one of our dual Opteron systems (w/ 4GB RAM) from 5.5 to 6.2 (amd64 wipe and reinstalled) and about once a week, it panics with the below message: -=- TPTE at 0x840028a0 IS ZERO @ VA 800514000 panic:

Re: 'panic: bad pte' error on 6.2-RELEASE (amd64)

2007-03-05 Thread Kostik Belousov
On Mon, Mar 05, 2007 at 06:28:30PM +, Gavin Atkinson wrote: On Mon, 5 Mar 2007, Kostik Belousov wrote: On Mon, Mar 05, 2007 at 12:05:32AM -0800, Peter Losher wrote: We recently updated one of our dual Opteron systems (w/ 4GB RAM) from 5.5 to 6.2 (amd64 wipe and reinstalled) and about

Re: panic: kmem_malloc(16384): kmem_map too small: md-mounted /tmp filled up

2007-03-05 Thread Kostik Belousov
On Mon, Mar 05, 2007 at 10:17:14PM +0300, Yar Tikhiy wrote: On Mon, Mar 05, 2007 at 01:14:29PM -0500, Mikhail Teterin wrote: On Monday 05 March 2007 08:23, Yar Tikhiy wrote: = How will it break them?  swap backing only touches swap if there is = memory pressure, i.e. precisely the

Re: LOR #193

2007-03-06 Thread Kostik Belousov
On Tue, Mar 06, 2007 at 10:44:37PM +0100, Andrea Venturoli wrote: Hello. I'm experiencing the above mentioned LOR on a 6.2p1/amd64 box (running gmirror and SMP if that matters). With reference to your question on

Re: Processes get stuck in ufs state

2007-03-06 Thread Kostik Belousov
On Wed, Mar 07, 2007 at 05:22:38AM +0300, Oleg Derevenetz wrote: Hi ! Sometimes (once a week approximately) I have a problem with the same symptoms described here on SMP FreeBSD 6.2-STABLE with dual AMD Opteron(tm) Processor 850: http://www.freebsd.org/cgi/query-pr.cgi?pr=104406cat=

Re: Processes get stuck in ufs state

2007-03-09 Thread Kostik Belousov
On Fri, Mar 09, 2007 at 06:08:25PM +0300, Oleg Derevenetz wrote: On Wed, Mar 07, 2007 at 05:22:38AM +0300, Oleg Derevenetz wrote: Sometimes (once a week approximately) I have a problem with the same symptoms described here on SMP FreeBSD 6.2-STABLE with dual AMD Opteron(tm) Processor 850:

Re: Testing UFS/Snapshots/Quotas on -STABLE

2007-03-13 Thread Kostik Belousov
On Tue, Mar 13, 2007 at 09:50:54AM -0500, Doug Poland wrote: Hello, I would like to test 6.2-STABLE's ability to handle UFS2, quotas, and snapshots. In particular, I would like to make sure the deadlock issue is resolved,

Re: Snapshot deadlock while dumping

2007-03-16 Thread Kostik Belousov
On Fri, Mar 16, 2007 at 09:21:13AM +0100, Ulrich Spoerlein wrote: Hi, One of our fileservers deadlocked, again. It is running RELENG_6 from 2006-11-14 and was running dump(8) -L on a 11% filled 400GB UFS2 volume. It is hanging for 3h hours now, and there is no disk activity. # ps axl |

ath(4) interrupt time

2007-04-18 Thread Kostik Belousov
I have Acer laptop with the Atheros card, identified as [EMAIL PROTECTED]:2:0: class=0x02 card=0x04181468 chip=0x001a168c rev=0x01 hdr=0x00 vendor = 'Atheros Communications Inc.' class = network subclass = ethernet Corresponding dmesg lines are ath0:

Re: 6.2-STABLE deadlock?

2007-04-23 Thread Kostik Belousov
On Mon, Apr 23, 2007 at 03:56:32AM +0100, Adrian Wontroba wrote: On Tue, Mar 13, 2007 at 02:08:48PM +, Adrian Wontroba wrote: At work, amoungst my stable of old computers running FreeBSD, I have a Fujitsu M800 - a 4 Zeon SMP processor with 4 GB of memory. This primarily runs Nagios and

[danger] broken libc.so on stable

2006-01-27 Thread Kostik Belousov
. After that, manually adding calloc.c to the MISRCS in Makefile.inc and installing libc put the system in order. I suppose that this is simple MFC error. Please, correct it before big havoc hit the stable boxes ! Best regards, Kostik Belousov ___ freebsd

Latest (1.17.2.3) sys/i386/i386/local_apic.c on RELENG_6 and DDB

2006-03-01 Thread Kostik Belousov
/interrupt.h rev. 1.32 shall be (partially) mfc'ed too, or (much more lilkely) reference to db_dump_intr_event shall be removed. Best regards, Kostik Belousov pgpc2Iam26DFT.pgp Description: PGP signature

Re: VFS MFC testers wanted

2006-03-04 Thread Kostik Belousov
On Fri, Mar 03, 2006 at 03:41:55PM -0800, Jeff Roberson wrote: I plan to MFC all of this lovely stuff for 6.1: http://www.chesapeake.net/~jroberson/vfsmfc.diff I'm looking for people who are willing to patch their stable boxes and test this. This has the following changes in it: 1)

Re: Failing to understand getrusage()

2006-03-07 Thread Kostik Belousov
On Tue, Mar 07, 2006 at 02:15:56AM +0300, Yar Tikhiy wrote: On Thu, Mar 02, 2006 at 11:50:29PM +, Nick Barnes wrote: At 2006-03-02 22:24:17+, Nik Clayton writes: I'm failing to understand how getrusage() works, which is a bit perplexing, because it doesn't seem like it would

Re: Failing to understand getrusage()

2006-03-07 Thread Kostik Belousov
On Tue, Mar 07, 2006 at 06:06:31PM +0300, Yar Tikhiy wrote: On Tue, Mar 07, 2006 at 12:11:56PM +0200, Kostik Belousov wrote: On Tue, Mar 07, 2006 at 02:15:56AM +0300, Yar Tikhiy wrote: Personally, I'd like to say a me too. /me too fails to see why in a quiet, idle system ru_maxrss

Re: RELENG_4 on flash disk and swap

2006-03-10 Thread Kostik Belousov
On Fri, Mar 10, 2006 at 01:57:50PM +0200, Dmitry Pryanishnikov wrote: This is still a concern for me. IMHO it would be useful to have the ability to disable process killing due to the lack of swap, because having this enabled on e.g. transit router can lead to very unpleasant scenario.

Re: Failing to understand getrusage()

2006-03-13 Thread Kostik Belousov
On Sat, Mar 11, 2006 at 01:49:50AM +0300, Yar Tikhiy wrote: On Tue, Mar 07, 2006 at 06:12:59PM +0200, Kostik Belousov wrote: It may be desirable to add ru_maxrss sampling at the calcru time too. Something like this: Index: sys/kern/kern_resource.c

Re: RELENG_4 on flash disk and swap

2006-03-14 Thread Kostik Belousov
On Mon, Mar 13, 2006 at 11:48:06AM -0800, Jon Dama wrote: If you feel this situation is undesirable, the first thing to do is to put together the patches necessary to allow the kernel to actually track how much ram+swap might be needed to cover the address-space allocations that have been

[patch] Re: kern/94278: Crash with FIFOs and ktrace

2006-03-14 Thread Kostik Belousov
On Tue, Mar 14, 2006 at 04:41:32PM +0800, David Xu wrote: On Tuesday 14 March 2006 15:27, Martin wrote: David Xu wrote: Can anyone add this to 6.1 todo list ? this definitely should be fixed before 6.1R. One of my friends also has found kern/94278:

Re: [patch] Re: kern/94278: Crash with FIFOs and ktrace

2006-03-14 Thread Kostik Belousov
Sorry for garbled patch. I do not know why mutt decided to encode some = as =3D. Index: compat/linux/linux_misc.c === RCS file: /usr/local/arch/ncvs/src/sys/compat/linux/linux_misc.c,v retrieving revision 1.172 diff -u -r1.172

Re: [patch] Re: kern/94278: Crash with FIFOs and ktrace

2006-03-14 Thread Kostik Belousov
On Tue, Mar 14, 2006 at 09:17:49PM +0800, David Xu wrote: I know, someone will work out such a messy patch, but is it reasonable ? why does not the fifi code suddenly work with well defined vnode interface ? why did someone want to break the well defined FILE-vnode-fs-device layers ? sigh.

Re: acquiring duplicate lock of same type: vnode interlock

2006-03-17 Thread Kostik Belousov
On Fri, Mar 17, 2006 at 06:51:42PM +0800, Ariff Abdullah wrote: I think I've read somewhere about panic during early root mount, fsck etc.. Perhaps this might be related: Full dmesg: http://people.freebsd.org/~ariff/misc/dmesg.boot.amd64 [] acquiring duplicate lock of same type: vnode

Re: more weird bugs with mmap-ing via NFS

2006-03-22 Thread Kostik Belousov
On Tue, Mar 21, 2006 at 09:07:48PM -0500, Mikhail Teterin wrote: в?второк 21 березень 2006 20:53, Matthew Dillon Ви написали: Ah ha. That's the problem. I don't know why you are getting a write error, but that is preventing the client from cleaning out the dirty buffers. The

Re: something changed with 'ps' ?

2006-04-05 Thread Kostik Belousov
On Wed, Apr 05, 2006 at 11:29:45AM -0400, Michael Butler wrote: In particular, the command-line used by the nagios plug-in check_procs produces: [EMAIL PROTECTED]:/home/imb /bin/ps axwo 'state uid ppid vsz rss pcpu ucomm command' Segmentation fault (core dumped) Removing 'ppid' stops

Re: fsck_ufs locked in snaplk

2006-04-25 Thread Kostik Belousov
On Tue, Apr 25, 2006 at 08:09:32AM -0500, Chris Dillon wrote: Quoting Dmitry Morozovsky [EMAIL PROTECTED]: On Mon, 24 Apr 2006, Kris Kennaway wrote: KK Also you should add DEBUG_LOCKS and DEBUG_VFS_LOCKS on the off chance KK they catch the problem. I got one thought about the source of

Re: fsck_ufs locked in snaplk

2006-04-25 Thread Kostik Belousov
On Tue, Apr 25, 2006 at 07:06:11PM +0400, Dmitry Morozovsky wrote: On Tue, 25 Apr 2006, Chris Dillon wrote: CD Please, also add DDB to the kernel and show the result of the CD show lockedvnodes CD alltrace CD ps CD in the DDB after the deadlock, as asked by Kris Kennaway earlier CD

Re: fsck_ufs locked in snaplk

2006-04-25 Thread Kostik Belousov
On Tue, Apr 25, 2006 at 08:05:25PM +0400, Dmitry Morozovsky wrote: On Tue, 25 Apr 2006, Kostik Belousov wrote: KB I just made lab machine with serial console, compile minimal kernel with quotas KB and KDB+WITNESS, and immediately after ``quotacheck /var quotaon /var'' got KB KB

Re: fsck_ufs locked in snaplk

2006-04-26 Thread Kostik Belousov
On Wed, Apr 26, 2006 at 01:43:42PM +0400, Dmitry Morozovsky wrote: On Tue, 25 Apr 2006, Kris Kennaway wrote: KK What people are seeing now must be some other problem that I wan't KK able to reproduce. KK KK Once I hear back from someone who can reproduce it with debugging KK enabled (I'm

  1   2   3   4   5   >