On Thu, Jun 29, 2006 at 10:48:06PM -0400, Mike Jakubik wrote:
Konstantin Belousov wrote:
On Tue, Jun 06, 2006 at 01:49:04PM -0400, Mike Jakubik wrote:
Scott Long wrote:
Dmitriy Kirhlarov wrote:
Hi!
On Tue, May 23, 2006 at 04:35:21PM -0400, Kris Kennaway wrote:
6.1-STABLE after 6.1-RELEASE is releases. So I think you may want
If you use snapshots with your quotas, update to 6.1-STABLE. If you
Sorry, guys. You are mean RELENG_6_1 or RELENG_6?
WBR
RELENG_6. However, the changes will likely make their way into
RELENG_6_1 in a few weeks as part of an errata update.
Scott
I have just done tests on 6.1-R and RELENG_6 as of yesterday evening.
Unfortunately both still lock up hard, no crash, just a frozen system. I
cant enter the KDB (ddb) via the console, but its unusable, as it wont
let me type in anything. There must be some other change in -CURRENT
that fixes this, as -CURRENT did not freeze during my previous tests.
Just to confirm, here is the ID of ufs_quota.c on my RELENG_6 system:
/usr/src/sys/ufs/ufs/ufs_quota.c:
$FreeBSD: src/sys/ufs/ufs/ufs_quota.c,v 1.74.2.4 2006/05/14
00:23:27 tegge Exp $
The hangs are mostly related to snapshots. It would be better to
update to the latest RELENG_6.
Hangs on RELENG_6_1 is not so much interesting. For
hanged RELENG_6 system, please do what described below and post
the log of the ddb session.
I'm not sure whether kbdmux was MFCed into RELENG_6 (AFAIR, yes).
If you have it in your kernel, add the line
hint.kbdmux.0.disabled=1
into the /boot/device.hints to make ddb usable.
After that, on the hang, enter ddb, and
do ps and tr pid for all suspected processes.
Better yet, add the following options to your kernel:
options INVARIANTS
options INVARIANT_SUPPORT
options WITNESS
options DEBUG_LOCKS
options DEBUG_VFS_LOCKS
options DIAGNOSTIC
and, after hang, do in ddb
show allpcpu
show alllocks
show lockedvnods
ps
For each process mentioned in show output, do where pid
(for threaded processes, do thread thread-id; where).
BTW, it would be great to add this instructions to the FAQ.
Well, i finally got around to setting up a serial console on this box,
the following is the output from the debugger after the system stopped
responding. Let me know if you need any more/different information, i
also made the kernel changes you recommended.
FreeBSD 6.1-STABLE #1: Thu Jun 10 00:22:29 EDT 2006
---
KDB: enter: Line break on console
[thread pid 12 tid 14 ]
Stopped at kdb_enter+0x30: leave
db ps
pid proc uid ppid pgrp flag stat wmesgwchan cmd
552 c36228302 550 549 0004000 [SLPQ flswai 0xc0707c24][SLP] rm
550 c35708302 549 549 0004000 [SLPQ wait 0xc3570830][SLP] sh
549 c342ec482 548 549 0004000 [SLPQ wait 0xc342ec48][SLP] sh
548 c36226240 422 422 000 [SLPQ piperd 0xc36027f8][SLP] cron
547 c361f8300 524 547 0004002 [SLPQ ufs 0xc3777c94][SLP] ls
546 c36bc4180 544 544 0004002 [SLPQ wdrain 0xc0707be4][SLP]
fsck_4.2bsd
544 c36bcc480 511 544 0004002 [SLPQ wait 0xc36bcc48][SLP] fsck
524 c35e020c0 522 524 0004002 [SLPQ wait 0xc35e020c][SLP] bash
522 c3570c480 406 522 0004100 [SLPQ flswai 0xc0707c24][SLP] sshd
515 c36bc20c0 0 0 204 [SLPQ wdrain 0xc0707be4][SLP] md0
511 c36bb6240 500 511 0004002 [SLPQ wait 0xc36bb624][SLP] bash
509 c3570418 65 1 509 100 [SLPQ select 0xc0707644][SLP]
dhclient
500 c361fa3c0 406 500 0004100 [SLPQ flswai 0xc0707c24][SLP] sshd
480 c342ea3c0 1 256 000 [SLPQ select 0xc0707644][SLP]
dhclient
465 c361f6240 1 465 0004002 [SLPQ ttyin 0xc342b010][SLP] getty
464 c35e0c480 1 464 0004002 [SLPQ ttyin 0xc3429410][SLP] getty
463 c356fa3c0 1 463 0004002 [SLPQ ttyin 0xc3429810][SLP] getty
462 c356f4180 1 462 0004002 [SLPQ ttyin 0xc343f010][SLP] getty
422 c342e6240 1 422 000 [SLPQ nanslp 0xc06ba32c][SLP] cron
416 c356f000 25 1 416 100 [SLPQ pause 0xc356f034][SLP]
sendmail
412 c356f6240 1 412 100 [SLPQ select 0xc0707644][SLP]
sendmail
406 c35e0 1 406 100 [SLPQ select 0xc0707644][SLP] sshd
290 c361f20c0 1 290 000 [SLPQ flswai 0xc0707c24][SLP]
syslogd
256 c36224180 1 256 000 [SLPQ select 0xc0707644][SLP] devd
145 c356f8300 1 145 000 [SLPQ pause 0xc356f864][SLP]
adjkerntz
38 c3378c480 0 0 204 [SLPQ - 0xd56f5cf8][SLP] schedcpu
37 c342d0000 0 0 204 [SLPQ sdflush 0xc070a3b4][SLP]
softdepflush
36 c342d20c0 0 0 204 [SLPQ vlruwt 0xc342d20c][SLP] vnlru
35 c342d4180 0 0 204 [SLPQ ufs 0xc363c46c][SLP] syncer
34 c342d624