Re: what is stuck here?!

2005-08-23 Thread dpk
On Tue, 23 Aug 2005, Fafa Hafiz Krantz wrote:


 hello!

 how come *nothing* happens when i rm -rf directory/?
 it just won't move ...

 top from another terminal tells me:
 55272 root 1160 14396K 13768K RUN  0:27 36.13% 35.40% rm

 what? the directory/ only contains a .maildir/, a .muttrc and an empty 
 directory
 it's not an immutable flag that has been set,
 chflags -R nouchg directory/ stands equally still to rm -rf

It's probably busy calculating the list of directory entries to remove. If
you have /proc mounted try:

truss -p 55272

to see what it is doing. If you don't, and you have KTRACE in your kernel,
you can try:

ktrace -i -d -p 55272
kdump -l

to monitor the process.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


OT: Supermicro IPMI 2.0 boards

2005-08-22 Thread dpk
I wasn't able to find anything about this on Google unfortunately. Hoping
that if this thread can determine the problem, others will be able to find
it.

We ordered a whole batch of servers which have the IPMI 2.0 board (I am
pretty sure). When they first come online they seem to have the MAC
00:30:48:12:34:56 . Then at some point they take on the real NICs MAC
address.

The problem is that some of them aren't, and the IPMI 2.0 boards do not
seem to be supported/working under FreeBSD yet. The older versions of the
board did work previously, using the 'freeipmi' port. On these servers the
tools just hang indefinitely.

The situation is getting more serious as it may be causing spanning tree
problems. We're getting warnings from our switches every 15 seconds about
00:30:48:12:34:56 being swapped from port to port and switch to switch.
While on the servers themselves, I'm seeing a steady stream of:

13:37:37.023915 arp who-has 192.168.1.113 tell 192.168.1.113
13:37:37.119740 arp who-has 192.168.1.113 tell 192.168.1.113
13:37:37.213317 arp who-has 192.168.1.113 tell 192.168.1.113
13:37:37.271162 arp who-has 192.168.1.113 tell 192.168.1.113
13:37:37.522533 arp who-has 192.168.1.113 tell 192.168.1.113
13:37:37.720931 arp who-has 192.168.1.113 tell 192.168.1.113

If I bring up that IP on one of the servers, dmesg fills up with entries
about other servers claiming the IP for their own. Most of the entries
have the real MAC address, and some of them have the 12:34:56 entry:

arp: 00:30:48:83:0b:4a is using my IP address 192.168.1.113!
arp: 00:30:48:83:0c:38 is using my IP address 192.168.1.113!
arp: 00:30:48:83:0b:e2 is using my IP address 192.168.1.113!
arp: 00:30:48:12:34:56 is using my IP address 192.168.1.113!
arp: 00:30:48:83:aa:84 is using my IP address 192.168.1.113!
arp: 00:30:48:83:0c:04 is using my IP address 192.168.1.113!
arp: 00:30:48:83:0c:06 is using my IP address 192.168.1.113!

FWIW, we don't use the 192.168.1 block anywhere on our network, so none of
us know why it picked that particular IP address.

I realize this is totally off-topic but I'm hoping others have been using
these servers and may know a solution to the problem. If this message is
totally inappropriate please feel free to reply off-list. In that case I'd
collect the replies and post the solution somewhere (giving credit
obviously).
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: OT: Supermicro IPMI 2.0 boards

2005-08-22 Thread dpk
On Mon, 22 Aug 2005, Danial Thom wrote:

 A more important question is: why did you order
 a whole bunch of servers without testing one
 first? A curious approach to computing.

 DT

We weren't made aware that the newer servers were coming with a later rev
board, and a customer came along asking for 30 some servers to be ordered
yesterday. The servers themselves work OK otherwise, for the most part.

FWIW, this particular IPMI problem would not have appeared at all if we
only ordered one server, since multiple servers are trying to grab the
same IP/using the same MAC.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Stable server

2005-08-17 Thread dpk
On Tue, 16 Aug 2005, Nikolas Britton wrote:

 4.x compared to 5.x will always be more stable
 5.x compared to 6.x will always be more stable
 6.x compared to 7.x will al

 Do you see a trend? 4.x works now but what about in another year, two
 years, or three?

I expect it should work just fine in 3 years -- when we purchase hardware
we expect it to last at least that long, and there's rarely a truly
compelling reason to replace the OS on a server.

 Try running the last version of 3.x on today's
 hardware and software, 4.x is already having problems with hardware
 support.

Unfortunately, yeah. 3ware's auto-carving feature (available in 5.4-S)
would not work on 4.x as an example. There may be other things, but 4.11
works on relatively standard hardware you can purchase today.

 FreeBSD 6 already has a -STABLE and it's first release is
 just around the corner, It would be unwise to deploy 4.x unless
 specifically needed If you need to build the next Mars rover or a
 persons life depends on the system working then use 4.x, If your
 deploying a new web server or what not you want 5.x, possibly even 6.x
 if you can wait another month or two.

It depends. I am really concerned about security updates being backported.
While I feel I'm probably capable of handling it myself, if I had to, by
reviewing patches submitted for later versions, I feel more confident in
the patch when it has been peer-reviewed. The fact is that FreeBSD's 4.11
release is scheduled to have patches long past 5.4, and I have to take
that into account when making recommendations to our clients. I don't want
to have to tell a customer: Install this OS, but in a year, you'll want to
install a different OS, and then deal with incompatibilities with the
software you've purchased for your sites.

The way I see it, every major release of FreeBSD takes some time to reach
stability -- the classic be wary of x.0 versions rule applies here as
with almost all software. Stable versions for web servers have been (in my
experience): FreeBSD 2.2.(something, I don't remember, 5?), 3.2, 4.5. 5's
appears to be 5.4, which so far seems to be pretty great, but was only
just recently released May 9th and is set to EOL in about 10 months.

Anyways, to the OP, it all depends on how long you want this particular
solution to be deployed. I'd keep an eye on the security page (of course).
There may be a company/set of hackers out there that would be able to
backport fixes to FreeBSD 5.4 after it expires, in case you're not able to
deploy the most recent version on that date.

I do stand by my recommendation of 4.11, because it is the pinnacle
before some architectural changes, and if it's anything like 4.5 or 3.2 it
should give you years of quality.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: how to know the file system type [programming]

2005-08-17 Thread dpk
On Wed, 17 Aug 2005, Jorge Mario G. Mazo wrote:

 hi there
 I've been looking for a way to check the fs type
 I need to do something like this

 if NTFS do this
 if msdis do that
 if ufs2  do that
 if ext2 do this other stuff

 thanks in advance

I'd check out the fdisk code. For example:

$ fdisk /dev/ad0 | grep sysid
sysid 165 (0xa5),(FreeBSD/NetBSD/386BSD)

Figure out how it determines the sysid, and then you can use that in your
code. You'd still need a function to determine what disks are physically
present.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Stable server

2005-08-16 Thread dpk
On Tue, 16 Aug 2005, Carstea Catalin wrote:

 what version of freebsd do u recomand for a stable server?

4.11 is solid, hasn't shown any problems here. 5.4 is the best of the 5.x
series but we (I mean at my company, not speaking as a FreeBSD rep)
haven't put it through as much stress as we have 4.11 .
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Stable server

2005-08-16 Thread dpk
On Tue, 16 Aug 2005, dpk wrote:

 On Tue, 16 Aug 2005, Carstea Catalin wrote:

  what version of freebsd do u recomand for a stable server?

 4.11 is solid, hasn't shown any problems here. 5.4 is the best of the 5.x
 series but we (I mean at my company, not speaking as a FreeBSD rep)
 haven't put it through as much stress as we have 4.11 .

I forgot to mention the other reason I recommend 4.11 first:

http://www.freebsd.org/security/index.html

4.11-R is scheduled to receive security updates 8 months longer than
5.4-R, which may be relevant if you want to stick with a specific version
for a while.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Failed installation of FreeBSD 5.4

2005-08-12 Thread dpk
On Fri, 12 Aug 2005, Greg Barniskis wrote:

 It can be argued (and has been, a lot) whether the hardware problems
 that some folks clearly do have are the fault of the hardware or of
 the new FreeBSD architecture. Myself, I think it's probably a little
 of each. Even though the hardware in question often works fine
 with other operating systems, that's not in my view conclusive
 evidence that the new FreeBSD code is bad. Make up your own mind, by
 all means, but jumping to conclusions is rarely going to help you
 actually resolve a problem.

I believe it's a little bit of each as well -- however, there have always
been hardware problems, and sometimes they just need to be hammered at
with software until they work.

There are some issues with modern hardware that some (including myself)
should have come up during testing (crashes w/ PAE, kernel dumps don't,
2GB filesystem/drive problems, sysinstall bugs). I suspect that
part of the problem is that this hardware may not be immediately available
to the FreeBSD developers, so they might be flying blind. I've been
trying to collect information as I see the issues, and opening PRs, but if
you go over the PR list you'll see some open issues for years. I'm not
sure if the issues will be handled, or if they have been and the tickets
just haven't been closed.

Unfortunately I'm not currently in a position to offer hardware loans to
developers so they can work out the kinks. I'm hoping that will change
(especially for large RAID support) but it's not directly in my hands.

With the solid FreeBSD 4.x relegated to legacy status, we've been forced
to use FreeBSD 5.x on some servers at customers requests, with very mixed
success. 5.4 seems relatively OK, so far, but earlier releases have some
random crash issues on hardware that has appeared stable in the past.

When the latest legacy 4.x release version has a EOL that is further off
than the latest best 5.x release, and with developers moving on to 6.0
and 7.0, it's hard to stay motivated, as a user, to continue to submit
bugs for older releases (older as in not even a year old). I can and
will continue to submit bugs as I see them, but as I'm working with
production systems, I can't just upgrade the entire OS to the latest
version, if that's the answer, which it often is.

This may sound bitchy, but at the time of this email there are 4920
non-closed PRs for FreeBSD. It'd be great if these could be handled before
or at the same time as developers move on to the bigger, better code. I
know the answer here is that it's an open system, and if I think it should
be done, I should do it -- unfortunately I do not have the skill set
necessary to fix these bugs.

If there is a way I could help out, I would. The FreeBSD Foundation
activity list doesn't include fixing old bugs --
http://www.freebsdfoundation.org/activities.shtml . Their goal appears to
be adding features, which is understandable; it's way sexier than bug
fixing, in any case. Is there another organization I could donate money to
(and perhaps even time, if possible), that would be working towards making
FreeBSD a stable platform on modern hardware?

I realize this email may turn people against me and my bug reports (if I'm
even important enough to remember, heh!) for being so bitchy. It's been on
my mind for a while, mainly because I still feel FreeBSD provides the best
platform for web services (at least the type I work with). I'd just prefer
that open PRs take priority over new features.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Newbie Q: No login prompt on startup

2005-08-11 Thread dpk
On Thu, 11 Aug 2005, Eric Lance wrote:

 Hello all,

 I just finished installing FreeBSD 5.4 on my PC.  When i boot it
 normally all of the startup scripts finish, no errors are displayed but
 afterwards a login prompt fails to appear. If i boot in single user mode
 I get my '#' prompt right off the bat and can edit files. What is going
 on? Does my machine think I have a terminal connected to the serial
 port? What config file do i need to edit?

 I used standard install method from CD. I'm running an Nforce2 chipset
 and my HDD is SATA, but FreeBSD seems to be dealing with these just fine.

 Thank you,
 Eric

I'd check /etc/ttys and make sure you have this line:

ttyv0   /usr/libexec/getty Pc cons25  on  secure

exactly as above. At the very least, with 'on' and 'secure' in the line.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re[2]: Newbie Q: No login prompt on startup

2005-08-11 Thread dpk
On Fri, 12 Aug 2005, Hexren wrote:

 The next time that happens try ^C.
 If the startup process hangs in bringing up a daemon you cann kill
 that like any other foreground running process. Maybe that was what
 hit you at least it sound a lot like that to me.

 Hexren

Also, try ^T. ^T should tell you what program is running, and, on
occasion, might give you additional information. Try it next time you're
waiting for the automatic fsck to finish, for an example.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: wizard mode docs

2005-08-11 Thread dpk
On Thu, 11 Aug 2005, Randy Schultz wrote:

 Hey all,

 Is there any documentation on wizard mode?  I'm just wondering what the
 scan function does.

Looking at the source, I would guess that it counts how many 512 byte
blocks there are on a device. It prints B: at the beginning and G: at the
end of the device, I believe. It appears to be capable of handling
multiple beginnings and ends, but I'm not sure how that works (would a
read of the full disk device, at the end of a slice, not read a full 512
bytes? I don't know.)
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Help on bash script?

2005-08-11 Thread dpk
On Fri, 12 Aug 2005, Xu Qiang wrote:

 Hi, all:

 I don't know if this is the right list to ask this question. But since I
 didn't find a bash script mail list and you guys are always so helpful,
 then...

 Here are an excerpt of a bash script:

 ---

 #!/bin/bash
 # saveLogs.sh - Bourne Again Shell script
 ...
   find / -type f -name core -print | while read COREFILE ; do
   NCOREFILES=$[ $NCOREFILES + 1 ] # a bit strange - xq
   echo $NCOREFILES # xq
 ...
 ---
 ...
 1. The way of incrementing the variable NCOREFILES. Why does it use the
 formula of NCOREFILES=$[ $NCOREFILES + 1 ], and not the direct way of
 NCOREFILES=$NCOREFILES+1?

If that was done, NCOREFILES would end up looking like:

+1+1+1+1+1+1+1

as bash does not automatically differentiate between numeric and string
variables. What the funky looking construct does is convince bash to treat
it as a numeric/arithmetic expression.

 2. What confused me most is the value of the variable NCOREFILES
 ($NCOREFILES). Say there is just 1 core file, then because the initial
 value of NCOREFILES is 0, it will be incremented to 1. Yes, this is the
 value in the do-while loop. But outside the loop and if-block, the value
 of NCOREFILES is reverted back to 0 - its initial value.

 It is a local variable, so any modification to it should be valid as
 long as we are still in the scope of the function, right? I am really
 lossed at this phenomenon.

As soon as you used the pipe, to the while, you entered a sub-shell.
There's no way (that I'm aware of anyways) to get the sub-shell's
variables sent back up to the parent.

You could do something along the lines of:

for cf in `find -X -type f -name core -print`
do
# do stuff with $cf
NCOREFILES=$[ $NCOREFILES + 1 ]
done

(the -X helps prevent problems when faced with spaces embedded in the
path)

By the way, on FreeBSD systems, the default core filename format is
%N.core (see man core) where %N is the name of the program that
dumped. You'd need to expand your find with -name \*.core . If you wanted
to get really fancy, you could parse sysctl kern.corefile to find out
the current filename format and use that in your find, but most people
leave that sysctl at its default, so it probably wouldn't be necessary.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


RE: Help on bash script?

2005-08-11 Thread dpk
On Fri, 12 Aug 2005, Xu Qiang wrote:

 This is my test script:

 -
 #!/bin/bash

 var=0
 var=$[3]

 vari=0
 ++vari

 echo $var
 echo $vari
 -

 The result is:
 ./test.sh: ++vari: command not found
 3
 0

 So the manual of bash is incorrect?

 Regards,
 Xu Qiang

It will work with either 'let' or within an 'arithmetic expansion':

$[++var]
let ++var

By the way, there is another syntax, from the man page, that seems to
operate identically:

$((++var)) and $((var+1))
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: fdisk maximum partition size

2005-08-10 Thread dpk
On Wed, 10 Aug 2005, Valerio daelli wrote:

 Hi all
 does anyone know any maximum limit to the slice size in freebsd?
 We would like to create a filesystem of 2.2T.
 Is there any limit in fdisk and bsdlabel?
 Thanks a lot

fdisk does not support 2TB partitions -- at least, I've been unable to
get it to work, just gives me zeros. From the advice I've read, what you'd
be best off doing is skipping fdisk and labeling and just run newfs
against the drive itself. IE:

newfs /dev/da0

Of course, this means you will not be able to boot off of the large
device. That's the price we pay for being on the cutting edge.

There are still several utilities that do not yet handle large partitions
-- the project page is available here:

http://www.freebsd.org/projects/bigdisk/

Another option for you is available if you're using the 3Ware cards: you
can enable auto-carving in its BIOS, to split the device into 2TB
chunks. FreeBSD 5.4-RELEASE can only read the first of these chunks, but
if you update to 5.4-STABLE, with its new twa driver, you can access the
other chunks. (* There may be a better term than chunks, but slices
and partitions are already used interchangably (and non-interchangably),
so I'm going with chunks.) This works out pretty well in practice except
for having to divide files between mount points by hand.

If you run into any panics or bugs about this please open PRs. There only
seems to be a few of us FreeBSD users with multi-terabyte partitions and
there are still some bugs to be worked out. You may not be able to get a
kernel dump (I wasn't, even with 5.4-R and 1TB partitions (had to put the
server into production before I could report this bug unfortunately))
but you could still get a backtrace for the developers to use.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: FreeBSD 5.4-RELEASE-p5 panic

2005-08-03 Thread dpk
On Tue, 2 Aug 2005, dpk wrote:

 (Another panic I would get would follow roughly the same path except it
 would die while trying to unlock a vnode lock that the thread didn't own.
 I'll try to get this information some time, too.)

Here's the backtrace from that panic:

#0  kdb_enter (msg=0x12 Address 0x12 out of bounds) at 
../../../kern/subr_kdb.c:266
#1  0xc033ea1f in panic (fmt=0xc04c99ff lockmgr: thread %p, not %s %p 
unlocking)
at ../../../kern/kern_shutdown.c:550
#2  0xc0333181 in lockmgr (lkp=0xc61f5e14, flags=6, interlkp=0x100, td=0x0)
at ../../../kern/kern_lock.c:419
#3  0xc038b08b in vop_stdunlock (ap=0x12) at ../../../kern/vfs_default.c:295
#4  0xc038af3b in vop_defaultop (ap=0x0) at ../../../kern/vfs_default.c:157
#5  0xc03010bb in spec_vnoperate (ap=0x0) at ../../../fs/specfs/spec_vnops.c:118
#6  0xc0301648 in spec_write (ap=0xeb858a94) at vnode_if.h:1044
#7  0xc03010bb in spec_vnoperate (ap=0x0) at ../../../fs/specfs/spec_vnops.c:118
#8  0xc0452ecd in vnode_pager_generic_putpages (vp=0xc61f5d68, m=0xeb858bf0, 
bytecount=4096,
flags=0, rtvals=0xeb858b70) at vnode_if.h:432
#9  0xc038b7e2 in vop_stdputpages (ap=0x12) at ../../../kern/vfs_default.c:650
#10 0xc038af3b in vop_defaultop (ap=0x0) at ../../../kern/vfs_default.c:157
#11 0xc03010bb in spec_vnoperate (ap=0x0) at ../../../fs/specfs/spec_vnops.c:118
#12 0xc0452c6a in vnode_pager_putpages (object=0xc085e7bc, m=0x12, count=18, 
sync=0, rtvals=0x12)
at vnode_if.h:1357
#13 0xc044a603 in vm_pageout_flush (mc=0xeb858bf0, count=1, flags=0) at 
vm_pager.h:147
#14 0xc044a52d in vm_pageout_clean (m=0x0) at ../../../vm/vm_pageout.c:347
#15 0xc044b3df in vm_pageout_scan (pass=0) at ../../../vm/vm_pageout.c:996
#16 0xc044c162 in vm_pageout () at ../../../vm/vm_pageout.c:1487
#17 0xc032911d in fork_exit (callout=0xc044be50 vm_pageout, arg=0x0, 
frame=0xeb858d48)
at ../../../kern/kern_fork.c:791
#18 0xc0474fcc in fork_trampoline () at ../../../i386/i386/exception.s:209

Again, vm_pageout_clean is being called with a NULL argument, and
eventually the spec_vnoperate function is called with a NULL (the other
panic, ufs_vnoperate was called with a NULL).

These couple of panics are relatively easy to reproduce on demand.

Interestingly (I think), vm_pageout_flush's m argument was the same with
each panic: 0xeb858bf0 .

That is decimal 3,951,397,872 . When you boot these servers without PAE
enabled, the real memory is 3,757,965,312. I think this indicates that
the page vnode_pager_generic_putpages is dealing with is within the PAE
range (I don't know exactly how to describe that). This could be a total
long shot, but I think it's unlikely that both panics would have something
like that in common without it being a bug of some sort.

If there's somewhere else I should be sending these please let me know.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


FreeBSD 5.4-RELEASE-p5 panic

2005-08-02 Thread dpk
After much struggling (documented elsewhere) I have a backtrace showing
one of a handful of panics I am getting on a FreeBSD 5.4-RELEASE-p5
system. The server has 4GB RAM, and is running with PAE and SMP enabled.
If this is not the appropriate list for this, I can send it elsewhere,
please let me know.

(gdb) bt
#0  kdb_enter (msg=0x12 Address 0x12 out of bounds) at 
../../../kern/subr_kdb.c:266
#1  0xc033ea1f in panic (fmt=0xc04d782d ffs_write: dir write) at 
../../../kern/kern_shutdown.c:550
#2  0xc04292de in ffs_write (ap=0xeb858a94) at ../../../ufs/ffs/ffs_vnops.c:614
#3  0xc0452e71 in vnode_pager_generic_putpages (vp=0xc6237630, m=0xeb858bf0, 
bytecount=4096,
flags=0, rtvals=0xeb858b70) at vnode_if.h:432
#4  0xc038b7e2 in vop_stdputpages (ap=0x12) at ../../../kern/vfs_default.c:650
#5  0xc038af3b in vop_defaultop (ap=0x0) at ../../../kern/vfs_default.c:157
#6  0xc0435ebf in ufs_vnoperate (ap=0x0) at ../../../ufs/ufs/ufs_vnops.c:2821
#7  0xc0452c0e in vnode_pager_putpages (object=0xc6901a50, m=0x12, count=18, 
sync=0, rtvals=0x12)
at vnode_if.h:1357
#8  0xc044a5db in vm_pageout_flush (mc=0xeb858bf0, count=1, flags=0) at 
vm_pager.h:147
#9  0xc044a505 in vm_pageout_clean (m=0x0) at ../../../vm/vm_pageout.c:347
#10 0xc044b386 in vm_pageout_scan (pass=1) at ../../../vm/vm_pageout.c:985
#11 0xc044c106 in vm_pageout () at ../../../vm/vm_pageout.c:1476
#12 0xc032911d in fork_exit (callout=0xc044bdf4 vm_pageout, arg=0x0, 
frame=0xeb858d48)
at ../../../kern/kern_fork.c:791
#13 0xc0474f6c in fork_trampoline () at ../../../i386/i386/exception.s:209

(Another panic I would get would follow roughly the same path except it
would die while trying to unlock a vnode lock that the thread didn't own.
I'll try to get this information some time, too.)

This might all trace back to vm_pageout_clean() being called with as NULL
argument. Looking at vm_pageout_clean, it looks as though that should
never happen -- at least, there's nothing there that checks if it is NULL
before it goes on to treat it as a pointer to a struct:

static int
vm_pageout_clean(m)
vm_page_t m;
{
vm_object_t object;
vm_page_t mc[2*vm_pageout_page_count];
int pageout_count;
int ib, is, page_base;
vm_pindex_t pindex = m-pindex;

mtx_assert(vm_page_queue_mtx, MA_OWNED);
VM_OBJECT_LOCK_ASSERT(m-object, MA_OWNED);

In frame #10, vm_pageout_scan:

#10 0xc044b386 in vm_pageout_scan (pass=1) at ../../../vm/vm_pageout.c:985
985 if (vm_pageout_clean(m) != 0) {
(gdb) p m
$65 = 0xc0da66f8
(gdb) p *m
$78 = {pageq = {tqe_next = 0xeb858cb0, tqe_prev = 0xc231c840},
listq = {tqe_next = 0x0, tqe_prev = 0xc6901a88}, left = 0x0,
right = 0x0, object = 0xc6901a50, pindex = 1, phys_addr = 296792064,
md = {pv_list_count = 0, pv_list = {tqh_first = 0x0,
tqh_last = 0xc0da6728}},queue = 33, flags = 4, pc = 11,
wire_count = 0, hold_count = 0, act_count = 0 '\0', busy = 1 '\001',
valid = 255 '', dirty = 255 '', cow = 0}

So it seems as though m is getting lost. What follows that seems to be
undefined behavior. (I have slightly modified the above. valid had a 'y'
shaped upper-8-bit symbol between the quotes, and formatted it to fit in
80 columns).

I'll admit I'm quite green when it comes to debugging kernels, especially
5.x kernels. It gets really tricky when some functions trace back to .h
files, and not all of the variables seem available to the debugger. The
servers appear to work fine without PAE enabled, if that's of interest.

This gdb session is still active and I hope to keep it active in case
there are other commands you'd like me to run that might help shed some
light on the situation.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


FreeBSD 5.4-RELEASE-p5 panics

2005-08-01 Thread dpk
I'm getting some panics on a few FreeBSD 5.4 boxes, and I'm trying to
figure out a way to get them to drop to some sort of debugger, or to dump
core. It used to be that you'd just add:

options DDB
options DDB_UNATTENDED

and build a kernel with -g, set 'dumpon', and then when it paniced it
would write it out to disk. For 5.4, I understand you also need options
KDB, and you need to replace DDB_UNATTENDED with KDB_UNATTENDED.

I've done all of that, but the panic did not trigger a core dump, and did
not leave me at a prompt (the latter probably because of KDB_UNATTENDED?).

The panic message, in the present case, in case someone's seen this
particular problem before:

Fatal trap 12: page fault while in kernel mode.
cpuid = 01; apic id = 06
fault virtual address   = 0x24
fault code  = supervisor read, page not present
instruction pointer = 0x8:0xc035c7cf
stack pointer   = 0x10:0xeb858c74
frame pointer   = 0x10:0xeb858c88
code segment= base 0x0, limit 0xf, type 0x1b
= DPL 0, pres 1, def32 1, gran 1
processor eflags= resume, IOPL equal 0
current process = 8 (pagedaemon)

Would this sort of panic prevent a dump from occuring? BTW: I have checked
the handbook and haven't found the answer (it doesn't seem to mention KDB
at all).

Unfortunately the kernel buffer appears insufficient to store all the
output from boot -v, so I'm booting it with no flags. I've included the
output of dmesg. Following that is the kernel's config. sysctl -a output
is available at http://dpk.net/5.4-sysctl.txt (35KB)

Copyright (c) 1992-2005 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 5.4-RELEASE-p5 #2: Sun Jul 31 20:58:46 PDT 2005
[EMAIL PROTECTED]:/usr/src/sys/i386/compile/STD-PAE
MPTable: INTELLINDENHURST 
Timecounter i8254 frequency 1193182 Hz quality 0
CPU: Intel(R) Xeon(TM) CPU 3.00GHz (3000.12-MHz 686-class CPU)
  Origin = GenuineIntel  Id = 0xf41  Stepping = 1
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  Hyperthreading: 2 logical CPUs
real memory  = 4831838208 (4608 MB)
avail memory = 4194848768 (4000 MB)
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  6
ioapic0: Assuming intbase of 0
ioapic1: Assuming intbase of 24
ioapic2: Assuming intbase of 48
ioapic3: Assuming intbase of 72
ioapic0 Version 2.0 irqs 0-23 on motherboard
ioapic1 Version 2.0 irqs 24-47 on motherboard
ioapic2 Version 2.0 irqs 48-71 on motherboard
ioapic3 Version 2.0 irqs 72-95 on motherboard
npx0: math processor on motherboard
npx0: INT 16 interface
cpu0 on motherboard
cpu1 on motherboard
pcib0: MPTable Host-PCI bridge pcibus 0 on motherboard
pci0: PCI bus on pcib0
pcib1: PCI-PCI bridge irq 16 at device 2.0 on pci0
pci1: PCI bus on pcib1
pcib2: PCI-PCI bridge at device 0.0 on pci1
pci3: PCI bus on pcib2
pcib3: MPTable PCI-PCI bridge at device 0.2 on pci1
pci2: PCI bus on pcib3
3ware device driver for 9000 series storage controllers, version: 2.50.02.012
twa0: 3ware 9000 series Storage Controller port 0xa800-0xa8ff mem 
0xfb80-0xfbff,0xfc8ffc00-0xfc8ffcff irq 72 at device 1.0 on pci2
twa0: 4 ports, Firmware FE9X 2.04.00.005, BIOS BE9X 2.03.01.047
pcib4: PCI-PCI bridge irq 16 at device 3.0 on pci0
pci4: PCI bus on pcib4
pcib5: MPTable PCI-PCI bridge at device 28.0 on pci0
pci5: PCI bus on pcib5
em0: Intel(R) PRO/1000 Network Connection, Version - 1.7.35 port 
0xbc00-0xbc3f mem 0xfc9e-0xfc9f irq 26 at device 1.0 on pci5
em0: Ethernet address: 00:30:48:83:0a:f8
em0:  Speed:N/A  Duplex:N/A
em1: Intel(R) PRO/1000 Network Connection, Version - 1.7.35 port 
0xb800-0xb83f mem 0xfc9a-0xfc9b irq 27 at device 2.0 on pci5
em1: Ethernet address: 00:30:48:83:0a:f9
em1:  Speed:N/A  Duplex:N/A
pci0: serial bus, USB at device 29.0 (no driver attached)
pci0: serial bus, USB at device 29.1 (no driver attached)
pci0: base peripheral at device 29.4 (no driver attached)
pci0: base peripheral, interrupt controller at device 29.5 (no driver 
attached)
pci0: serial bus, USB at device 29.7 (no driver attached)
pcib6: MPTable PCI-PCI bridge at device 30.0 on pci0
pci6: PCI bus on pcib6
pci6: display, VGA at device 2.0 (no driver attached)
isab0: PCI-ISA bridge at device 31.0 on pci0
isa0: ISA bus on isab0
atapci0: Intel 6300ESB UDMA100 controller port 
0xfc00-0xfc0f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 31.1 on pci0
ata0: channel #0 on atapci0
ata1: channel #1 on atapci0
pci0: serial bus, SMBus at device 31.3 (no driver attached)
orm0: ISA Option ROMs at iomem 
0xca800-0xcb7ff,0xc9800-0xca7ff,0xc8000-0xc97ff,0xc-0xc7fff on isa0
pmtimer0 on isa0
atkbdc0: Keyboard controller (i8042) at port 0x64,0x60 on isa0

Syncing disks, buffers remaining - value increases?

2005-08-01 Thread dpk
Sorry for all the questions to the list, we're having a number of issues
deploying FreeBSD 5.4 and we're trying to get an understanding of what
we're seeing.

When I reboot these servers (using reboot on the command line), it will
go through the usual routine I'm familiar with from FreeBSD 4.x, but when
it goes to the syncing disks  phase, sometimes it will do something like
this:

Syncing disks, buffers remaining ... 1 1 1 1 1 2 1 1 1 3 1 1 1 1

Eventually it gives up on 1 or so buffers. When the server comes back up,
it gives warnings that the partitions were all improperly dismounted.

Is it normal for the number of buffers remaining to actually increase
while shutting down? I'm looking at the kernel source and the loop that
performs these syncs appears pretty small. Could something else still be
running on the server that is trying to write to disk, even while in the
shutdown phase? I guess it would only be able to run interrupt threads,
judging by the comments, while it releases Giant.

It may be a coincidence, but on a few occasions, the background fsck may
have still been running when the reboot was issued. I've assumed that
since fsck now runs in the background that it must be able to gracefully
handle reboots.

FreeBSD 5.4-RELEASE-p5, Dual proc Xeons (varying speed), varying memory,
PAE+SMP kernels, 3ware RAID cards (varying capacity, RAID levels).
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Large filesystem woes

2005-07-29 Thread dpk
The resolution to this problem turned out to be enabling the 3Ware 9500S
auto carving option. This splits all partitions into 2TB chunks, which
are then presented to the OS as separate LUNs.

FreeBSD 5.4-RELEASE's driver for the 3Ware card does not support multiple
LUNs, but the new driver (Common Layer) in 5.4-STABLE does. So if you're
running in to this same problem, here's a quick run down of what you need
to do:

1) Enable auto-carving in either the 3ware BIOS or in the 3dm web control
panel.

2) Delete and then re-create any RAID that is larger than 2TB, that you
want accessable from the OS. If you have sufficient drives available, you
can migrate the data to a new RAID, instead of deleting.

3) Install 5.4-RELEASE. sysinstall will show you a single 2TB device. Feel
free to use that device however you wish -- you can fill it to its
boundaries.

4) cvsup to 5.4-STABLE, make buildworld buildkernel installkernel, reboot.

5) You'll now see da1 and perhaps da2 and beyond, in dmesg or camcontrol
devlist.

If you should ever have to reboot into the 5.4-RELEASE GENERIC kernel, the
da1..n partitions will be unavailable. da0, however, will operate
normally. As such it is safe to use as a boot device. (For sanity's sake
it is probably best to create a much smaller slice or partition for system
files, but that's your decision).
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: FreeBSD 5.4+SMP, severe network degredation

2005-07-28 Thread dpk
By the way, I also compared GENERIC performance against GENERIC w/
options SMP added, and had the same results.

On Wed, 27 Jul 2005, dpk wrote:

 We just received several SuperMicro servers, 3.0Ghz Xeon x 2, 4GB RAM.
 They're using the em driver and the ports are set to 1000Mbit (we also
 tried 100Mbit/full duplex on the card and on the switch). They're running
 FreeBSD 5.4.

 I ran a steady ping on a couple of them while they were running GENERIC,
 and then rebooted them with a kernel built with the PAE kernel included
 with the installation, with option SMP added.

 The PAE-SMP-GENERIC kernel was built after cvsup'ing with tag=RELENG_5_4
 and the uname reports 5.4-RELEASE-p5.

 Here are the ping results:

 GENERIC:

 117 packets transmitted, 117 packets received, 0% packet loss
 round-trip min/avg/max/stddev = 0.451/0.554/0.856/0.059 ms

 PAE-SMP-GENERIC:

 102 packets transmitted, 102 packets received, 0% packet loss
 round-trip min/avg/max/stddev = 0.569/4.262/7.944/2.065 ms

 Fetching a 637MB ISO from a local server, also on 100/FDX:

 GENERIC:

 /dev/null 100% of  637 MB   10 MBps 00m00s

 real0m58.071s
 user0m1.954s
 sys 0m6.278s

 PAE-SMP-GENERIC:

 /dev/null 100% of  637 MB 5764 kBps 00m00s

 real1m53.324s
 user0m1.478s
 sys 0m5.624s

 Running GENERIC, systat shows about 7000 interrupts/second, and around 600
 interrupts/second using PAE-SMP-GENERIC, while fetch was running.

 I've checked the errata and hardware notes, as well as gnats, and was not
 able to find anything that explains or matches this behavior. We've run
 SMP servers for years, using 4.5-4.11, but we've never seen the network
 performance cut in half (or pings go up 10x).

 Removing option SMP makes the problem go away, but at a very significant
 performance cost obviously.

 Could it be something from -p5? Is this explained/examined in a PR I've
 missed, and if so can I add some information?

___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: FreeBSD 5.4+SMP, severe network degredation

2005-07-28 Thread dpk
Woah. Well, I guess I didn't try *everything*. Removing device acpi from
the kernel config leaves me witih a PAE+SMP kernel that works fine. I can
fetch files at wire speed and everything.

So, I guess this issue is closed. acpi was the ultimate culprit.

On Thu, 28 Jul 2005, dpk wrote:

 By the way, I also compared GENERIC performance against GENERIC w/
 options SMP added, and had the same results.

 On Wed, 27 Jul 2005, dpk wrote:

  We just received several SuperMicro servers, 3.0Ghz Xeon x 2, 4GB RAM.
  They're using the em driver and the ports are set to 1000Mbit (we also
  tried 100Mbit/full duplex on the card and on the switch). They're running
  FreeBSD 5.4.
 
  I ran a steady ping on a couple of them while they were running GENERIC,
  and then rebooted them with a kernel built with the PAE kernel included
  with the installation, with option SMP added.
 
  The PAE-SMP-GENERIC kernel was built after cvsup'ing with tag=RELENG_5_4
  and the uname reports 5.4-RELEASE-p5.
 
  Here are the ping results:
 
  GENERIC:
 
  117 packets transmitted, 117 packets received, 0% packet loss
  round-trip min/avg/max/stddev = 0.451/0.554/0.856/0.059 ms
 
  PAE-SMP-GENERIC:
 
  102 packets transmitted, 102 packets received, 0% packet loss
  round-trip min/avg/max/stddev = 0.569/4.262/7.944/2.065 ms
 
  Fetching a 637MB ISO from a local server, also on 100/FDX:
 
  GENERIC:
 
  /dev/null 100% of  637 MB   10 MBps 
  00m00s
 
  real0m58.071s
  user0m1.954s
  sys 0m6.278s
 
  PAE-SMP-GENERIC:
 
  /dev/null 100% of  637 MB 5764 kBps 
  00m00s
 
  real1m53.324s
  user0m1.478s
  sys 0m5.624s
 
  Running GENERIC, systat shows about 7000 interrupts/second, and around 600
  interrupts/second using PAE-SMP-GENERIC, while fetch was running.
 
  I've checked the errata and hardware notes, as well as gnats, and was not
  able to find anything that explains or matches this behavior. We've run
  SMP servers for years, using 4.5-4.11, but we've never seen the network
  performance cut in half (or pings go up 10x).
 
  Removing option SMP makes the problem go away, but at a very significant
  performance cost obviously.
 
  Could it be something from -p5? Is this explained/examined in a PR I've
  missed, and if so can I add some information?
 

___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Large filesystem woes

2005-07-28 Thread dpk
On Thu, 21 Jul 2005, Giorgos Keramidas wrote:

 Are you trying to start from scratch, or just making changes to the
 existing table?  If it's the second, then try the -u option:

   # fdisk -u /dev/da0

 This should give you a chance to interactively change the partition
 table of the disk.  If this fails too, please show us the exact command
 line you used and the exact error messages.

 - Giorgos

Unfortunately I can't run fdisk -u, if that command will wipe out the
partition table, since / is also on the array, just in a smaller
partition.

I found this page:

http://www.freebsd.org/projects/bigdisk/

It was last updated in January. Has there been more progress on this
front? I wonder if this is all a known problem, and I just overlooked it
because of my understanding that UFS2 is supposed to handle large
partitions.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Large filesystem woes

2005-07-28 Thread dpk
On Fri, 29 Jul 2005, Giorgos Keramidas wrote:

 It won't wipe away the table.  It will just let you edit the existing
 table interactively, through a series of questions like:

   - Do you want to edit partition 1?
   - Do you want to edit partition 2?
   - Do you want to edit partition 3?
   - Do you want to edit partition 4?
   - Do you want to change the active partition?
   - Do you want to save your changes to the disk?

# df -k
Filesystem  1K-blocksUsedAvail Capacity  Mounted on
/dev/da0s1a  35082074 1147642 31127868 4%/
devfs   1   10   100%/dev
procfs  4   40   100%/proc
# fdisk -u
fdisk: cannot open disk /dev/da0: No such file or directory
# ls -ald /dev/da0*
crw-r-  1 root  operator4,  12 Jul 28 15:56 /dev/da0
crw-r-  1 root  operator4,  13 Jul 28 15:56 /dev/da0s1
crw-r-  1 root  operator4,  18 Jul 28 08:56 /dev/da0s1a
crw-r-  1 root  operator4,  19 Jul 28 15:56 /dev/da0s1b
crw-r-  1 root  operator4,  20 Jul 28 15:56 /dev/da0s1c

truss indicates that fdisk may be getting the error from somewhere else:

stat(/dev/da0,0xbfbfeb30)  = 0 (0x0)
open(/dev/da0,0x2,00)  ERR#1 'Operation not permitted'
open(/dev/da0,0x0,027757765630)= 6 (0x6)
open(/dev/da0s1,0x2,01001210100)   ERR#1 'Operation not permitted'
open(/dev/da0s2,0x2,01001210100)   ERR#2 'No such file or 
directory'
open(/dev/da0s3,0x2,01001210100)   ERR#2 'No such file or 
directory'
open(/dev/da0s4,0x2,01001210100)   ERR#2 'No such file or 
directory'

Because it is using devfs, I'm not able to create these missing slices in
/dev. Most unfortunately, it appears it uses devfs in single user mode as
well, so I can't test the theory.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Large filesystem woes

2005-07-28 Thread dpk
On Thu, 28 Jul 2005, dpk wrote:

 truss indicates that fdisk may be getting the error from somewhere else:

 stat(/dev/da0,0xbfbfeb30)  = 0 (0x0)
 open(/dev/da0,0x2,00)  ERR#1 'Operation not 
 permitted'
 open(/dev/da0,0x0,027757765630)= 6 (0x6)
 open(/dev/da0s1,0x2,01001210100)   ERR#1 'Operation not 
 permitted'
 open(/dev/da0s2,0x2,01001210100)   ERR#2 'No such file or 
 directory'
 open(/dev/da0s3,0x2,01001210100)   ERR#2 'No such file or 
 directory'
 open(/dev/da0s4,0x2,01001210100)   ERR#2 'No such file or 
 directory'

 Because it is using devfs, I'm not able to create these missing slices in
 /dev. Most unfortunately, it appears it uses devfs in single user mode as
 well, so I can't test the theory.

Under a chrooted shell, I created the s1-4 /dev entries. fdisk -u now
reports 'Device not configured', which supports the theory that fdisk is
just using whatever error it last sees and gives up.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Large filesystem woes

2005-07-28 Thread dpk
On Fri, 29 Jul 2005, Giorgos Keramidas wrote:

 Hmmm, in multiuser mode, your root filesystem is mounted as read-write
 and it resides in da0, so GEOM will forbid opening the disk device in
 read-write mode for editing the partition table.

 In single user mode, devfs is still used, but your root filesystem
 should be mounted read-only (unless you manually mount it as
 read-write), so fdisk -u should work.

I've remounted the disk readonly, and was still seeing the same errors.
I've looked at the fdisk source and compared it to the truss output.

... /* rwmode is O_RDWR due to -u */
fd = open(disk, rwmode);
... /* errno is EPERM here, from truss */
if (fd == -1  errno == ENXIO)
return -2;
if (fd == -1  errno == EPERM  rwmode == O_RDWR) {
... /* this is successful: */
fd = open(disk, O_RDONLY);
... /* the following opens get device not configured, or no such file or
directory under normal operation, from truss */
for (p = 1; p  5; p++) {
asprintf(s, %ss%d, disk, p);
fdw = open(s, O_RDONLY);
free(s);
if (fdw == -1)
continue;
break;
}
... /* ah ha! open_disk is returning -4 because the last slice had some
error */
if (fdw == -1)
return -4;

This change was introduced with version 1.67 of the fdisk.c file.

Commenting out if (fdw == -1) return -4; allows fdisk -u to function. Here
are the results, not changing anything:

# ./fdisk -u
*** Working on device /dev/da0 ***
parameters extracted from in-core disklabel are:
cylinders=534921 heads=255 sectors/track=63 (16065 blks/cyl)

Figures below won't work with BIOS for partitions not in cyl 1
parameters to be used for BIOS calculations are:
cylinders=534921 heads=255 sectors/track=63 (16065 blks/cyl)

Do you want to change our idea of what BIOS thinks ? [n]
Media sector size is 512
Warning: BIOS sector numbering starts with sector 1
Information from DOS bootblock is:
The data for partition 1 is:
sysid 165 (0xa5),(FreeBSD/NetBSD/386BSD)
start 63, size 75489372 (36860 Meg), flag 80 (active)
beg: cyl 0/ head 1/ sector 1;
end: cyl 1023/ head 254/ sector 63
Do you want to change it? [n]
The data for partition 2 is:
UNUSED
Do you want to change it? [n]
The data for partition 3 is:
UNUSED
Do you want to change it? [n]
The data for partition 4 is:
UNUSED
Do you want to change it? [n]
Partition 1 is marked active
Do you want to change the active partition? [n]

Here are the results of ./fdisk -u when trying to add the second
partition:

# ./fdisk -u
*** Working on device /dev/da0 ***
parameters extracted from in-core disklabel are:
cylinders=534921 heads=255 sectors/track=63 (16065 blks/cyl)

Figures below won't work with BIOS for partitions not in cyl 1
parameters to be used for BIOS calculations are:
cylinders=534921 heads=255 sectors/track=63 (16065 blks/cyl)

Do you want to change our idea of what BIOS thinks ? [n] n
Media sector size is 512
Warning: BIOS sector numbering starts with sector 1
Information from DOS bootblock is:
The data for partition 1 is:
sysid 165 (0xa5),(FreeBSD/NetBSD/386BSD)
start 63, size 75489372 (36860 Meg), flag 80 (active)
beg: cyl 0/ head 1/ sector 1;
end: cyl 1023/ head 254/ sector 63
Do you want to change it? [n] n
The data for partition 2 is:
UNUSED
Do you want to change it? [n] y
Supply a decimal value for sysid (165=FreeBSD) [0] 165
Supply a decimal value for start [0]
Supply a decimal value for size [0]
fdisk: ERROR: size of partition is zero
fdisk: ERROR: failed to adjust; setting sysid to 0
Explicitly specify beg/end address ? [n] y
Supply a decimal value for beginning cylinder [0] 1024
Supply a decimal value for beginning head [0]
Supply a decimal value for beginning sector [0]
Supply a decimal value for ending cylinder [0] 534921
Supply a decimal value for ending head [0] 254
Supply a decimal value for ending sector [0] 63
sysid 0 (),(unused)
start 0, size 0 (0 Meg), flag 0
beg: cyl 0/ head 0/ sector 0;
end: cyl 393/ head 254/ sector 63

Writing it fails, but I didn't really expect it to succeed with the above
values. There's some overflowing going on.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


FreeBSD 5.4+SMP, severe network degredation

2005-07-27 Thread dpk
We just received several SuperMicro servers, 3.0Ghz Xeon x 2, 4GB RAM.
They're using the em driver and the ports are set to 1000Mbit (we also
tried 100Mbit/full duplex on the card and on the switch). They're running
FreeBSD 5.4.

I ran a steady ping on a couple of them while they were running GENERIC,
and then rebooted them with a kernel built with the PAE kernel included
with the installation, with option SMP added.

The PAE-SMP-GENERIC kernel was built after cvsup'ing with tag=RELENG_5_4
and the uname reports 5.4-RELEASE-p5.

Here are the ping results:

GENERIC:

117 packets transmitted, 117 packets received, 0% packet loss
round-trip min/avg/max/stddev = 0.451/0.554/0.856/0.059 ms

PAE-SMP-GENERIC:

102 packets transmitted, 102 packets received, 0% packet loss
round-trip min/avg/max/stddev = 0.569/4.262/7.944/2.065 ms

Fetching a 637MB ISO from a local server, also on 100/FDX:

GENERIC:

/dev/null 100% of  637 MB   10 MBps 00m00s

real0m58.071s
user0m1.954s
sys 0m6.278s

PAE-SMP-GENERIC:

/dev/null 100% of  637 MB 5764 kBps 00m00s

real1m53.324s
user0m1.478s
sys 0m5.624s

Running GENERIC, systat shows about 7000 interrupts/second, and around 600
interrupts/second using PAE-SMP-GENERIC, while fetch was running.

I've checked the errata and hardware notes, as well as gnats, and was not
able to find anything that explains or matches this behavior. We've run
SMP servers for years, using 4.5-4.11, but we've never seen the network
performance cut in half (or pings go up 10x).

Removing option SMP makes the problem go away, but at a very significant
performance cost obviously.

Could it be something from -p5? Is this explained/examined in a PR I've
missed, and if so can I add some information?
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Large filesystem woes

2005-07-20 Thread dpk
On Wed, 20 Jul 2005, Lowell Gilbert wrote:

 dpk [EMAIL PROTECTED] writes:

  Has anyone here had any luck whatsoever getting FreeBSD 5.4-RELEASE to use
  4TB of space from a 4TB RAID array? It is apparent that there is a
  1TB/slice limit, and sysinstall doesn't seem to be able to handle creating
  more than 2 slices.

 Since you're using slices, I would try working with fdisk directly...

fdisk gives a No such file or directory error when you run fdisk -i
/dev/da0, but fdisk /dev/da0 shows the partition table.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Large filesystem woes

2005-07-19 Thread dpk
Has anyone here had any luck whatsoever getting FreeBSD 5.4-RELEASE to use
4TB of space from a 4TB RAID array? It is apparent that there is a
1TB/slice limit, and sysinstall doesn't seem to be able to handle creating
more than 2 slices.

I read somewhere that people recommend using gpt to manage large
filesystems. Unfortunately, this tool is not to be found on the fixit
floppy or the mfsroot during installation. If the server is up, gpt
complains about Operation not permitted because the disk is in use.

I managed to get gpt on a floppy -- along with a hand written cp/mkdir
all-in-one binary because, when you use the Fixit option it does not
chroot you to the fixit floppy, meaning you'll need /libexec/ld-elf.so.1
and /lib/libc.so.5, because for some reason critical system utilities are
built dynamically now but I am straying from the topic -- and ran gpt
migrate as recommended in its man page. Now gpt no longer recognizes the
the disk at all. gpt show fails entirely. gpt -vvv show lists some
entries, but the sizes are all very wrong, as if they were truncated.

Anyone know the specific tricks involved in getting all 4TB used, even if
it has to be in 4 different slices, or am I stuck with using just 2TB?
We're gonna have to go with Linux on this particular server since we need
it up today but we'll be getting another one soon and I can try things
there.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Server throwing NMI ISA 20, EISA ff messages

2004-11-28 Thread dpk
We have various servers, with various versions of FreeBSD throwing these
NMI messages:

NMI ISA 3c, EISA ff
NMI ISA 2c, EISA ff
NMI ISA 20, EISA ff

There's no information about this anywhere that I can see, but at least
two other people have had the problem according to Google searches.

If tripwire is running while one of these messages appears, sometimes it
shows false alarms about files, indicating either memory or drive
controller problems, maybe?

At first we thought it might just be dodgy hardware. We were only getting
the first two errors on some of our older 2U white box machines. However,
this morning, we just got the third error (just prior to a complete
freeze) on a modern Dell 1750, 1GB RAM, 2.8Ghz.

The kernel source reveals nothing of use about what these errors could be.
Does anyone have a chart of what the numbers next to ISA mean, or have any
ideas where I should turn for answers?

Thanks! I'm off-list, so please Cc: me on any replies.
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Memory leak on 5.1-RELEASE?

2004-01-31 Thread dpk
 leak in 5.1, since I've
 seen running 5.1 just fine on a PE2650 with 2 GB RAM. You shouldn't
 rely on top too much acually. Vmstat is a better program when looking
 at memory.

 Cheers,

 Jorn
 - Original Message -
 From: dpk [EMAIL PROTECTED]
 To: [EMAIL PROTECTED]
 Sent: Saturday, January 31, 2004 1:10 AM
 Subject: Memory leak on 5.1-RELEASE?


  (I'm not a member of the list; please Cc me on any replies.)
 
  We're running Apache 1.3.28 on a 5.1-RELEASE machine. It's a Dell PE 2650
  w/ 2GB RAM. The site contains a lot of large files (multi-megabyte) -
  otherwise there's nothing unusual running.
 
  The Active memory use, according to top, seems rather high:
 
  last pid: 21487;  load averages:  0.19,  0.33,  0.32up 2+16:45:20
 15:52:21
  76 processes:  1 running, 75 sleeping
  CPU states:  0.5% user,  0.0% nice,  4.0% system,  1.4% interrupt, 94.2%
 idle
  Mem: 1413M Active, 187M Inact, 299M Wired, 93M Cache, 112M Buf, 2632K Free
  Swap: 1024M Total, 21M Used, 1003M Free, 2% Inuse
 
  We can't seem to get the Active number down much, even after stopping
  Apache it still stays around 1100M. There's no shared memory in use, and
  nothing in vmstat -m seems to indicate where the missing memory is. top,
  sorting by size, does not indicate anything unusual either.
 
  sysctl vm.vmtotal says:
 
  vm.vmtotal:
  System wide totals computed every five seconds: (values in kilobytes)
  ===
  Processes:  (RUNQ: 1 Disk Wait: 0 Page Wait: 0 Sleep: 76)
  Virtual Memory: (Total: 8172K, Active 636472K)
  Real Memory:(Total: 2051312K Active 389176K)
  Shared Virtual Memory:  (Total: 16436K Active: 11760K)
  Shared Real Memory: (Total: 6004K Active: 4436K)
  Free Memory Pages:  79228K
 
  whereas on other servers, the Real Memory Active number seems to match
  the one found in top, on this one it is about 1GB lower.
 
  A similar machine running Apache on 5.1-R, generally serving smaller
  files, has the same problem in a smaller scale (about 640M even when
  Apache is stopped).
 
  Are there any other data that I should send to help diagnose this problem,
  or any programs I can run to try and track this stray memory use down?
 
  - dpk
  ___
  [EMAIL PROTECTED] mailing list
  http://lists.freebsd.org/mailman/listinfo/freebsd-questions
  To unsubscribe, send any mail to
 [EMAIL PROTECTED]
 


___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Memory leak on 5.1-RELEASE?

2004-01-30 Thread dpk
(I'm not a member of the list; please Cc me on any replies.)

We're running Apache 1.3.28 on a 5.1-RELEASE machine. It's a Dell PE 2650
w/ 2GB RAM. The site contains a lot of large files (multi-megabyte) -
otherwise there's nothing unusual running.

The Active memory use, according to top, seems rather high:

last pid: 21487;  load averages:  0.19,  0.33,  0.32up 2+16:45:20  15:52:21
76 processes:  1 running, 75 sleeping
CPU states:  0.5% user,  0.0% nice,  4.0% system,  1.4% interrupt, 94.2% idle
Mem: 1413M Active, 187M Inact, 299M Wired, 93M Cache, 112M Buf, 2632K Free
Swap: 1024M Total, 21M Used, 1003M Free, 2% Inuse

We can't seem to get the Active number down much, even after stopping
Apache it still stays around 1100M. There's no shared memory in use, and
nothing in vmstat -m seems to indicate where the missing memory is. top,
sorting by size, does not indicate anything unusual either.

sysctl vm.vmtotal says:

vm.vmtotal:
System wide totals computed every five seconds: (values in kilobytes)
===
Processes:  (RUNQ: 1 Disk Wait: 0 Page Wait: 0 Sleep: 76)
Virtual Memory: (Total: 8172K, Active 636472K)
Real Memory:(Total: 2051312K Active 389176K)
Shared Virtual Memory:  (Total: 16436K Active: 11760K)
Shared Real Memory: (Total: 6004K Active: 4436K)
Free Memory Pages:  79228K

whereas on other servers, the Real Memory Active number seems to match
the one found in top, on this one it is about 1GB lower.

A similar machine running Apache on 5.1-R, generally serving smaller
files, has the same problem in a smaller scale (about 640M even when
Apache is stopped).

Are there any other data that I should send to help diagnose this problem,
or any programs I can run to try and track this stray memory use down?

- dpk
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]