Re: Crashing with HP/Compaq DL360 G3 [paging kernel developer]

2006-06-27 Thread Ted Mittelstaedt
Your Broadcom is one of the problem chips; you can probably see
the workarounds in the driver.  So I would not rule out a problem
with the Ethernet controller.  If you have money to burn on this,
install an Intel PCI Express or 64-bit PCI card, disable the
onboard Ethernet chip, and see what happens.

However, what you really need to do at this time is get your
seemingly random panics to become repeatable.  We start with
the assumption that this is a hardware problem.  PCs in general
have 4 areas of hardware that cause problems:

ethernet
disk IO
memory
cpu

Sometimes video is a problem, but you can check for this
by simply not logging in at the console and doing all your access
over the network.

What you need to do now is go into the ports directories,
build some of the stress-testing utilities, and run them.
Start by running a disk stresser and see if that causes a panic.
Next run a network stresser and see if that causes a panic, then
run a CPU stresser, and so on.  I think you get the idea here.  What
we are trying to do is find a program that can cause the system
to panic on demand, that is, a program that mostly does
only 1 thing: disk, network, memory, etc.

Once you've got that, you can start hardware substitution on
the affected area (if possible).
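
One way to keep track of which stresser was running when a box went
down is a small wrapper like the sketch below.  It is only an
illustration and not something from this thread: the function names
and the log format are made up for the example, and the commands it
runs are whatever stress tools you built from ports.  It runs one
stresser at a time and syncs a START line to disk before each run, so
that after a panic the last START without a matching END points at
the area that was being exercised.

```python
import os
import subprocess
import time


def log_line(path, text):
    """Append one timestamped line and force it to disk before returning."""
    with open(path, "a") as fh:
        fh.write(f"{time.strftime('%Y-%m-%d %H:%M:%S')} {text}\n")
        fh.flush()
        os.fsync(fh.fileno())  # make sure the line survives a sudden reboot


def run_stressers(stressers, log_path, rounds=1):
    """Run each stresser alone, in order, logging its start and its exit.

    stressers: list of (area, argv) pairs; each argv is assumed to be a
    command that mostly exercises one area (disk, network, memory, cpu).
    Returns a list of (area, returncode) for the runs that completed.
    """
    results = []
    for _ in range(rounds):
        for area, argv in stressers:
            log_line(log_path, f"START {area}: {' '.join(argv)}")
            code = subprocess.run(argv).returncode
            log_line(log_path, f"END {area}: exit {code}")
            results.append((area, code))
    return results


def last_unfinished(log_path):
    """Return the area whose START has no matching END, or None."""
    pending = None
    with open(log_path) as fh:
        for line in fh:
            fields = line.split()
            # Expected fields: date, time, START or END, "area:", ...
            if len(fields) >= 4 and fields[2] == "START":
                pending = fields[3].rstrip(":")
            elif len(fields) >= 4 and fields[2] == "END":
                pending = None
    return pending
```

After a crash, calling last_unfinished() on the log from the previous
boot names the test that never finished.  If the same area shows up
across several crashes, that is the one to start swapping hardware on.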

Ted

----- Original Message -----
From: Jeremy Kister [EMAIL PROTECTED]
To: Ted Mittelstaedt [EMAIL PROTECTED]
Cc: freebsd-questions@freebsd.org
Sent: Sunday, June 25, 2006 10:16 PM
Subject: Re: Crashing with HP/Compaq DL360 G3 [paging kernel developer]


> On 6/25/2006 5:51 PM, Ted Mittelstaedt wrote:
> > Please post a dmesg output.
>
> two of them have now crashed:
>
> http://jeremy.kister.net/tmp/unix32.dmesg
> http://jeremy.kister.net/tmp/unix35.dmesg
>
>
> Thanks for your interest
>
> -- 
>
> Jeremy Kister
> http://jeremy.kister.net./
> ___
> freebsd-questions@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-questions
> To unsubscribe, send any mail to [EMAIL PROTECTED]




Re: Crashing with HP/Compaq DL360 G3 [paging kernel developer]

2006-06-25 Thread Jeremy Kister

On 6/23/2006 4:05 AM, Ted Mittelstaedt wrote:

> If you read the complete thread you would have come across this:
>
> http://www.freebsd.org/cgi/query-pr.cgi?pr=83220
>
> And if you read that you would see it was resolved to the dissatisfaction
> of the PR filer.  That is, replace ipf with pf or fix the program.


I replaced ipf with pf on all four machines on Friday.

Unfortunately, one of them just crashed.

Still at a loss.

--

Jeremy Kister
http://jeremy.kister.net./


Re: Crashing with HP/Compaq DL360 G3 [paging kernel developer]

2006-06-25 Thread Ted Mittelstaedt
Please post a dmesg output.

Ted

----- Original Message -----
From: Jeremy Kister [EMAIL PROTECTED]
To: freebsd-questions@freebsd.org
Sent: Sunday, June 25, 2006 12:38 PM
Subject: Re: Crashing with HP/Compaq DL360 G3 [paging kernel developer]


> On 6/23/2006 4:05 AM, Ted Mittelstaedt wrote:
> > If you read the complete thread you would have come across this:
> >
> > http://www.freebsd.org/cgi/query-pr.cgi?pr=83220
> >
> > And if you read that you would see it was resolved to the dissatisfaction
> > of the PR filer.  That is, replace ipf with pf or fix the program.
>
> I replaced ipf with pf on all four machines on Friday.
>
> Unfortunately, one of them just crashed.
>
> Still at a loss.
>
> -- 
>
> Jeremy Kister
> http://jeremy.kister.net./


Re: Crashing with HP/Compaq DL360 G3 [paging kernel developer]

2006-06-25 Thread Jeremy Kister

On 6/25/2006 5:51 PM, Ted Mittelstaedt wrote:

> Please post a dmesg output.


two of them have now crashed:

http://jeremy.kister.net/tmp/unix32.dmesg
http://jeremy.kister.net/tmp/unix35.dmesg


Thanks for your interest

--

Jeremy Kister
http://jeremy.kister.net./


Re: Crashing with HP/Compaq DL360 G3 [paging kernel developer]

2006-06-23 Thread Ted Mittelstaedt

----- Original Message -----
From: Jeremy Kister [EMAIL PROTECTED]
To: freebsd-questions@freebsd.org
Sent: Wednesday, June 21, 2006 12:27 PM
Subject: Re: Crashing with HP/Compaq DL360 G3 [paging kernel developer]


> On 6/21/2006 2:25 AM, Jeremy Kister wrote:
> > I have sudden [seemingly random] reboots with all four (see
> > http://unix.derkeiler.com/Mailing-Lists/FreeBSD/questions/2006-02/msg01605.html)
> > while running FreeBSD 6.0-RELEASE through the PreReleases and now with
> > 6.1-STABLE.
>
> I just found a post suggesting that IPF with SMP is bad.  See:
> http://www.monkey.org/freebsd/archive/freebsd-stable/200507/msg00481.html
>
> I do have IPF running on all four boxes and SMP is obviously also
> configured.
>
> Does anyone have data beyond this that says I should remove IPF from the
> kernel or set ipf_enable=NO in my rc.conf ?  or was this issue
> resolved in FreeBSD 6 ?


If you read the complete thread you would have come across this:

http://www.freebsd.org/cgi/query-pr.cgi?pr=83220

And if you read that you would see it was resolved to the dissatisfaction
of the PR filer.  That is, replace ipf with pf or fix the program.

This is in the category of "Doctor, it hurts when I do this."

Root can rm -r / and once most of the system is gone, it will
probably panic pretty soon.

Root can unmount volumes and make the system panic as well.

Neither operation is a failure of the operating system.

For ipf to work, the program must be given pretty deep control of
the OS.  If the program uses that control to make the system
crash, then who is at fault?  The OS?  I don't think so.

You might also consider that most people use ipfw.

One of the benefits of Open Source is that you can run what
you want instead of running what the rest of the lemmings run.
The downside of this is that if you're doing that and it blows up
on you, you have to fix it yourself.  The developers are going to
fix the things that the rest of the lemmings have problems with
first, before working on off-the-beaten-trail stuff, simply because
there are more lemmings than you.

If you don't like it, see Microsoft - oh I forgot, they handle their
bugs exactly the same way - except you can't even fix them for
them.


Ted



Re: Crashing with HP/Compaq DL360 G3 [paging kernel developer]

2006-06-22 Thread Jeremy Kister

On 6/21/2006 6:01 AM, Ted Mittelstaedt wrote:

> Did you install the Broadcom patch?


Still no takers.  Is there a better place to get the attention of a
willing kernel developer than freebsd-questions?


--

Jeremy Kister
http://jeremy.kister.net./


RE: Crashing with HP/Compaq DL360 G3 [paging kernel developer]

2006-06-21 Thread Ted Mittelstaedt

Did you install the Broadcom patch?

Ted

-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Jeremy Kister
Sent: Tuesday, June 20, 2006 11:26 PM
To: freebsd-questions@freebsd.org
Subject: Crashing with HP/Compaq DL360 G3 [paging kernel developer]


I've got four identical HP/Compaq DL360 G3 1U servers.  Each has the
latest available firmware:

Proliant P31 (03/03/2005) BIOS
HP SmartArray 5i RAID onboard controller (v2.62)
(2) HP NC7781 (Broadcom BCM5703) (ASIC rev. 0x1002) 10/100/1000 onboard
network cards
iLO v1.80 (Jul/12/2005)

and each has:
two Intel Xeon 2.8GHz w/ Hyperthreading
four sticks of 512MB ECC RAM
two 36GB SCA disks in RAID1 via SmartArray 5i

I have sudden [seemingly random] reboots with all four (see
http://unix.derkeiler.com/Mailing-Lists/FreeBSD/questions/2006-02/msg01605.html)
while running FreeBSD 6.0-RELEASE through the PreReleases and now with
6.1-STABLE.


In an effort to make FreeBSD stable with DL360s, I'm wondering if
there's a driver/kernel developer interested in having full remote
access to this machine for a week or two (or three) via iLO and
root/ssh.

Anyone qualified and interested, please let me know.

--

Jeremy Kister
http://jeremy.kister.net./






Re: Crashing with HP/Compaq DL360 G3 [paging kernel developer]

2006-06-21 Thread Jeremy Kister

On 6/21/2006 6:01 AM, Ted Mittelstaedt wrote:

> Did you install the Broadcom patch?


We did apply kern/96806, which we found in our massive Google search
while trying to fix the problem, to one of the four systems.

The machine rebooted about 6 hours after the new code was installed,
so we undid it.


--

Jeremy Kister
http://jeremy.kister.net./


Re: Crashing with HP/Compaq DL360 G3 [paging kernel developer]

2006-06-21 Thread Jeremy Kister

On 6/21/2006 2:25 AM, Jeremy Kister wrote:
> I have sudden [seemingly random] reboots with all four (see
> http://unix.derkeiler.com/Mailing-Lists/FreeBSD/questions/2006-02/msg01605.html)
> while running FreeBSD 6.0-RELEASE through the PreReleases and now with
> 6.1-STABLE.


I just found a post suggesting that IPF with SMP is bad.  See: 
http://www.monkey.org/freebsd/archive/freebsd-stable/200507/msg00481.html


I do have IPF running on all four boxes and SMP is obviously also 
configured.


Does anyone have data beyond this that says I should remove IPF from the 
kernel or set ipf_enable=NO in my rc.conf ?  or was this issue 
resolved in FreeBSD 6 ?


--

Jeremy Kister
http://jeremy.kister.net./


