Re: 5.2.1 SMP problems

2004-06-05 Thread Toomas Aas
Hi!

 Installed 5.2.1 on a high load server - albeit not recommended - and 
 it was working perfectly with SMP support. It picked up 4 cpu's and 
 appeared to be using them all. (The box is a dual 2.4gig Xeon machine). 
 Anyway, the box was as I said under quite a bit of load, and would die 
 about once a day - just reboot. I looked into it a bit and found 
 whenever I put more load on (e.g. a make install in ports) it would 
 panic - vm_load_page or something like that - sorry about being so 
 vague. Anyway I took out SMP support, and now it boots with 2 cpu's - 
 one normal Xeon + HT for the second (no idea how to disable that - but I 
 don't need to)... its been running perfect for 4 days now and I put it 
 under SERIOUS load.. has anyone else had SMP problems on 5.2.1? 

I put 5.2.1-RELEASE on an IBM Netfinity 5000 (dual Pentium III) and I 
had problems, although different from yours. This system has some 
warning LEDs on the motherboard which light up when something is wrong. 
Quite often, when booting, the NMI and PCI1 LEDs would light up, the 
message NMI ISA 34, EISA ff appears among the boot messages and 
system reboots shortly after that. PCI1 LED indicates a problem with 
the first PCI bus, which is where I have IBM ServeRAID controller.

Interesting thing is that sometimes this problem during booting does 
not happen and once the system boots successfully there are never any 
problems.

When I removed the second CPU, the problem disappeared. I am not quite 
convinced yet that the problem is completely gone (haven't rebooted too 
many times) but it certainly has been greatly reduced.

Downgrading the firmware of ServeRAID controller from 6.10 to 5.10 (as 
recommended by another poster on this list) may also have contributed 
to resolving the problem. But the first thing I did was removing the 
2nd CPU and I successfully booted several times between that and 
downgrading the ServeRAID firmware. I *seem* to remember one NMI-boot 
after removing the second CPU, but I'm not quite sure my memory isn't 
failing me - it has been quite a rough ride with this NF5000 during the 
past few days. Anyway, *before* removing the second CPU, the boot 
failure rate was 50%, so I'm sure removing the second CPU did help, 
even if downgrading the ServeRAID firmware may have helped further.
--
Toomas Aas | [EMAIL PROTECTED] | http://www.raad.tartu.ee/~toomas/
* Feeling compressed ARJ you?

___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.2.1 SMP problems

2004-06-04 Thread Albert Shih
 Le 04/06/2004 à 11:47:14+0200, Dave Raven a écrit
 Hi all,
Installed 5.2.1 on a high load server - albeit not recommended - and 
 it was working perfectly with SMP support. It picked up 4 cpu's and 
 appeared to be using them all. (The box is a dual 2.4gig Xeon machine). 
 Anyway, the box was as I said under quite a bit of load, and would die 
 about once a day - just reboot. I looked into it a bit and found 
 whenever I put more load on (e.g. a make install in ports) it would 
 panic - vm_load_page or something like that - sorry about being so 
 vague. Anyway I took out SMP support, and now it boots with 2 cpu's - 
 one normal Xeon + HT for the second (no idea how to disable that - but I 
 don't need to)... its been running perfect for 4 days now and I put it 
 under SERIOUS load.. has anyone else had SMP problems on 5.2.1? I 
 couldn't find anything about it at first glimpse...
 

Have you shure this problem is with FreeBSD ?

First check your memory with some software like memtest86 (http://www.memtest86.com/)

When I've this kind of problem, always it's because some error in RAM.

Regards



--
Albert SHIH
Universite de Paris 7 (Denis DIDEROT)
U.F.R. de Mathematiques.
Heure local/Local time:
Fri Jun 4 13:43:52 CEST 2004
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


RE: 5.2.1 SMP problems

2004-06-04 Thread Dave Raven
Well all the tests I ran (about 2 memory testers and 2 benchmarks) caused
the box to die - even the make installs did half the time. Now that I've
disabled SMP there are no problems, and the same tests are working
perfectly...

The only thing I've changed is the SMP support, and now its been running for
4 days, before it wouldn't run for 12 hours

Thanks
Dave

-Original Message-
From: Albert Shih [mailto:[EMAIL PROTECTED] 
Sent: 04 June 2004 01:47 PM
To: Dave Raven
Cc: [EMAIL PROTECTED]
Subject: Re: 5.2.1 SMP problems


 Le 04/06/2004 à 11:47:14+0200, Dave Raven a écrit
 Hi all,
Installed 5.2.1 on a high load server - albeit not recommended - and 
 it was working perfectly with SMP support. It picked up 4 cpu's and 
 appeared to be using them all. (The box is a dual 2.4gig Xeon machine). 
 Anyway, the box was as I said under quite a bit of load, and would die 
 about once a day - just reboot. I looked into it a bit and found 
 whenever I put more load on (e.g. a make install in ports) it would 
 panic - vm_load_page or something like that - sorry about being so 
 vague. Anyway I took out SMP support, and now it boots with 2 cpu's - 
 one normal Xeon + HT for the second (no idea how to disable that - but I 
 don't need to)... its been running perfect for 4 days now and I put it 
 under SERIOUS load.. has anyone else had SMP problems on 5.2.1? I 
 couldn't find anything about it at first glimpse...
 

Have you shure this problem is with FreeBSD ?

First check your memory with some software like memtest86
(http://www.memtest86.com/)

When I've this kind of problem, always it's because some error in RAM.

Regards



--
Albert SHIH
Universite de Paris 7 (Denis DIDEROT)
U.F.R. de Mathematiques.
Heure local/Local time:
Fri Jun 4 13:43:52 CEST 2004

___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.2.1 SMP problems

2004-06-04 Thread Bill Moran
Dave Raven [EMAIL PROTECTED] wrote:

 Well all the tests I ran (about 2 memory testers and 2 benchmarks) caused
 the box to die - even the make installs did half the time. Now that I've
 disabled SMP there are no problems, and the same tests are working
 perfectly...
 
 The only thing I've changed is the SMP support, and now its been running for
 4 days, before it wouldn't run for 12 hours

You may want to take this problem to hackers@ (with more details about the panic
and the hardware involved - the actual panic message and dmesg output) and see
if anyone is aware of the problem.

Since 5 is in heavy development, it's possible that this problem has already
been dealt with and fixed in 5-CURRENT.  If not, I'm sure the development team
would be interested in your assistance in fixing it.

 
 Thanks
 Dave
 
 -Original Message-
 From: Albert Shih [mailto:[EMAIL PROTECTED] 
 Sent: 04 June 2004 01:47 PM
 To: Dave Raven
 Cc: [EMAIL PROTECTED]
 Subject: Re: 5.2.1 SMP problems
 
 
  Le 04/06/2004 à 11:47:14+0200, Dave Raven a écrit
  Hi all,
 Installed 5.2.1 on a high load server - albeit not recommended - and 
  it was working perfectly with SMP support. It picked up 4 cpu's and 
  appeared to be using them all. (The box is a dual 2.4gig Xeon machine). 
  Anyway, the box was as I said under quite a bit of load, and would die 
  about once a day - just reboot. I looked into it a bit and found 
  whenever I put more load on (e.g. a make install in ports) it would 
  panic - vm_load_page or something like that - sorry about being so 
  vague. Anyway I took out SMP support, and now it boots with 2 cpu's - 
  one normal Xeon + HT for the second (no idea how to disable that - but I 
  don't need to)... its been running perfect for 4 days now and I put it 
  under SERIOUS load.. has anyone else had SMP problems on 5.2.1? I 
  couldn't find anything about it at first glimpse...
  
 
 Have you shure this problem is with FreeBSD ?
 
 First check your memory with some software like memtest86
 (http://www.memtest86.com/)
 
 When I've this kind of problem, always it's because some error in RAM.
 
 Regards
 
 
 
 --
 Albert SHIH
 Universite de Paris 7 (Denis DIDEROT)
 U.F.R. de Mathematiques.
 Heure local/Local time:
 Fri Jun 4 13:43:52 CEST 2004
 
 ___
 [EMAIL PROTECTED] mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-questions
 To unsubscribe, send any mail to [EMAIL PROTECTED]


-- 
Bill Moran
Potential Technologies
http://www.potentialtech.com
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]