Re: EM stability

2006-11-13 Thread Gleb Smirnoff
On Sun, Nov 12, 2006 at 02:26:36PM -0600, Barry Boes wrote:
B After the last hang I added giant locks back in and the machine has
B been up since.
B 
B I don't have a serial console, just a graphic console.  When the
B machine hangs it stops replying to ethernet packets at all protocol
B levels and doesn't respond to keyboard input in any way, virtual
B console or otherwise.  If I run a script of the form
Bwhile(1)
B  sleep 1
B  date  datelog
Bend
B 
B the file stops updating when the machine hangs.
B 
B I will define the debugger in the kernel (options DDB, right?), attach
B a serial console, and do what I can to get more information on the
B problem.

Yes, this looks like something is running in an endless loop. Once
you compile kernel with debugger, you should enter in several times
and see the backtraces. Usually, they will be inside this cycle.

-- 
Totus tuus, Glebius.
GLEBIUS-RIPN GLEB-RIPE
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: EM stability

2006-11-12 Thread Barry Boes

After the last hang I added giant locks back in and the machine has
been up since.

I don't have a serial console, just a graphic console.  When the
machine hangs it stops replying to ethernet packets at all protocol
levels and doesn't respond to keyboard input in any way, virtual
console or otherwise.  If I run a script of the form
   while(1)
 sleep 1
 date  datelog
   end

the file stops updating when the machine hangs.

I will define the debugger in the kernel (options DDB, right?), attach
a serial console, and do what I can to get more information on the
problem.

-Barry


Jack Vogel writes:
  On 11/10/06, Barry Boes [EMAIL PROTECTED] wrote:
  
   Luck ran out.  Hard must press the reset button hang.  No console
   messages.   The system was idle at the time.
  Is there anything you'd like me to do to attempt to narrow down the
   problem or get debugging output?  I do not know if the freeze was
   related to em or something else.
  
  Is this a machine running some graphic head? If not can you see anything
  on the console? Are you sure the machine is dead, like can you get in
  over the network... ? One thing I often do when you are dealing with
  unpredictable hangs is run 'vmstat 3' on one of the virtual terminals.
  
  You might also define the kernel debugger into your kernel, its best to have
  a serial console for this, I've seen the hardware console be locked but the
  serial will still work.
  
  The only way we will track this down is thru repetitive reproduction I'm
  afraid.
  
  Jack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: EM stability

2006-11-10 Thread Gleb Smirnoff
  Hello Barry,

On Fri, Nov 10, 2006 at 08:56:30AM -0600, Barry Boes wrote:
BI see you listed on the EM stability issues list.  I have a Tyan
B H1000S with dual em ports on 6.1, and it won't stay up 5 minutes
B without EM watchdog resets unless I use giant locks.
BIs there any way you'd like me to help you with testing the updated
B drivers?

Yes, please upgrade to the latemost RELENG_6 via cvsup, build a new
kernel and report whether the problem is fixed or not.

You see, I have added a o lot of people and two mailing lists to Cc.
Please do not remove them, when replying. Thanks!

-- 
Totus tuus, Glebius.
GLEBIUS-RIPN GLEB-RIPE
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: EM stability

2006-11-10 Thread Barry Boes

So far so good.  I updated to the latest, including jfv's revision
1.65.2.21 from this AM.

With the 6.1 ISO distribution, I would get watchdogs within seconds of
starting a file transfer (except giant locked which worked fine).

With RELENG_6 I've transfered 100's of GB via ftp and NFS over both
ethernet ports and no problems yet.

Thanks for all the hard work!
Barry



Gleb Smirnoff writes:
Hello Barry,
  
  On Fri, Nov 10, 2006 at 08:56:30AM -0600, Barry Boes wrote:
  BI see you listed on the EM stability issues list.  I have a Tyan
  B H1000S with dual em ports on 6.1, and it won't stay up 5 minutes
  B without EM watchdog resets unless I use giant locks.
  BIs there any way you'd like me to help you with testing the updated
  B drivers?
  
  Yes, please upgrade to the latemost RELENG_6 via cvsup, build a new
  kernel and report whether the problem is fixed or not.
  
  You see, I have added a o lot of people and two mailing lists to Cc.
  Please do not remove them, when replying. Thanks!
  
  -- 
  Totus tuus, Glebius.
  GLEBIUS-RIPN GLEB-RIPE
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: EM stability

2006-11-10 Thread Barry Boes

Luck ran out.  Hard must press the reset button hang.  No console
messages.   The system was idle at the time.
   Is there anything you'd like me to do to attempt to narrow down the
problem or get debugging output?  I do not know if the freeze was
related to em or something else.

-Barry


Barry Boes writes:
  
  So far so good.  I updated to the latest, including jfv's revision
  1.65.2.21 from this AM.
  
  With the 6.1 ISO distribution, I would get watchdogs within seconds of
  starting a file transfer (except giant locked which worked fine).
  
  With RELENG_6 I've transfered 100's of GB via ftp and NFS over both
  ethernet ports and no problems yet.
  
  Thanks for all the hard work!
  Barry
  
  
  
  Gleb Smirnoff writes:
  Hello Barry,

On Fri, Nov 10, 2006 at 08:56:30AM -0600, Barry Boes wrote:
BI see you listed on the EM stability issues list.  I have a Tyan
B H1000S with dual em ports on 6.1, and it won't stay up 5 minutes
B without EM watchdog resets unless I use giant locks.
BIs there any way you'd like me to help you with testing the updated
B drivers?

Yes, please upgrade to the latemost RELENG_6 via cvsup, build a new
kernel and report whether the problem is fixed or not.

You see, I have added a o lot of people and two mailing lists to Cc.
Please do not remove them, when replying. Thanks!

-- 
Totus tuus, Glebius.
GLEBIUS-RIPN GLEB-RIPE
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: EM stability

2006-11-10 Thread Gleb Smirnoff
On Fri, Nov 10, 2006 at 04:28:30PM -0600, Barry Boes wrote:
B 
B Luck ran out.  Hard must press the reset button hang.  No console
B messages.   The system was idle at the time.
BIs there anything you'd like me to do to attempt to narrow down the
B problem or get debugging output?  I do not know if the freeze was
B related to em or something else.

In cases like this you need to prepare a kernel with debugger compiled
in and try to exit into the debugger, when the hang occurs. You can
try keyboard debugger sequence, and if it fails try serial break.

-- 
Totus tuus, Glebius.
GLEBIUS-RIPN GLEB-RIPE
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: EM stability

2006-11-10 Thread Jack Vogel

On 11/10/06, Barry Boes [EMAIL PROTECTED] wrote:


Luck ran out.  Hard must press the reset button hang.  No console
messages.   The system was idle at the time.
   Is there anything you'd like me to do to attempt to narrow down the
problem or get debugging output?  I do not know if the freeze was
related to em or something else.


Is this a machine running some graphic head? If not can you see anything
on the console? Are you sure the machine is dead, like can you get in
over the network... ? One thing I often do when you are dealing with
unpredictable hangs is run 'vmstat 3' on one of the virtual terminals.

You might also define the kernel debugger into your kernel, its best to have
a serial console for this, I've seen the hardware console be locked but the
serial will still work.

The only way we will track this down is thru repetitive reproduction I'm
afraid.

Jack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]