Doh! Forgot the attachment ;)

----- Forwarded message from [EMAIL PROTECTED] -----

Date: Thu, 21 Oct 1999 16:42:40 -0400
From: [EMAIL PROTECTED]
To: David Calder <[EMAIL PROTECTED]>
Cc: [EMAIL PROTECTED]
Subject: Re: Stonebeat on AIX
X-Mailer: Mutt 0.95.1i
In-Reply-To: <[EMAIL PROTECTED]>; from David Calder on Wed, Oct 20, 1999 at 08:26:20AM +0000

Attached is an excerpt from our log files.  It seems that we get a "Broken pipe"
error message every time this happens.  I'm not sure what is causing it, but any
help would be greatly appreciated.

-Rolando

On Wed, Oct 20, 1999 at 08:26:20AM +0000, David Calder wrote:
> We are running Stonebeat V3 with all the latest patches. When the problem
> occurs, the following messages appear in the syslog (priority crit); you
> could check to see if you get the same. Does anyone have any ideas?
> (slrsdc11 is the hostname of our secondary FW.)
> 
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: error reading active heartbeat link 0
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: heartbeat link 0 closed, backup link 1 activated
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: heartbeat link 0 closed at duplication step 4
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: heartbeat duplication failed at step 4
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: error reading active heartbeat link 1
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: heartbeat link 1 closed at duplication step 0
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd-online[7212]: start
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd-online[7212]: switching secondary online
> 15:34:57 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: went online by heartbeat failure
> 
> 
> >
> >We're having the same problem. We've tried just about everything: patching
> >the OS, reconfiguring the network to free up some bandwidth, etc.
> >
> >We're running Solaris and Stonebeat 2.x. What version of Stonebeat are you
> >running?
> >
> >On Mon, Oct 18, 1999 at 11:10:45AM +0000, David Calder wrote:
> > > My company has decided to implement FireWall-1 on AIX using Stonebeat
> > > for High-Availability. Everything seems to be working well except for
> > > Stonebeat.
> > >
> > > At seemingly random times it has a problem establishing the heartbeat
> > > between the two firewalls, so it will (wrongly) deduce that the primary
> > > firewall is down, lock it offline and bring up the secondary. This is
> > > happening hourly at least! Not good!!
> > >
> > > There are two heartbeat connections: one is through the network, the
> > > other is a crossover cable. Can ANYONE help? Everyone, including
> > > Stonesoft, seems to be stumped. Has anyone had experience with Stonebeat
> > > on AIX? Any suggestions?
> > >
> 

----- End forwarded message -----
Oct 21 02:59:59 pwall01 sbd[512]: write: Broken pipe
Oct 21 02:59:59 pwall01 sbd[512]: write: Broken pipe
Oct 21 02:59:59 pwall01 sbd[512]: connection_write: active channel 0 : write failed
Oct 21 02:59:59 pwall01 sbd[512]: connection_write: active channel 0 : write failed
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 0 closed, backup channel 1 activated
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 0 closed, backup channel 1 activated
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 0 closed, sync=4
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 0 closed, sync=4
Oct 21 02:59:59 pwall01 sbd[512]: heartbeat duplication failed at step 4
Oct 21 02:59:59 pwall01 sbd[512]: heartbeat duplication failed at step 4
Oct 21 02:59:59 pwall01 sbd[512]: write: Broken pipe
Oct 21 02:59:59 pwall01 sbd[512]: connection_write: active channel 1 : write failed
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 1 closed, sync=0
Oct 21 02:59:59 pwall01 sbd[512]: timing: query=6.217 reply=6.216 peer query=0.123 peer reply=6.157 secs
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 0 closed, sync=0
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 1 closed, sync=0
Oct 21 02:59:59 pwall01 sbd[512]: write: Broken pipe
Oct 21 02:59:59 pwall01 sbd[512]: connection_write: active channel 1 : write failed
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 1 closed, sync=0
Oct 21 02:59:59 pwall01 sbd[512]: timing: query=6.217 reply=6.216 peer query=0.123 peer reply=6.157 secs
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 0 closed, sync=0
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 1 closed, sync=0
Oct 21 02:59:59 pwall01 sbd[512]: from 6 --- 8 ---> 4
Oct 21 02:59:59 pwall01 sbd[512]: from 6 --- 8 ---> 4
Oct 21 03:00:03 pwall01 sbd[512]: from 4 --- 7 ---> 6
Oct 21 03:00:03 pwall01 sbd[512]: from 4 --- 7 ---> 6
Oct 21 03:00:03 pwall01 sbd[512]: refused peer's request to lock offline
Oct 21 03:00:03 pwall01 sbd[512]: refused peer's request to lock offline
Oct 21 03:00:09 pwall01 sbd[512]: heartbeat failed : unreplied message
Oct 21 03:00:09 pwall01 sbd[512]: timing: query=3.033 reply=3.020 peer query=3.020 peer reply=15.962 secs
Oct 21 03:00:09 pwall01 sbd[512]: peer channel 0 closed, backup channel 1 activated
Oct 21 03:00:09 pwall01 sbd[512]: peer channel 0 closed, sync=1
Oct 21 03:00:09 pwall01 sbd[512]: heartbeat duplication failed at step 1
Oct 21 03:00:09 pwall01 sbd[512]: peer channel 1 closed, sync=0
Oct 21 03:00:09 pwall01 sbd[512]: from 6 --- 8 ---> 4
Oct 21 03:00:09 pwall01 sbd[512]: heartbeat failed : unreplied message
Oct 21 03:00:09 pwall01 sbd[512]: timing: query=3.033 reply=3.020 peer query=3.020 peer reply=15.962 secs
Oct 21 03:00:09 pwall01 sbd[512]: peer channel 0 closed, backup channel 1 activated
Oct 21 03:00:09 pwall01 sbd[512]: peer channel 0 closed, sync=1
Oct 21 03:00:09 pwall01 sbd[512]: heartbeat duplication failed at step 1
Oct 21 03:00:09 pwall01 sbd[512]: peer channel 1 closed, sync=0
Oct 21 03:00:09 pwall01 sbd[512]: from 6 --- 8 ---> 4
Oct 21 03:00:13 pwall01 sbd[512]: from 4 --- 7 ---> 6
Oct 21 03:00:13 pwall01 sbd[512]: refused peer's request to lock offline
Oct 21 03:00:13 pwall01 sbd[512]: from 4 --- 7 ---> 6
Oct 21 03:00:13 pwall01 sbd[512]: refused peer's request to lock offline
Oct 21 03:00:18 pwall01 sbd[512]: heartbeat duplicated
Oct 21 03:00:18 pwall01 sbd[512]: heartbeat duplicated
Oct 21 03:00:40 pwall01 sbd[512]: both daemons online
Oct 21 03:00:40 pwall01 sbd[512]: both daemons online
Oct 21 03:00:40 pwall01 sbd[512]: from 6 --- 1 ---> 3
Oct 21 03:00:40 pwall01 sbd[512]: from 6 --- 1 ---> 3
Oct 21 03:00:44 pwall01 sbd[512]: went offline, 11 ---> 3
Oct 21 03:00:44 pwall01 sbd[512]: went offline, 11 ---> 3
Oct 21 05:00:49 pwall01 unix: FW-1: log message queue is full
Oct 21 06:12:09 pwall01 sbd[512]: from 3 --- 2 ---> 6
Oct 21 06:12:09 pwall01 sbd[512]: from 3 --- 2 ---> 6
Oct 21 06:12:13 pwall01 sbd[512]: went online, 14 ---> 6
Oct 21 06:12:13 pwall01 sbd[512]: went online, 14 ---> 6
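
For anyone who wants to compare the "timing:" values across a longer log, here
is a minimal Python sketch. It assumes only the line layout seen in the excerpt
above (syslog timestamp, host, sbd[pid], then the four timing fields). The
function names join_wrapped and timing_records are my own and are not part of
Stonebeat, and the script makes no claim about what the four fields mean. It
re-joins entries that a mailer wrapped onto a second line and skips exact
repeats, since each entry appears twice in this log.

```python
import re

# A syslog entry in this excerpt starts with e.g. "Oct 21 02:59:59 ".
STAMP_RE = re.compile(r"^[A-Z][a-z]{2} +\d{1,2} \d\d:\d\d:\d\d ")

# The four numeric fields of an sbd "timing:" entry, in the order logged.
TIMING_RE = re.compile(
    r"sbd\[\d+\]: timing: query=([\d.]+) reply=([\d.]+) "
    r"peer query=([\d.]+) peer reply=([\d.]+) secs"
)


def join_wrapped(lines):
    """Re-attach continuation lines (no leading timestamp) to the previous entry."""
    entries = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if STAMP_RE.match(line) or not entries:
            entries.append(line)
        else:
            entries[-1] += " " + line
    return entries


def timing_records(lines):
    """Return (timestamp, query, reply, peer_query, peer_reply) per distinct entry."""
    records = []
    seen = set()
    for entry in join_wrapped(lines):
        match = TIMING_RE.search(entry)
        if match is None or entry in seen:
            continue  # not a timing entry, or an exact repeat of one already kept
        seen.add(entry)
        timestamp = entry[:15]  # "Mon DD HH:MM:SS" is always 15 characters
        records.append((timestamp, *(float(value) for value in match.groups())))
    return records


if __name__ == "__main__":
    sample = [
        "Oct 21 02:59:59 pwall01 sbd[512]: timing: query=6.217 reply=6.216 peer query=0.123 ",
        "peer reply=6.157 secs",
        "Oct 21 03:00:09 pwall01 sbd[512]: timing: query=3.033 reply=3.020 peer query=3.020 ",
        "peer reply=15.962 secs",
    ]
    for record in timing_records(sample):
        print(record)
```

To run it over a whole file, pass the open file object to timing_records
instead of the sample list.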
