Doh! forgot the attachment;)

----- Forwarded message from [EMAIL PROTECTED] -----

Date: Thu, 21 Oct 1999 16:42:40 -0400
From: [EMAIL PROTECTED]
To: David Calder <[EMAIL PROTECTED]>
Cc: [EMAIL PROTECTED]
Subject: Re: Stonebeat on AIX
X-Mailer: Mutt 0.95.1i
In-Reply-To: <[EMAIL PROTECTED]>; from David Calder on Wed, Oct 20, 1999 at 08:26:20AM +0000

Attached is an excerpt from our log files. It seems that we have a broken
pipe error msg every time this happens. I'm not sure what is causing this,
but any help would be greatly appreciated.

-Rolando

On Wed, Oct 20, 1999 at 08:26:20AM +0000, David Calder wrote:
> We are running Stonebeat V3 with all the latest patches. When the problem
> occurs the following messages appear in the syslog (priority crit) you could
> check to see if you get the same. Does anyone have any ideas (slrsdc11 is
> the hostname of our secondary FW)??
>
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: error reading active heartbeat link 0
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: heartbeat link 0 closed, backup link 1 activated
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: heartbeat link 0 closed at duplication step 4
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: heartbeat duplication failed at step 4
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: error reading active heartbeat link 1
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: heartbeat link 1 closed at duplication step 0
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd-online[7212]: start
> 15:34:52 FW-SLRSDC11 Message forwarded from slrsdc11: sbd-online[7212]: switching secondary online
> 15:34:57 FW-SLRSDC11 Message forwarded from slrsdc11: sbd[7622]: went online by heartbeat failure
>
>
> >We're having the same problem, we've tried just about everything - patching
> >the os, reconfiguring the network to clear up some bandwidth, etc...
> >
> >We're running Solaris and Stonebeat 2.x What version of Stonebeat are you
> >running?
> >
> >On Mon, Oct 18, 1999 at 11:10:45AM +0000, David Calder wrote:
> > > My company has decided to implement FireWall-1 on AIX using Stonebeat for
> > > High-Availability. Everything seems to be working well except for stonebeat.
> > >
> > > At seemingly random times it has a problem establishing the heartbeat
> > > between the two Firewalls so it will (wrongly) deduce that the primary
> > > firewall is down, lock it offline and bring up the secondary. This is
> > > happening hourly at least! Not good!!
> > >
> > > There are two heart beat connections, one is through the network the other
> > > is a crossover cable. Can ANYONE help. Everyone including Stonesoft seems to
> > > be stumped. Has anyone had experience with Stonebeat on AIX?? Any
> > > suggestions??
> > >
>
> ______________________________________________________
> Get Your Private, Free Email at http://www.hotmail.com

----- End forwarded message -----
Oct 21 02:59:59 pwall01 sbd[512]: write: Broken pipe
Oct 21 02:59:59 pwall01 sbd[512]: write: Broken pipe
Oct 21 02:59:59 pwall01 sbd[512]: connection_write: active channel 0 : write failed
Oct 21 02:59:59 pwall01 sbd[512]: connection_write: active channel 0 : write failed
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 0 closed, backup channel 1 activated
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 0 closed, backup channel 1 activated
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 0 closed, sync=4
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 0 closed, sync=4
Oct 21 02:59:59 pwall01 sbd[512]: heartbeat duplication failed at step 4
Oct 21 02:59:59 pwall01 sbd[512]: heartbeat duplication failed at step 4
Oct 21 02:59:59 pwall01 sbd[512]: write: Broken pipe
Oct 21 02:59:59 pwall01 sbd[512]: connection_write: active channel 1 : write failed
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 1 closed, sync=0
Oct 21 02:59:59 pwall01 sbd[512]: timing: query=6.217 reply=6.216 peer query=0.123 peer reply=6.157 secs
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 0 closed, sync=0
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 1 closed, sync=0
Oct 21 02:59:59 pwall01 sbd[512]: write: Broken pipe
Oct 21 02:59:59 pwall01 sbd[512]: connection_write: active channel 1 : write failed
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 1 closed, sync=0
Oct 21 02:59:59 pwall01 sbd[512]: timing: query=6.217 reply=6.216 peer query=0.123 peer reply=6.157 secs
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 0 closed, sync=0
Oct 21 02:59:59 pwall01 sbd[512]: peer channel 1 closed, sync=0
Oct 21 02:59:59 pwall01 sbd[512]: from 6 --- 8 ---> 4
Oct 21 02:59:59 pwall01 sbd[512]: from 6 --- 8 ---> 4
Oct 21 03:00:03 pwall01 sbd[512]: from 4 --- 7 ---> 6
Oct 21 03:00:03 pwall01 sbd[512]: from 4 --- 7 ---> 6
Oct 21 03:00:03 pwall01 sbd[512]: refused peer's request to lock offline
Oct 21 03:00:03 pwall01 sbd[512]: refused peer's request to lock offline
Oct 21 03:00:09 pwall01 sbd[512]: heartbeat failed : unreplied message
Oct 21 03:00:09 pwall01 sbd[512]: timing: query=3.033 reply=3.020 peer query=3.020 peer reply=15.962 secs
Oct 21 03:00:09 pwall01 sbd[512]: peer channel 0 closed, backup channel 1 activated
Oct 21 03:00:09 pwall01 sbd[512]: peer channel 0 closed, sync=1
Oct 21 03:00:09 pwall01 sbd[512]: heartbeat duplication failed at step 1
Oct 21 03:00:09 pwall01 sbd[512]: peer channel 1 closed, sync=0
Oct 21 03:00:09 pwall01 sbd[512]: from 6 --- 8 ---> 4
Oct 21 03:00:09 pwall01 sbd[512]: heartbeat failed : unreplied message
Oct 21 03:00:09 pwall01 sbd[512]: timing: query=3.033 reply=3.020 peer query=3.020 peer reply=15.962 secs
Oct 21 03:00:09 pwall01 sbd[512]: peer channel 0 closed, backup channel 1 activated
Oct 21 03:00:09 pwall01 sbd[512]: peer channel 0 closed, sync=1
Oct 21 03:00:09 pwall01 sbd[512]: heartbeat duplication failed at step 1
Oct 21 03:00:09 pwall01 sbd[512]: peer channel 1 closed, sync=0
Oct 21 03:00:09 pwall01 sbd[512]: from 6 --- 8 ---> 4
Oct 21 03:00:13 pwall01 sbd[512]: from 4 --- 7 ---> 6
Oct 21 03:00:13 pwall01 sbd[512]: refused peer's request to lock offline
Oct 21 03:00:13 pwall01 sbd[512]: from 4 --- 7 ---> 6
Oct 21 03:00:13 pwall01 sbd[512]: refused peer's request to lock offline
Oct 21 03:00:18 pwall01 sbd[512]: heartbeat duplicated
Oct 21 03:00:18 pwall01 sbd[512]: heartbeat duplicated
Oct 21 03:00:40 pwall01 sbd[512]: both daemons online
Oct 21 03:00:40 pwall01 sbd[512]: both daemons online
Oct 21 03:00:40 pwall01 sbd[512]: from 6 --- 1 ---> 3
Oct 21 03:00:40 pwall01 sbd[512]: from 6 --- 1 ---> 3
Oct 21 03:00:44 pwall01 sbd[512]: went offline, 11 ---> 3
Oct 21 03:00:44 pwall01 sbd[512]: went offline, 11 ---> 3
Oct 21 05:00:49 pwall01 unix: FW-1: log message queue is full
Oct 21 06:12:09 pwall01 sbd[512]: from 3 --- 2 ---> 6
Oct 21 06:12:09 pwall01 sbd[512]: from 3 --- 2 ---> 6
Oct 21 06:12:13 pwall01 sbd[512]: went online, 14 ---> 6
Oct 21 06:12:13 pwall01 sbd[512]: went online, 14 ---> 6
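
In case it helps anyone compare their own logs against the excerpt above, here is a rough Python sketch that pulls the numbers out of the sbd "timing:" lines and lists the ones with a long peer reply. It assumes one syslog entry per line in the format shown in the excerpt. The function names and the 5-second threshold are made up for illustration; the meaning of the individual timing fields is not documented in this thread, so the script only extracts the values and does not interpret them.

```python
import re

# Matches the sbd "timing:" lines as they appear in the excerpt.
# Field meanings are not documented in this thread; we only extract the numbers.
TIMING_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+\d\d:\d\d:\d\d)\s+(?P<host>\S+)\s+sbd\[\d+\]:\s+"
    r"timing:\s+query=(?P<query>[\d.]+)\s+reply=(?P<reply>[\d.]+)\s+"
    r"peer query=(?P<peer_query>[\d.]+)\s+peer reply=(?P<peer_reply>[\d.]+)\s+secs"
)


def parse_timing(line):
    """Return a dict for a timing line, or None for any other line."""
    m = TIMING_RE.match(line.strip())
    if m is None:
        return None
    rec = {"stamp": m.group("stamp"), "host": m.group("host")}
    for key in ("query", "reply", "peer_query", "peer_reply"):
        rec[key] = float(m.group(key))
    return rec


def slow_timings(lines, threshold=5.0):
    """Collect timing records whose peer reply exceeds `threshold` seconds.

    The threshold is a hypothetical value chosen for illustration.
    Identical lines (the excerpt logs most entries twice) are reported once.
    """
    seen = set()
    result = []
    for line in lines:
        rec = parse_timing(line)
        if rec is None or rec["peer_reply"] <= threshold:
            continue
        key = tuple(sorted(rec.items()))
        if key not in seen:
            seen.add(key)
            result.append(rec)
    return result
```

To try it on a saved log, something like `slow_timings(open("messages"))` should work, where "messages" is whatever file your syslog writes to.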
