[strongSwan] charon appears to either crash and/or restart during HA takeover

FINLEY, DAVID BRIAN Thu, 02 Dec 2021 10:33:52 -0800

Hello,
Experiencing an issue with version 5.8.0. We have two gateways in an HA 
arrangement. When the current master goes down, the backup takes over ok but 
when the old master comes backup (as the back up) and attempts to the re-sync 
the tunnel list from the new master (took over for the old master when the 
master was rebooted), I see the following msgs in the charon.log:


Nov 30 04:10:40.571 01[CFG] HA: accepted new connection request from 
fd00:2600:2600:115:1::1  # The rebooted old master is now coming back up
Nov 30 04:10:40.571 01[CFG] HA: successfully accepted incoming connection       
                                  # The rebooted old master starts its 
strongswan HA "sync" connection to us (the new master)
Nov 30 04:10:41.569 05[CFG] resyncing HA segment 1
Nov 30 04:10:41.995 05[CFG] HA: failed to receive 4 bytes : Connection reset by 
peer
Nov 30 04:10:41.995 05[CFG] HA: failed to read size (4 bytes)
Nov 30 04:10:41.995 05[CFG] HA: pulling message failed
Nov 30 04:10:41.995 05[DMN] thread 5 received 11                                
                                                       # charon crashes??
Nov 30 04:10:41.995 06[CFG] HA: failed to send 17284930 bytes: Bad file 
descriptor
Nov 30 04:10:41.995 01[CFG] HA: accepted new connection request from 
fd00:2600:2600:115:1::1
Nov 30 04:10:41.995 01[CFG] HA: successfully accepted incoming connection
Nov 30 04:10:42.615 00[LIB] openssl FIPS mode(1) - enabled                      
                                                # charon restarts ???
Nov 30 04:10:42.622 00[CFG] crl caching to /etc/ike/swanctl/x509crl enabled
Nov 30 04:10:42.622 00[CFG] loaded 0 RADIUS server configurations

3 questions:

  1.  The crash appears to have been caused by the "pulling message failed 
condition?
  2.  I don't find a core anywhere, should charon have generated a core as 
result of the "thread 5 received 11" condition?
  3.  Are there significant HA fixes/enhancements in strongswan 5.9.x ? Maybe 
we should upgrade?

thx

Dave Finley
[email protected]<mailto:[email protected]>
(630) 719-4391  (desk)
(630) 740-5198  (mobile)

[strongSwan] charon appears to either crash and/or restart during HA takeover

Reply via email to