Hello list
I'm trying to setup a 3 nodes Cluster with 2 failover Domain for an HA
mail solution.
I want 1 run active for the Imap server in the Imap Failover domain , 1
node active for the Smtp in the Smtp Failover domain and the 3rd in the
2 failover domain as a backup node.
I run Centos 5.3
My fence device is a wti power switch
My cluster.conf is in attachement
My SMTP service is composed of:
1 IP
1 amavisd scritp
1 postfix script
2 NFS mount for postfix and amavis
If I manually kill the postfix master process (to simulate a crash), my
node is not fence and the logs said:
Jul 6 10:00:40 centos-smtp1 clurgmgrd: [4228]: <info> Executing
/etc/init.d/postfix status
Jul 6 10:00:40 centos-smtp1 clurgmgrd: [4228]: <err> script:postfix:
status of /etc/init.d/postfix failed (returned 3)
Jul 6 10:00:40 centos-smtp1 clurgmgrd[4228]: <notice> status on script
"postfix" returned 1 (generic error)
Jul 6 10:00:40 centos-smtp1 clurgmgrd[4228]: <notice> Stopping service
service:Postfix
Jul 6 10:00:40 centos-smtp1 clurgmgrd: [4228]: <info> Executing
/etc/init.d/amavisd stop
Jul 6 10:00:40 centos-smtp1 kernel: do_vfs_lock: VFS is out of sync
with lock manager!
Jul 6 10:00:40 centos-smtp1 last message repeated 8 times
Jul 6 10:00:41 centos-smtp1 clurgmgrd: [4228]: <info> Executing
/etc/init.d/postfix stop
Jul 6 10:00:41 centos-smtp1 clurgmgrd: [4228]: <err> script:postfix:
stop of /etc/init.d/postfix failed (returned 1)
Jul 6 10:00:41 centos-smtp1 clurgmgrd[4228]: <notice> stop on script
"postfix" returned 1 (generic error)
Jul 6 10:00:41 centos-smtp1 clurgmgrd: [4228]: <info> Removing IPv4
address 195.83.126.201/24 from bond0
Jul 6 10:00:41 centos-smtp1 avahi-daemon[3552]: Withdrawing address
record for 195.83.126.201 on bond0.
Jul 6 10:00:51 centos-smtp1 clurgmgrd: [4228]: <info> unmounting
/var/lib/amavis
Jul 6 10:00:51 centos-smtp1 clurgmgrd: [4228]: <info> unmounting
/var/spool/postfix
Jul 6 10:00:51 centos-smtp1 clurgmgrd[4228]: <crit> #12: RG
service:Postfix failed to stop; intervention required
Jul 6 10:00:51 centos-smtp1 clurgmgrd[4228]: <notice> Service
service:Postfix is failed
Jul 6 10:00:52 centos-smtp1 ntpd[3322]: synchronized to 195.83.126.119,
stratum 1
Clustat said:
Cluster Status for cluster-test @ Mon Jul 6 10:02:39 2009
Member Status: Quorate
Member Name ID Status
------ ---- ---- ------
centos-imap1.ill.fr 1
Online, Local, rgmanager
centos-imap2.ill.fr 2
Online, rgmanager
centos-smtp1.ill.fr 3
Online, rgmanager
/dev/disk/by-id/scsi-360a98000567247514634507447594661-part1 0
Online, Quorum Disk
Service Name Owner
(Last) State
------- ---- -----
------ -----
service:Imap
centos-imap2.ill.fr started
service:Postfix
(centos-smtp1.ill.fr) failed
So I have to disable the Postfix servcie with:
clusvcadm -d Postfix
and re-enable
clusvcadm -e Postfix
Could you explain my why my original smtp node is not fenced and why my
service is not start on the 2nd node ???
Is there a way to force the fencing ???
--
ARMANET Stephane
Division Projet Technique
Service Informatique
Groupe Infrastructure
Institut Laue langevin
<?xml version="1.0"?>
<cluster alias="cluster-test" config_version="57" name="cluster-test">
<fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
<clusternodes>
<clusternode name="centos-imap1.test.fr" nodeid="1" votes="1">
<fence>
<method name="1">
<device name="psu2" option="off" port="1"/>
<device name="psu2" option="off" port="5"/>
</method>
<method name="2">
<device name="centos_manual-fence" nodename="centos-imap1.test.fr"/>
</method>
</fence>
</clusternode>
<clusternode name="centos-imap2.test.fr" nodeid="2" votes="1">
<fence>
<method name="1">
<device name="psu2" option="off" port="2"/>
<device name="psu2" option="off" port="6"/>
</method>
<method name="2">
<device name="centos_manual-fence" nodename="centos-imap2.test.fr"/>
</method>
</fence>
</clusternode>
<clusternode name="centos-smtp1.test.fr" nodeid="3" votes="1">
<fence>
<method name="1">
<device name="psu1" option="off" port="1"/>
<device name="psu1" option="off" port="5"/>
</method>
<method name="2">
<device name="centos_manual-fence" nodename="centos-smtp1.test.fr"/>
</method>
</fence>
</clusternode>
</clusternodes>
<cman/>
<fencedevices>
<fencedevice agent="fence_manual" name="centos_manual-fence"/>
<fencedevice agent="fence_wti" ipaddr="192.168.0.215" name="psu1" passwd="passwd"/>
<fencedevice agent="fence_wti" ipaddr="192.168.0.216" name="psu2" passwd="passwd"/>
</fencedevices>
<rm log_facility="local4" log_level="7">
<failoverdomains>
<failoverdomain name="imap-FOD" nofailback="0" ordered="1" restricted="1">
<failoverdomainnode name="centos-imap1.test.fr" priority="1"/>
<failoverdomainnode name="centos-imap2.test.fr" priority="2"/>
</failoverdomain>
<failoverdomain name="smtp-FOD" ordered="1" restricted="1">
<failoverdomainnode name="centos-smtp1.test.fr" priority="1"/>
<failoverdomainnode name="centos-imap2.test.fr" priority="2"/>
</failoverdomain>
</failoverdomains>
<resources>
<netfs export="/vol/volSMTP/postfix" force_unmount="1" fstype="nfs" host="romulus.test.fr" mountpoint="/var/spool/postfix" name="NFS-postfix" options="rw,nolock"/>
<fs device="/dev/mapper/vgMail-lvMailboxes" force_fsck="1" force_unmount="1" fsid="34650" fstype="ext3" mountpoint="/var/spool/imap" name="lvMailboxes" options="commit=1" self_fence="1"/>
<fs device="/dev/mapper/vgMail-lvDBMail" force_fsck="1" force_unmount="1" fsid="4277" fstype="ext3" mountpoint="/var/lib/imap" name="lvDBMail" options="commit=1" self_fence="1"/>
<netfs export="/vol/volSMTP/amavis" force_unmount="1" fstype="nfs" host="romulus.test.fr" mountpoint="/var/lib/amavis" name="NFS Amavis" options=""/>
</resources>
<service autostart="1" domain="imap-FOD" name="Imap" recovery="relocate">
<ip address="192.168.0.198/24" monitor_link="1">
<script file="/etc/init.d/cyrus-imapd" name="Cyrus-imapd"/>
</ip>
<fs ref="lvMailboxes"/>
<fs ref="lvDBMail"/>
</service>
<service autostart="1" domain="smtp-FOD" name="Postfix" recovery="relocate">
<ip address="192.168.0.201" monitor_link="1">
<script file="/etc/init.d/postfix" name="postfix"/>
<script file="/etc/init.d/amavisd" name="amavisd"/>
</ip>
<netfs ref="NFS-postfix"/>
<netfs ref="NFS Amavis"/>
</service>
</rm>
<quorumd interval="2" label="QDISK" min_score="1" tko="5" votes="2">
<heuristic interval="2" program="/bin/ping -c 1 -t 1 192.168.0.254" score="1"/>
<heuristic interval="5" program="/bin/ping -c 3 -t 1 192.168.0.254" score="1"/>
</quorumd>
<totem consensus="4800" join="60" token="25000" token_retransmits_before_loss_const="20"/>
</cluster>
--
Linux-cluster mailing list
[email protected]
https://www.redhat.com/mailman/listinfo/linux-cluster