Hello everyone,
This is the first time I have used a mailing list. I sent my first message
before my subscription was confirmed, and I am not sure you received it,
so I am sending it again to be sure.
Sorry if you receive it twice.
Here is my problem:
I am experimenting with DRBD and heartbeat.
I am a beginner in Linux.
I followed several tutorials to set up DRBD and heartbeat.
DRBD works perfectly.
I am using the V1-style configuration (haresources), so it is easy to read.
When I start heartbeat via:
/etc/init.d/heartbeat restart
I get the following messages:
Stopping High-Availability services:
Done.
Waiting to allow resource takeover to complete:
Done.
Starting High-Availability services:
2010/05/31_22:12:33 INFO: Resource is stopped
Done.
And the IP alias is not created.
My structure looks like this:
LAN0 on eth0: 192.168.0.0/24 # user LAN + heartbeat
LAN1 on eth1: 192.168.1.0/30 # DRBD LAN: working.
LAN2 on eth2: 192.168.2.0/28 # application servers LAN: not yet used.
Heartbeat has 2 nodes: frontal1 and frontal2
frontal1 | eth1------DRBD------eth1 | frontal2
------------                      ------------
    eth0                              eth0
     |                                 |
     |_________________________________|
                 heartbeat
frontal1 eth0: 192.168.0.2
frontal2 eth0: 192.168.0.3
heartbeat ip alias eth0:0: 192.168.0.1
frontal1 eth1: 192.168.1.1
frontal2 eth1: 192.168.1.2
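To double-check the addressing plan above, here is a minimal Python sketch using the standard ipaddress module. The variable and function names are only illustrative; the addresses are the ones listed above. It checks that the heartbeat alias 192.168.0.1 lies in the same /24 as both eth0 addresses, and lists the usable host addresses of the /30 used for DRBD.

```python
import ipaddress

# Networks as described in the layout above.
LAN0 = ipaddress.ip_network("192.168.0.0/24")  # users + heartbeat
LAN1 = ipaddress.ip_network("192.168.1.0/30")  # DRBD replication link

# Addresses on the heartbeat LAN (labels are illustrative).
lan0_hosts = {
    "frontal1 eth0": "192.168.0.2",
    "frontal2 eth0": "192.168.0.3",
    "alias eth0:0": "192.168.0.1",
}


def outside(network, hosts):
    """Return the labels whose address is not inside the given network."""
    return [name for name, addr in hosts.items()
            if ipaddress.ip_address(addr) not in network]


lan0_problems = outside(LAN0, lan0_hosts)
drbd_usable = [str(h) for h in LAN1.hosts()]

print("addresses outside LAN0:", lan0_problems)
print("usable DRBD addresses:", drbd_usable)
```

An empty first list means the alias and both node addresses share one subnet, so the alias should be reachable from the user LAN once it is configured.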
My software is Debian 5.0 Lenny and heartbeat 2.1.3-6lenny4.
I followed the following guides without any success:
http://howtoforge.net/highly-available-nfs-server-using-drbd-and-heartbeat-on-debian-5.0-lenny
http://doc.ubuntu-fr.org/tutoriel/mirroring_sur_deux_serveurs
http://www.drbd.org/users-guide/ch-heartbeat.html
http://www.linux-ha.org/doc/
Below are my logs and command results.
When I run BasicSanityCheck (2), I see a problem with IPaddr.
But when I launch the IPaddr or IPaddr2 script manually, the IP alias is
created and available on the network.
I have looked on a few forums about this and have not found a solution to my
problem.
Thanks for your help.
(1) vim /etc/ha.d/ha.cf
Code:
autojoin none
mcast eth0 239.0.0.43 694 1 0
warntime 5
deadtime 5
initdead 15
keepalive 2
node frontal1
node frontal2
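As a side check on the timers in this ha.cf, here is a minimal Python sketch. The three rules it tests are my assumptions, taken from the usual guidance (keepalive below warntime, warntime below deadtime, initdead at least twice deadtime), and the function names are hypothetical. The parser also assumes plain values in seconds, with no "ms" suffix.

```python
HA_CF = """\
autojoin none
mcast eth0 239.0.0.43 694 1 0
warntime 5
deadtime 5
initdead 15
keepalive 2
node frontal1
node frontal2
"""

TIMER_NAMES = ("keepalive", "warntime", "deadtime", "initdead")


def parse_timers(text):
    """Collect the timing directives as integers (assumed to be seconds)."""
    timers = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) == 2 and parts[0] in TIMER_NAMES:
            timers[parts[0]] = int(parts[1])
    return timers


def check_timers(t):
    """Return human-readable remarks for timers that break the assumed rules."""
    remarks = []
    if not t["keepalive"] < t["warntime"]:
        remarks.append("keepalive should be smaller than warntime")
    if not t["warntime"] < t["deadtime"]:
        remarks.append("warntime should be smaller than deadtime")
    if not t["initdead"] >= 2 * t["deadtime"]:
        remarks.append("initdead should be at least twice deadtime")
    return remarks


timers = parse_timers(HA_CF)
for remark in check_timers(timers):
    print(remark)
```

With the values above, only the warntime/deadtime rule is flagged. With both set to 5, the "late heartbeat" warning cannot fire before the node is declared dead. This is probably unrelated to the missing alias.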
(2) sh /usr/share/heartbeat/BasicSanityCheck
Code:
RTNETLINK answers: Network is unreachable
Using interface: eth0
Should not run tests with heartbeat already running.
Starting base64 and md5 algorithm tests
base64 and md5 algorithm tests succeeded.
Starting Resource Agent tests
Testing RA: Dummy
Testing RA: IPaddr
ERROR: IPaddr RA failed
Starting IPC tests
That's weird. Heartbeat seems to be running...
Stopping heartbeat
Stopping High-Availability services:
Done.
Starting heartbeat
Starting High-Availability services:
2010/05/31_22:16:04 INFO: Resource is stopped
Done.
Does not look like we ARPed the address
Looks like monitor operation failed
Reloading heartbeat
Reloading heartbeat
Stopping heartbeat
Stopping High-Availability services:
Done.
Checking STONITH basic sanity.
Performing apphbd success case tests
Performing apphbd failure case tests
Starting LRM tests
Starting heartbeat
Starting High-Availability services:
2010/05/31_22:18:25 INFO: Resource is stopped
Done.
(3) sh /usr/share/heartbeat/ResourceManager listkeys frontal1
192.168.0.1
(4) sh /usr/share/heartbeat/ResourceManager listkeys frontal2
(5) ip addr show
Code:
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue state UNKNOWN
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UNKNOWN qlen 1000
    link/ether 00:0c:29:cb:86:45 brd ff:ff:ff:ff:ff:ff
    inet 192.168.0.2/24 brd 192.168.0.255 scope global eth0
3: eth1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UNKNOWN qlen 1000
    link/ether 00:0c:29:cb:86:4f brd ff:ff:ff:ff:ff:ff
    inet 192.168.1.1/30 brd 192.168.1.3 scope global eth1
4: eth2: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN qlen 1000
    link/ether 00:0c:29:cb:86:59 brd ff:ff:ff:ff:ff:ff
(6) /etc/ha.d/resource.d/IPaddr 192.168.0.1 start
Code:
2010/05/31_22:30:37 INFO: Success
(7) /etc/ha.d/resource.d/IPaddr2 192.168.0.1 start
Code:
2010/05/31_22:30:24 INFO: Using calculated nic for 192.168.0.1: eth0
2010/05/31_22:30:24 INFO: Using calculated netmask for 192.168.0.1: 255.255.255.0
2010/05/31_22:30:25 INFO: eval ifconfig eth0:0 192.168.0.1 netmask 255.255.255.0 broadcast 192.168.0.255
2010/05/31_22:30:25 INFO: Success
INFO: Success
(8) cat /etc/ha.d/haresources
frontal1 IPaddr2::192.168.0.1/24/eth0/192.168.0.255
OR frontal1 drbddisk::r0 Filesystem::/dev/drbd1::/serveur::ext3 dhcp3-server
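For readers less familiar with the V1 syntax, here is a minimal Python sketch of how I understand a haresources line to be read: the first word is the preferred node, and each following word is a resource script whose arguments are separated by "::". The function names are hypothetical and this is not heartbeat's own parser. The "start" form matches the commands that appear in my heartbeat log, apart from the script directory.

```python
def parse_haresources_line(line):
    """Split one haresources line into (node, [(script, [args, ...]), ...])."""
    node, *resources = line.split()
    parsed = []
    for resource in resources:
        script, *args = resource.split("::")
        parsed.append((script, args))
    return node, parsed


def start_commands(parsed):
    """Build the 'script arg1 arg2 ... start' form for each resource, in order."""
    return [" ".join([script, *args, "start"]) for script, args in parsed]


line = "frontal1 drbddisk::r0 Filesystem::/dev/drbd1::/serveur::ext3 dhcp3-server"
node, parsed = parse_haresources_line(line)

print("node:", node)
for command in start_commands(parsed):
    print(command)
```

Resources in one group are started from left to right, which is why the order on the line matters: drbddisk, then Filesystem, then the init script.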
(9) cat /var/log/heartbeat/log
heartbeat[7179]: 2010/05/31_22:38:40 info: Version 2 support: false
heartbeat[7179]: 2010/05/31_22:38:40 WARN: Deprecated 'legacy' auto_failback option selected.
heartbeat[7179]: 2010/05/31_22:38:40 WARN: Please convert to 'auto_failback on'.
heartbeat[7179]: 2010/05/31_22:38:40 WARN: See documentation for conversion details.
heartbeat[7179]: 2010/05/31_22:38:40 WARN: Logging daemon is disabled --enabling logging daemon is recommended
heartbeat[7179]: 2010/05/31_22:38:40 info: **************************
heartbeat[7179]: 2010/05/31_22:38:40 info: Configuration validated. Starting heartbeat 2.1.3
heartbeat[7180]: 2010/05/31_22:38:40 info: heartbeat: version 2.1.3
heartbeat[7180]: 2010/05/31_22:38:40 info: Heartbeat generation: 1275221613
heartbeat[7180]: 2010/05/31_22:38:40 info: glib: UDP multicast heartbeat started for group 239.0.0.43 port 694 interface eth0 (ttl=1 loop=0)
heartbeat[7180]: 2010/05/31_22:38:40 info: G_main_add_TriggerHandler: Added signal manual handler
heartbeat[7180]: 2010/05/31_22:38:40 info: G_main_add_TriggerHandler: Added signal manual handler
heartbeat[7180]: 2010/05/31_22:38:40 info: G_main_add_SignalHandler: Added signal handler for signal 17
heartbeat[7180]: 2010/05/31_22:38:40 info: Local status now set to: 'up'
heartbeat[7180]: 2010/05/31_22:38:41 info: Link frontal2:eth0 up.
heartbeat[7180]: 2010/05/31_22:38:41 info: Status update for node frontal2: status active
harc[7188]: 2010/05/31_22:38:41 info: Running /etc/ha.d/rc.d/status status
heartbeat[7180]: 2010/05/31_22:38:42 info: Comm_now_up(): updating status to active
heartbeat[7180]: 2010/05/31_22:38:42 info: Local status now set to: 'active'
IPaddr2[7242]: 2010/05/31_22:38:42 INFO: Resource is stopped
heartbeat[7204]: 2010/05/31_22:38:42 info: Local Resource acquisition completed.
harc[7337]: 2010/05/31_22:39:06 info: Running /etc/ha.d/rc.d/ip-request-resp ip-request-resp
ip-request-resp[7337]: 2010/05/31_22:39:06 received ip-request-resp IPaddr2::192.168.0.1/24/eth0/192.168.0.255 OK no
ResourceManager[7356]: 2010/05/31_22:39:06 info: Acquiring resource group: frontal1 IPaddr2::192.168.0.1/24/eth0/192.168.0.255
IPaddr2[7382]: 2010/05/31_22:39:06 INFO: Resource is stopped
ResourceManager[7356]: 2010/05/31_22:39:06 info: Running /etc/ha.d/resource.d/IPaddr2 192.168.0.1/24/eth0/192.168.0.255 start
IPaddr2[7491]: 2010/05/31_22:39:07 INFO: ip -f inet addr add 192.168.0.1/24 brd 192.168.0.255 dev eth0
IPaddr2[7491]: 2010/05/31_22:39:07 INFO: ip link set eth0 up
IPaddr2[7491]: 2010/05/31_22:39:07 INFO: /usr/lib/heartbeat/send_arp -i 200 -r 5 -p /var/run/heartbeat/rsctmp/send_arp/send_arp-192.168.0.1 eth0 192.168.0.1 auto not_used not_used
IPaddr2[7462]: 2010/05/31_22:39:07 INFO: Success
heartbeat[7180]: 2010/05/31_22:39:07 info: Initial resource acquisition complete(ip-request-resp)
harc[7549]: 2010/05/31_22:39:07 info: Running /etc/ha.d/rc.d/ip-request-resp ip-request-resp
ip-request-resp[7549]: 2010/05/31_22:39:07 received ip-request-resp drbddisk::r0 OK no
ResourceManager[7568]: 2010/05/31_22:39:07 info: Acquiring resource group: frontal1 drbddisk::r0 Filesystem::/dev/drbd1::/serveur::ext3 dhcp3-server tftpd-hpa
ResourceManager[7568]: 2010/05/31_22:39:07 info: Running /etc/ha.d/resource.d/drbddisk r0 start
Filesystem[7633]: 2010/05/31_22:39:07 INFO: Resource is stopped
ResourceManager[7568]: 2010/05/31_22:39:07 info: Running /etc/ha.d/resource.d/Filesystem /dev/drbd1 /serveur ext3 start
Filesystem[7711]: 2010/05/31_22:39:07 INFO: Running start for /dev/drbd1 on /serveur
Filesystem[7700]: 2010/05/31_22:39:07 INFO: Success
ResourceManager[7568]: 2010/05/31_22:39:07 info: Running /etc/init.d/dhcp3-server start
ResourceManager[7568]: 2010/05/31_22:39:09 info: Running /etc/init.d/tftpd-hpa start
ResourceManager[7568]: 2010/05/31_22:39:09 ERROR: Return code 71 from /etc/init.d/tftpd-hpa
ResourceManager[7568]: 2010/05/31_22:39:09 CRIT: Giving up resources due to failure of tftpd-hpa
ResourceManager[7568]: 2010/05/31_22:39:09 info: Releasing resource group: frontal1 drbddisk::r0 Filesystem::/dev/drbd1::/serveur::ext3 dhcp3-server tftpd-hpa
ResourceManager[7568]: 2010/05/31_22:39:09 info: Running /etc/init.d/tftpd-hpa stop
ResourceManager[7568]: 2010/05/31_22:39:09 info: Running /etc/init.d/dhcp3-server stop
ResourceManager[7568]: 2010/05/31_22:39:09 info: Running /etc/ha.d/resource.d/Filesystem /dev/drbd1 /serveur ext3 stop
Filesystem[7898]: 2010/05/31_22:39:09 INFO: Running stop for /dev/drbd1 on /serveur
Filesystem[7898]: 2010/05/31_22:39:09 INFO: Trying to unmount /serveur
Filesystem[7898]: 2010/05/31_22:39:10 INFO: unmounted /serveur successfully
Filesystem[7887]: 2010/05/31_22:39:10 INFO: Success
ResourceManager[7568]: 2010/05/31_22:39:10 info: Running /etc/ha.d/resource.d/drbddisk r0 sto
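Since the log is long, a small filter helps pick out the entries that matter. This is a minimal Python sketch, assuming the "process[pid]: timestamp LEVEL: message" layout seen in the log above, with each entry on a single line. The regular expression and the function name are my own, and the sample is a few entries copied from that log.

```python
import re

LOG = """\
IPaddr2[7462]: 2010/05/31_22:39:07 INFO: Success
ResourceManager[7568]: 2010/05/31_22:39:09 info: Running /etc/init.d/tftpd-hpa start
ResourceManager[7568]: 2010/05/31_22:39:09 ERROR: Return code 71 from /etc/init.d/tftpd-hpa
ResourceManager[7568]: 2010/05/31_22:39:09 CRIT: Giving up resources due to failure of tftpd-hpa
Filesystem[7887]: 2010/05/31_22:39:10 INFO: Success
"""

# process[pid]: timestamp LEVEL: message
ENTRY = re.compile(
    r"^(?P<proc>[\w-]+)\[(?P<pid>\d+)\]: (?P<time>\S+) (?P<level>[A-Za-z]+): (?P<msg>.*)$"
)


def problems(text):
    """Return (time, process, message) for every ERROR or CRIT entry."""
    found = []
    for line in text.splitlines():
        match = ENTRY.match(line)
        if match and match.group("level").upper() in ("ERROR", "CRIT"):
            found.append((match.group("time"), match.group("proc"), match.group("msg")))
    return found


for when, proc, message in problems(LOG):
    print(when, proc, message)
```

In the excerpt above, the only ERROR and CRIT entries are the two about tftpd-hpa at 22:39:09, right before the resources are released.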
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems