Re: [Linux-HA] Heartbeat packages for Redhat-7
Thank you Lars for clarifying. I will see what I can use in our environment.

Thank you again,
Yogi

On Fri, Apr 3, 2015 at 11:03 PM, Lars Ellenberg lars.ellenb...@linbit.com wrote:
> On Wed, Apr 01, 2015 at 12:16:38PM +0100, Yogendramummaneni Prasad wrote:
>> Hello,
>>
>> At the moment, we are using heartbeat on RedHat 5.8 with the packages below:
>>
>> [root@p118278vaps2011 ~]# rpm -qa | grep heartbeat
>> heartbeat-2.1.4-11.el5
>> heartbeat-stonith-2.1.4-11.el5
>> heartbeat-pils-2.1.4-11.el5
>>
>> We are now planning to upgrade the OS to RedHat 7.0. I could not find the same heartbeat packages for RedHat 7.0 on the internet. Could you please confirm whether heartbeat packages are available for RedHat 7.0?
>
> Those versions are almost seven years old.
>
> You can use heartbeat 3.0.6 (if you only use haresources mode).
>
> If you use crm mode, you need to realize that the crm component was split off into its own project years ago: Pacemaker.
>
> For crm mode, if you want to stick with heartbeat, use heartbeat 3.0.6 and Pacemaker 1.1.12 (with LINBIT patches), or Pacemaker 1.1.13 (soon to be released, including those patches).
>
> If you don't have any particular reason to keep using heartbeat, the recommended cluster stack is Corosync + Pacemaker, which is what you get with the RHEL 7 native HA cluster.
>
> For more about Pacemaker, visit clusterlabs.org, subscribe to us...@clusterlabs.org, or join #clusterlabs on freenode.
>
> --
> : Lars Ellenberg
> : http://www.LINBIT.com | Your Way to High Availability
> : DRBD, Linux-HA and Pacemaker support and consulting
>
> DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.

___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

The Linux-HA mailing list is closing down. Please subscribe to us...@clusterlabs.org instead.
http://clusterlabs.org/mailman/listinfo/users
Re: [Linux-HA] Heartbeat packages for Redhat-7
On 2015-04-03 17:03, Lars Ellenberg wrote:
> You can use heartbeat 3.0.6 (if you only use haresources mode).

You can google for the ticket number, but basically the EPEL heartbeat maintainer replied to my RFA with "I don't use heartbeat anymore", so no. I meant to post that here but forgot.

So there is no heartbeat RPM for EL7 in the usual repos. Does Clusterlabs have one?

Dimitri
Re: [Linux-HA] Heartbeat packages for Redhat-7
On Wed, Apr 01, 2015 at 12:16:38PM +0100, Yogendramummaneni Prasad wrote:
> Hello,
>
> At the moment, we are using heartbeat on RedHat 5.8 with the packages below:
>
> [root@p118278vaps2011 ~]# rpm -qa | grep heartbeat
> heartbeat-2.1.4-11.el5
> heartbeat-stonith-2.1.4-11.el5
> heartbeat-pils-2.1.4-11.el5
>
> We are now planning to upgrade the OS to RedHat 7.0. I could not find the same heartbeat packages for RedHat 7.0 on the internet. Could you please confirm whether heartbeat packages are available for RedHat 7.0?

Those versions are almost seven years old.

You can use heartbeat 3.0.6 (if you only use haresources mode).

If you use crm mode, you need to realize that the crm component was split off into its own project years ago: Pacemaker.

For crm mode, if you want to stick with heartbeat, use heartbeat 3.0.6 and Pacemaker 1.1.12 (with LINBIT patches), or Pacemaker 1.1.13 (soon to be released, including those patches).

If you don't have any particular reason to keep using heartbeat, the recommended cluster stack is Corosync + Pacemaker, which is what you get with the RHEL 7 native HA cluster.

For more about Pacemaker, visit clusterlabs.org, subscribe to us...@clusterlabs.org, or join #clusterlabs on freenode.

--
: Lars Ellenberg
: http://www.LINBIT.com | Your Way to High Availability
: DRBD, Linux-HA and Pacemaker support and consulting

DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
Re: [Linux-HA] Heartbeat packages for Redhat-7
On 03/04/15 06:03 PM, Lars Ellenberg wrote:
> On Wed, Apr 01, 2015 at 12:16:38PM +0100, Yogendramummaneni Prasad wrote:
>> Could you please confirm whether heartbeat packages are available for RedHat 7.0?
>
> [snip: full reply quoted earlier in this thread]
>
> If you don't have any particular reason to keep using heartbeat, the recommended cluster stack is Corosync + Pacemaker, which is what you get with the RHEL 7 native HA cluster.

To expand on / provide background to Lars' answer:

https://alteeve.ca/w/History_of_HA_Clustering

Also, please subscribe to the Clusterlabs mailing list, as shown in Lars' footer.

--
Digimer
Papers and Projects: https://alteeve.ca/w/
"What if the cure for cancer is trapped in the mind of a person without access to education?"
Re: [Linux-HA] heartbeat
Hi Dimitri,

Yes, there are 4 pairs, but they are all active. When a node fails, the other one in the pair just takes everything over. A different HA stack is not an option; it has to be heartbeat.

I noticed something called ethmonitor; I can probably monitor an IB connection with it (it has IPoIB on it).

On 01/20/2015 12:51 PM, Dimitri Maziuk wrote:
> On 01/20/2015 01:34 PM, Ron Croonenberg wrote:
>> Hello,
>>
>> I have an Ethernet connection that connects all hosts in a cluster, and the nodes also have an IB connection. I want the failover host to take over when an IB connection goes down on a host. Is there an example of how to do this? (I am using IPMI for shutting down hosts etc.)
>>
>> The cluster I am using has 8 nodes and I want to do failover in pairs of two. In the ha.cf file, do I mention all the hosts, or just the host and its failover partner, per pair?
>
> Do you have 4 separate active-passive pairs or a cluster of 8 nodes? If it's the latter, I think you want pacemaker, not heartbeat.
>
> Dunno what pacemaker might have for monitoring an IB connection; with heartbeat R1 I'd do something like grep for "LinkUp" in the output of ibstat.
Re: [Linux-HA] heartbeat
On 01/20/2015 01:34 PM, Ron Croonenberg wrote:
> Hello,
>
> I have an Ethernet connection that connects all hosts in a cluster, and the nodes also have an IB connection. I want the failover host to take over when an IB connection goes down on a host. Is there an example of how to do this? (I am using IPMI for shutting down hosts etc.)
>
> The cluster I am using has 8 nodes and I want to do failover in pairs of two. In the ha.cf file, do I mention all the hosts, or just the host and its failover partner, per pair?

Do you have 4 separate active-passive pairs or a cluster of 8 nodes? If it's the latter, I think you want pacemaker, not heartbeat.

Dunno what pacemaker might have for monitoring an IB connection; with heartbeat R1 I'd do something like grep for "LinkUp" in the output of ibstat.

--
Dimitri Maziuk
Programmer/sysadmin
BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu
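Dimitri's suggestion (grep the output of ibstat for "LinkUp") could be sketched as a small shell helper. This is a hypothetical script, not from the thread; the exact "Physical state" field name may vary between OFED versions, so verify it against your own `ibstat` output before wiring it into a monitor:

```shell
#!/bin/sh
# Sketch of an IB link check for a heartbeat R1 setup, per the
# suggestion above. The "LinkUp" string is what ibstat typically
# prints for an active port; treat the field name as an assumption.

ib_link_up() {
    # Reads ibstat-style output on stdin; succeeds if any port is LinkUp.
    grep -q 'LinkUp'
}

# Typical use (only meaningful where ibstat is installed):
#   if ibstat | ib_link_up; then echo "IB link OK"; else echo "IB link DOWN"; fi
```

A monitor like this could then trigger a failover action (e.g. a local heartbeat standby) when the check fails repeatedly.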
Re: [Linux-HA] Heartbeat in Amazon VMs does not create virtual IP address
----- Original Message -----
> Hi,
>
> I installed HeartBeat on CentOS 6.5 on 2 Amazon EC2 machines. This is the

If you have the option, I'd strongly recommend using the Pacemaker+CMAN stack in RHEL 6.5. Red Hat began supporting pacemaker in 6.5, so it should be available to you.

-- Vossel

> version:
>
> [root@ip-10-0-2-68 ha.d]# rpm -qa | grep heartbeat
> heartbeat-libs-3.0.4-2.el6.x86_64
> heartbeat-3.0.4-2.el6.x86_64
> heartbeat-devel-3.0.4-2.el6.x86_64
>
> The floating IP is:
>
> [root@ip-10-0-2-68 ha.d]# cat haresources
> ip-10-0-2-68 10.0.2.70
>
> but it is not created on either machine; it does not matter where I run the takeover or standby commands. What am I missing? Is this even possible?
>
> These are my settings in ha.cf (node 1, then node 2):
>
> logfacility local0
> ucast eth0 10.0.2.69
> auto_failback on
> node ip-10-0-2-68 ip-10-0-2-69
> ping 10.0.2.1
> use_logd yes
>
> logfacility local0
> ucast eth0 10.0.2.68
> auto_failback on
> node ip-10-0-2-68 ip-10-0-2-69
> ping 10.0.2.1
> use_logd yes
>
> This is the output of the route command:
>
> [root@ip-10-0-2-68 ha.d]# route -n
> Kernel IP routing table
> Destination  Gateway   Genmask        Flags Metric Ref Use Iface
> 10.0.2.0     0.0.0.0   255.255.255.0  U     0      0   0   eth0
> 0.0.0.0      10.0.2.1  0.0.0.0        UG    0      0   0   eth0
>
> This is how interface eth0 is set up on machine 1:
>
> [root@ip-10-0-2-68 ha.d]# ifconfig
> eth0  Link encap:Ethernet  HWaddr 12:23:49:EF:3A:53
>       inet addr:10.0.2.68  Bcast:10.0.2.255  Mask:255.255.255.0
>       inet6 addr: fe80::1023:49ff:feef:3a53/64 Scope:Link
>       UP BROADCAST RUNNING MULTICAST  MTU:9001  Metric:1
>       RX packets:269823 errors:0 dropped:0 overruns:0 frame:0
>       TX packets:192305 errors:0 dropped:0 overruns:0 carrier:0
>       collisions:0 txqueuelen:1000
>       RX bytes:167802149 (160.0 MiB)  TX bytes:48341828 (46.1 MiB)
>       Interrupt:247
>
> These are the logs, showing everything going on fine, but when doing ifconfig the interface is not there:
>
> Nov 11 21:37:39 ip-10-0-2-68 heartbeat[14681]: [14681]: WARN: node ip-10-0-2-69: is dead
> Nov 11 21:37:39 ip-10-0-2-68 heartbeat[14681]: [14681]: info: Comm_now_up(): updating status to active
> Nov 11 21:37:39 ip-10-0-2-68 heartbeat[14681]: [14681]: info: Local status now set to: 'active'
> Nov 11 21:37:39 ip-10-0-2-68 heartbeat[14681]: [14681]: WARN: No STONITH device configured.
> Nov 11 21:37:39 ip-10-0-2-68 heartbeat[14681]: [14681]: WARN: Shared disks are not protected.
> Nov 11 21:37:39 ip-10-0-2-68 heartbeat[14681]: [14681]: info: Resources being acquired from ip-10-0-2-69.
> Nov 11 21:37:39 ip-10-0-2-68 mach_down(default)[14769]: info: /usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired
> Nov 11 21:37:39 ip-10-0-2-68 heartbeat[14681]: [14681]: info: mach_down takeover complete.
> Nov 11 21:37:39 ip-10-0-2-68 heartbeat[14681]: [14681]: info: Initial resource acquisition complete (mach_down)
> Nov 11 21:37:39 ip-10-0-2-68 /usr/lib/ocf/resource.d//heartbeat/IPaddr(IPaddr_10.0.2.70)[14845]: INFO: Resource is stopped
> Nov 11 21:37:39 ip-10-0-2-68 heartbeat[14701]: [14701]: info: Local Resource acquisition completed.
> Nov 11 21:37:40 ip-10-0-2-68 /usr/lib/ocf/resource.d//heartbeat/IPaddr(IPaddr_10.0.2.70)[14958]: INFO: Resource is stopped
> Nov 11 21:37:40 ip-10-0-2-68 IPaddr(IPaddr_10.0.2.70)[15057]: INFO: /usr/libexec/heartbeat/send_arp -i 200 -r 5 -p /var/run/resource-agents/send_arp-10.0.2.70 eth0 10.0.2.70 auto not_used not_used
> Nov 11 21:37:40 ip-10-0-2-68 /usr/lib/ocf/resource.d//heartbeat/IPaddr(IPaddr_10.0.2.70)[15064]: INFO: Success
> Nov 11 21:37:49 ip-10-0-2-68 heartbeat[14681]: [14681]: info: Local Resource acquisition completed. (none)
> Nov 11 21:37:49 ip-10-0-2-68 heartbeat[14681]: [14681]: info: local resource transition completed.
> Nov 11 21:38:16 ip-10-0-2-69 heartbeat: [18350]: WARN: node ip-10-0-2-68: is dead
> Nov 11 21:38:16 ip-10-0-2-69 heartbeat: [18350]: info: Comm_now_up(): updating status to active
> Nov 11 21:38:16 ip-10-0-2-69 heartbeat: [18350]: info: Local status now set to: 'active'
> Nov 11 21:38:16 ip-10-0-2-69 heartbeat: [18350]: WARN: No STONITH device configured.
> Nov 11 21:38:16 ip-10-0-2-69 heartbeat: [18350]: WARN: Shared disks are not protected.
> Nov 11 21:38:16 ip-10-0-2-69 heartbeat: [18350]: info: Resources being acquired from ip-10-0-2-68.
> Nov 11 21:38:16 ip-10-0-2-69 heartbeat: [18360]: info: No local resources [/usr/share/heartbeat/ResourceManager listkeys ip-10-0-2-69] to acquire.
> Nov 11 21:38:17 ip-10-0-2-69 /usr/lib/ocf/resource.d//heartbeat/IPaddr(IPaddr_10.0.2.70)[18441]: INFO: Resource is stopped
> Nov 11 21:38:17 ip-10-0-2-69 IPaddr(IPaddr_10.0.2.70)[18537]: INFO: /usr/libexec/heartbeat/send_arp -i 200 -r 5 -p /var/run/resource-agents/send_arp-10.0.2.70 eth0 10.0.2.70 auto not_used not_used
> Nov 11 21:38:17 ip-10-0-2-69
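A likely contributing factor here (an observation of mine, not stated in the thread): inside an EC2 VPC the network fabric does not honour the gratuitous ARP that IPaddr's `send_arp` relies on, so a secondary IP must also be moved at the EC2 API level. A minimal, hedged sketch of that step follows; it assumes the `aws` CLI is installed with suitable IAM permissions, and the ENI ID is a placeholder:

```shell
#!/bin/sh
# Sketch: reassign the floating IP via the EC2 API on takeover,
# since gratuitous ARP alone is not honoured inside a VPC.
# ENI_ID is a placeholder for the local node's network interface.

FLOATING_IP=10.0.2.70
ENI_ID=eni-xxxxxxxx   # placeholder; look up with `aws ec2 describe-network-interfaces`

reassign_cmd() {
    # Build the reassignment command; --allow-reassignment lets the IP
    # move away from the peer node that currently holds it.
    echo "aws ec2 assign-private-ip-addresses" \
         "--network-interface-id $ENI_ID" \
         "--private-ip-addresses $FLOATING_IP" \
         "--allow-reassignment"
}

# In a takeover hook you would execute it, e.g.:  eval "$(reassign_cmd)"
reassign_cmd
```

The script only prints the command so it can be reviewed; in a real resource agent you would run it and check the exit status.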
Re: [Linux-HA] heartbeat 3.0.3 crashes if there are networking/multicast issues (ERROR: lowseq cannnot be greater than ackseq)
On Thu, Jun 26, 2014 at 01:30:01PM +0200, Lars Ellenberg wrote:
> On Tue, Jun 24, 2014 at 11:20:48PM +0300, Pasi Kärkkäinen wrote:
>> Hello!
>>
>> I've been seeing heartbeat cluster problems in Linux-based Vyatta and more recent VyOS networking/router appliances. These are currently based on Debian Squeeze, and are thus using:
>>
>> Package: heartbeat
>> Version: 1:3.0.3-2
>
> Please use 3.0.5:
> http://hg.linux-ha.org/heartbeat-STABLE_3_0/archive/37f57a36a2dd.tar.bz2

Do you think 3.0.5 fixes the issue of the heartbeat process crashing? This patch, perhaps?
http://hg.linux-ha.org/heartbeat-STABLE_3_0/rev/3e51db646a21

Thanks,

-- Pasi

>> VyOS bug report: http://bugzilla.vyos.net/show_bug.cgi?id=244
>>
>> The problem is that when there are (unexpected) networking problems causing multicast issues, which disrupt the inter-cluster communication, the heartbeat processes die on the cluster nodes, which is bad, right? I assume heartbeat should never die, especially not because of temporary networking issues. I've also seen heartbeat dying because of temporary network maintenance breaks.
>>
>> Basically, first I'm seeing this kind of message:
>>
>> [snip: "node vyos01: is dead" / "Cluster node vyos01 returning after partition" log excerpts, quoted in full in the original message in this thread]
>>
>> which seems normal in the case of a networking problem. But then later the "Message hist queue is filling up" errors start:
>>
>> [snip: log excerpts, 494 up to 500 messages in queue]
>>
>> The hist queue size keeps increasing, and when it gets to 500 messages bad things start happening:
>>
>> [snip: "lowseq cannnot be greater than ackseq" / "Emergency Shutdown" log excerpts]
>>
>> At this point clustering has failed, because the heartbeat services/processes aren't running anymore.
>>
>> Has anyone else seen this?
>
> It has been fixed years ago ...

>> It seems the bug gets triggered at 500 messages in the hist queue, and then I always see the "ERROR: lowseq cannnot be greater than ackseq" and then heartbeat dies.
Re: [Linux-HA] heartbeat 3.0.3 crashes if there are networking/multicast issues (ERROR: lowseq cannnot be greater than ackseq)
On Tue, Jun 24, 2014 at 11:20:48PM +0300, Pasi Kärkkäinen wrote:
> Hello!
>
> I've been seeing heartbeat cluster problems in Linux-based Vyatta and more recent VyOS networking/router appliances. These are currently based on Debian Squeeze, and are thus using:
>
> Package: heartbeat
> Version: 1:3.0.3-2

Please use 3.0.5:
http://hg.linux-ha.org/heartbeat-STABLE_3_0/archive/37f57a36a2dd.tar.bz2

> VyOS bug report: http://bugzilla.vyos.net/show_bug.cgi?id=244
>
> The problem is that when there are (unexpected) networking problems causing multicast issues, which disrupt the inter-cluster communication, the heartbeat processes die on the cluster nodes, which is bad, right? I assume heartbeat should never die, especially not because of temporary networking issues. I've also seen heartbeat dying because of temporary network maintenance breaks.
>
> Basically, first I'm seeing this kind of message:
>
> Jun 23 17:55:02 vyos03 heartbeat: [4119]: WARN: node vyos01: is dead
> Jun 23 17:59:23 vyos03 heartbeat: [4119]: CRIT: Cluster node vyos01 returning after partition.
> Jun 23 17:59:23 vyos03 heartbeat: [4119]: WARN: Deadtime value may be too small.
> Jun 23 17:59:23 vyos03 heartbeat: [4119]: WARN: Late heartbeat: Node vyos01: interval 273580 ms
> Jun 23 17:59:23 vyos03 harc[4961]: info: Running /etc/ha.d//rc.d/status status
> Jun 23 17:59:25 vyos03 ResourceManager[4991]: info: Releasing resource group: vyos01 IPaddr2-vyatta::10.0.0.10/24/eth1
> Jun 23 17:59:25 vyos03 ResourceManager[4991]: info: Running /etc/ha.d/resource.d/IPaddr2-vyatta 10.0.0.10/24/eth1 stop
> Jun 23 17:59:26 vyos03 heartbeat: [4119]: WARN: 1 lost packet(s) for [vyos01] [421:423]
> Jun 23 17:59:39 vyos03 heartbeat: [4119]: WARN: Logging daemon is disabled --enabling logging daemon is recommended
> Jun 23 17:59:40 vyos03 harc[5102]: info: Running /etc/ha.d//rc.d/status status
>
> which seems normal in the case of a networking problem. But then later:
>
> Jun 23 19:31:22 vyos03 heartbeat: [10921]: ERROR: Message hist queue is filling up (494 messages in queue)
> Jun 23 19:31:22 vyos03 heartbeat: [10921]: ERROR: Message hist queue is filling up (495 messages in queue)
> Jun 23 19:31:23 vyos03 heartbeat: [10921]: ERROR: Message hist queue is filling up (496 messages in queue)
> Jun 23 19:31:24 vyos03 heartbeat: [10921]: ERROR: Message hist queue is filling up (497 messages in queue)
> Jun 23 19:31:24 vyos03 heartbeat: [10921]: ERROR: Message hist queue is filling up (498 messages in queue)
> Jun 23 19:31:25 vyos03 heartbeat: [10921]: ERROR: Message hist queue is filling up (499 messages in queue)
> Jun 23 19:31:26 vyos03 heartbeat: [10921]: ERROR: Message hist queue is filling up (500 messages in queue)
> Jun 23 19:31:42 vyos03 heartbeat: last message repeated 25 times
>
> The hist queue size keeps increasing, and when it gets to 500 messages bad things start happening:
>
> Jun 23 19:31:43 vyos03 heartbeat: [10921]: ERROR: Message hist queue is filling up (500 messages in queue)
> Jun 23 19:31:49 vyos03 heartbeat: last message repeated 9 times
> Jun 23 19:31:49 vyos03 heartbeat: [10921]: ERROR: lowseq cannnot be greater than ackseq
> Jun 23 19:31:50 vyos03 heartbeat: [10923]: CRIT: Emergency Shutdown: Master Control process died.
> Jun 23 19:31:50 vyos03 heartbeat: [10923]: CRIT: Killing pid 10921 with SIGTERM
> Jun 23 19:31:50 vyos03 heartbeat: [10923]: CRIT: Killing pid 10924 with SIGTERM
> Jun 23 19:31:50 vyos03 heartbeat: [10923]: CRIT: Killing pid 10925 with SIGTERM
> Jun 23 19:31:50 vyos03 heartbeat: [10923]: CRIT: Emergency Shutdown(MCP dead): Killing ourselves.
>
> At this point clustering has failed, because the heartbeat services/processes aren't running anymore.
>
> Has anyone else seen this?

It has been fixed years ago ...

> It seems the bug gets triggered at 500 messages in the hist queue, and then I always see the "ERROR: lowseq cannnot be greater than ackseq" and then heartbeat dies.

--
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com

DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
Re: [Linux-HA] Heartbeat Supported Version
You should email Linbit (http://linbit.com), as they're the company that still supports the heartbeat package.

That said, if you are starting a new project, I strongly urge you to consider corosync + pacemaker. The heartbeat project has not been actively developed in quite some time, and there are no plans to restart development in the future.

Back in the day, heartbeat was one separate platform and Red Hat's RHCS was another. Over the years, this caused confusion and a lot of reinventing the wheel, so the two communities started work on merging into one common platform. The result is corosync + pacemaker, which is what all major developers are supporting from here on in.

If you're curious about the more detailed story, I've got a (still in progress) history here:

https://alteeve.ca/w/History_of_HA_Clustering

Again, it's not complete, but it does give a fairly good background on why heartbeat is not recommended anymore. It *is* still supported by Linbit, though, so I'm not saying not to use it. Just consider the future. :)

digimer

On 02/06/14 11:07 AM, Venkata G Thota wrote:
> Hello,
>
> In our project we have a heartbeat cluster with version heartbeat-2.1.4-0.24.9. Is this a supported version? Kindly assist with how to get support for heartbeat cluster issues.
>
> Regards,
> Venkata G Thota
> DLF IT PARK, GTS Services Delivery - India, Chennai
> UNIX Administrator, India
> Phone: +91 44 434 25397  Mobile: +91 99625 48884
> e-mail: venkt...@in.ibm.com
> Red Hat Certified Engineer

--
Digimer
Papers and Projects: https://alteeve.ca/w/
Re: [Linux-HA] Heartbeat Supported Version
On 2014-06-02T20:37:59, Venkata G Thota venkt...@in.ibm.com wrote:
> Hello,
>
> In our project we have a heartbeat cluster with version heartbeat-2.1.4-0.24.9. Is this a supported version? Kindly assist with how to get support for heartbeat cluster issues.

That looks like a fairly old heartbeat version from SUSE Linux Enterprise Server 10 SP4. SLES 10 has been out of general support since July 2013, but extended support (https://www.suse.com/support/lc-faq.html#2) or LTSS is still available.

Alternatively, the best option you'd have is to upgrade to SLES + HA 11 SP3.

Regards,
Lars

--
Architect Storage/HA
SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde
Re: [Linux-HA] Heartbeat Supported Version
On 2014-06-02T12:04:23, Digimer li...@alteeve.ca wrote:
> You should email Linbit (http://linbit.com), as they're the company that still supports the heartbeat package.

For completeness, I doubt Linbit will support this version, since 2.1.4 from SLES 10 contains a number of backports from the pacemaker 0.7/1.0 series. While the source code is obviously available, I'd not suggest inflicting this on Linbit ;-)

Regards,
Lars
Re: [Linux-HA] Heartbeat Supported Version
On 02/06/14 06:30 PM, Lars Marowsky-Bree wrote:
> For completeness, I doubt Linbit will support this version, since 2.1.4 from SLES 10 contains a number of backports from the pacemaker 0.7/1.0 series. While the source code is obviously available, I'd not suggest inflicting this on Linbit ;-)

I didn't think they would support it, but I wanted to leave that for Linbit to say (I've been wrong enough times before...)

--
Digimer
Papers and Projects: https://alteeve.ca/w/
Re: [Linux-HA] Heartbeat Supported Version
On Mon, Jun 02, 2014 at 06:31:09PM -0400, Digimer wrote:
> On 02/06/14 06:30 PM, Lars Marowsky-Bree wrote:
>> For completeness, I doubt Linbit will support this version, since 2.1.4 from SLES 10 contains a number of backports from the pacemaker 0.7/1.0 series.
>
> I didn't think they would support it, but I wanted to leave that for Linbit to say (I've been wrong enough times before...)

Thanks, both of you ;-)

Yes, we maintain heartbeat. We still occasionally bugfix (or even enhance) it if necessary. That does not mean we support each and every legacy version of it that happened to be bundled with some distribution at some point. That's probably a matter of time and money, and political constraints ...

If you really have to stay with whatever platform you have right now, but need it to be supported for a very long term: as that is a SUSE platform, ask SUSE what they can offer.

If you are basically happy with what you have, but need just this one snag fixed, describe your problem, and maybe someone will be able to tell you what to do (but that won't go without mentioning in every second sentence that you should probably upgrade).

If you are about to set up a new cluster, go with current software. Heartbeat itself is currently 3.something, in fact pending a new release tag since ages... Apart from a few but important bugfixes and some minor improvements to its inner workings, the main difference between heartbeat 2.x and heartbeat 3 is that the crm (cluster resource manager) part of it was split out (years ago) and became Pacemaker.

Depending on what you have now, what you are used to, what you feel most comfortable with, and what you want to achieve, I see several options:

* You are used to haresources, and in fact want to keep using it
  => use current heartbeat 3.x packages, and keep doing whatever you did until now.

* You have been using crm mode with heartbeat 2.x, or at least you now want to start using it
  => you should upgrade to Pacemaker, which is just the natural evolution of the heartbeat crm component, even with the same lead developer still. *Several years* of evolution and improvements, in fact. You have further options now:
  * Keep the cluster communication and membership layer: heartbeat (3.x) + pacemaker.
  * Change the cluster communication and membership layer: corosync (2.x) + pacemaker (and more, like cman and corosync 1.x...).

Recommendation for new clusters: go with pacemaker (1.1.12 will be released soon) and corosync (2.3.3 is it now?). That's also about what you will get with current distributions (RHEL 7, SLES 12). (Though we at Linbit are still happy with heartbeat + pacemaker as well.)

--
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com

DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
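[Editor's note: for readers unfamiliar with the two modes Lars contrasts, haresources (R1) mode is configured with one line per resource group in /etc/ha.d/haresources, unchanged between heartbeat 2.x and 3.x. A minimal sketch, with a placeholder node name, address, and service:

```
# /etc/ha.d/haresources -- one resource group per line:
# preferred node, then resources started left to right.
# "node1", the IP, and "httpd" are placeholders, not from this thread.
node1 IPaddr::192.168.85.3/24/eth0 httpd
```

In crm/Pacemaker mode this file is unused; the equivalent resources are defined in the CIB instead.]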
Re: [Linux-HA] Heartbeat Supported Version
On 02/06/14 07:05 PM, Lars Ellenberg wrote:
> (Though we at Linbit are still happy with heartbeat + pacemaker as well.)

Heathens! HEATHENS!!

;)

--
Digimer
Papers and Projects: https://alteeve.ca/w/
Re: [Linux-HA] heartbeat failover
Hello Arnold,

Yes, I recently found out that the sync rate was too high for our old firewall. There are two datacenters, and all traffic is routed through this firewall. I don't know exactly why; this is the concept somehow.

Do you know how to force another IP address on the other side? In heartbeat I was able to say that the cluster IP is a different one than on the other node. In corosync/pacemaker I can't find such an example.

Best regards,
Björn

-----Original Message-----
From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On Behalf Of Arnold Krille
Sent: Saturday, 25 January 2014 01:46
To: linux-ha@lists.linux-ha.org
Subject: Re: [Linux-HA] heartbeat failover

On Thu, 23 Jan 2014 16:45:04 +0000, bjoern.bec...@easycash.de wrote:
> Uhhh... I got the same configuration as the example config you sent me now. But I cause high CPU load on our Cisco ASA firewall. I guess this traffic is not normal?
<snip>

When you want your cluster to repair failures _fast_, the components have to sync their state _fast_. So they have to talk a lot: not in terms of megabytes, but in terms of small packets with low submission latency. So yes, that traffic is normal.

Why is there a firewall between your nodes on the network where the cluster traffic happens?

Have fun,
Arnold
Re: [Linux-HA] heartbeat failover
Hello, I am running corosync with success now. But I have a problem, because I have two different subnets and I don't know which ClusterIP I have to use. I have 10.128.61.0 and 10.128.62.0, so a ClusterIP like 10.128.61.61 will not be routed in 10.128.62.0. How can I use different ClusterIPs per side? Best regards Björn
-----Original Message----- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On behalf of Becker, Björn Sent: Thursday, 23 January 2014 17:45 To: linux-ha@lists.linux-ha.org Subject: Re: [Linux-HA] heartbeat failover
Uhhh.. I got the same configuration as the example config you sent me now. But I cause high CPU load on our Cisco ASA firewall.. I guess this traffic is not normal?
root@node01:/etc/corosync# tcpdump dst port 5405
tcpdump: verbose output suppressed, use -v or -vv for full protocol decode
listening on eth0, link-type EN10MB (Ethernet), capture size 65535 bytes
17:41:06.093140 IP node01.5405 > node02.5405: UDP, length 70
17:41:06.097327 IP node02.5405 > node01.5405: UDP, length 70
17:41:06.113418 IP node01.52580 > node02.5405: UDP, length 82
17:41:06.286517 IP node01.5405 > node02.5405: UDP, length 70
17:41:06.291095 IP node02.5405 > node01.5405: UDP, length 70
17:41:06.480221 IP node01.5405 > node02.5405: UDP, length 70
17:41:06.484520 IP node02.5405 > node01.5405: UDP, length 70
17:41:06.500608 IP node01.52580 > node02.5405: UDP, length 82
17:41:06.673721 IP node01.5405 > node02.5405: UDP, length 70
17:41:06.678654 IP node02.5405 > node01.5405: UDP, length 70
17:41:06.867757 IP node01.5405 > node02.5405: UDP, length 70
17:41:06.872492 IP node02.5405 > node01.5405: UDP, length 70
17:41:06.888576 IP node01.52580 > node02.5405: UDP, length 82
17:41:07.061664 IP node01.5405 > node02.5405: UDP, length 70
17:41:07.066304 IP node02.5405 > node01.5405: UDP, length 70
17:41:07.255409 IP node01.5405 > node02.5405: UDP, length 70
17:41:07.260512 IP node02.5405 > node01.5405: UDP, length 70
17:41:07.275601 IP node01.52580 > node02.5405: UDP, length 82
Best regards Björn
-----Original Message----- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On behalf of Becker, Björn Sent: Thursday, 23 January 2014 17:28 To: linux-ha@lists.linux-ha.org Subject: Re: [Linux-HA] heartbeat failover
Hi Lukas, thank you. Well, I have to wait for some firewall changes for 5405 UDP. But I'm not sure if it's correct what I'm doing.
Node1:
interface {
    member {
        memberaddr: 10.128.61.60 # node 1
    }
    member {
        memberaddr: 10.128.62.60 # node 2
    }
    # The following values need to be set based on your environment
    ringnumber: 0
    bindnetaddr: 10.128.61.0
    mcastport: 5405
}
transport: udpu
Node2:
interface {
    member {
        memberaddr: 10.128.61.60
    }
    member {
        memberaddr: 10.128.62.60
    }
    # The following values need to be set based on your environment
    ringnumber: 0
    bindnetaddr: 10.128.62.0
    mcastport: 5405
}
transport: udpu
Something definitely seems to be wrong. My firewall was on very high load...
Best regards Björn
-----Original Message----- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On behalf of Lukas Grossar Sent: Thursday, 23 January 2014 16:54 To: linux-ha@lists.linux-ha.org Subject: Re: [Linux-HA] heartbeat failover
Hi Björn Here is an example of how you can set up corosync to use unicast UDP: https://github.com/fghaas/corosync/blob/master/conf/corosync.conf.example.udpu The important parts are transport: udpu and that you need to configure every member manually using memberaddr: 10.16.35.115. Best regards Lukas
On Thu, 23 Jan 2014 13:36:22 + bjoern.bec...@easycash.de wrote: Hello, thanks a lot! I didn't know that heartbeat is almost deprecated. I'll try corosync and pacemaker, but I read that corosync needs to run over multicast. Unfortunately, I can't use multicast in my network. Do you know any other possibility? I can't find anything about whether corosync can run without multicast.
Best regards Björn -----Original Message----- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On behalf of Digimer Sent: Wednesday, 22 January 2014 20:36 To: General Linux-HA mailing list Subject: Re: [Linux-HA] heartbeat failover On 22/01/14 10:44 AM, bjoern.bec...@easycash.de wrote: Hello, I got a drbd+nfs+heartbeat setup and in general it's working. But it takes too long to fail over and I try to tune this. When node 1 is active and I shutdown
Re: [Linux-HA] heartbeat failover
Hello, thanks a lot! I didn't know that heartbeat is almost deprecated. I'll try corosync and pacemaker, but I read that corosync needs to run over multicast. Unfortunately, I can't use multicast in my network. Do you know any other possibility? I can't find anything about whether corosync can run without multicast. Best regards Björn
-----Original Message----- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On behalf of Digimer Sent: Wednesday, 22 January 2014 20:36 To: General Linux-HA mailing list Subject: Re: [Linux-HA] heartbeat failover
On 22/01/14 10:44 AM, bjoern.bec...@easycash.de wrote: Hello, I got a drbd+nfs+heartbeat setup and in general it's working. But it takes too long to fail over and I try to tune this. When node 1 is active and I shutdown node 2, then node 1 tries to activate the cluster. The problem is, node 1 already has the primary role, and when re-activating it takes time again, and during this the NFS share isn't available. Is it possible to disable this? Node 1 doesn't have to do anything if it's already in the primary role and the second node is not available. Best regards Björn
If this is a new project, I strongly recommend switching out heartbeat for corosync/pacemaker. Heartbeat is deprecated, hasn't been developed in a long time, and there are no plans to restart development in the future. Everything (even RH) is standardizing on the corosync+pacemaker stack, so it has the most vibrant community as well. -- Digimer Papers and Projects: https://alteeve.ca/w/ What if the cure for cancer is trapped in the mind of a person without access to education?
Re: [Linux-HA] heartbeat failover
Hi Björn Here is an example of how you can set up corosync to use unicast UDP: https://github.com/fghaas/corosync/blob/master/conf/corosync.conf.example.udpu The important parts are transport: udpu and that you need to configure every member manually using memberaddr: 10.16.35.115. Best regards Lukas
On Thu, 23 Jan 2014 13:36:22 + bjoern.bec...@easycash.de wrote: Hello, thanks a lot! I didn't know that heartbeat is almost deprecated. I'll try corosync and pacemaker, but I read that corosync needs to run over multicast. Unfortunately, I can't use multicast in my network. Do you know any other possibility? I can't find anything about whether corosync can run without multicast. Best regards Björn
-----Original Message----- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On behalf of Digimer Sent: Wednesday, 22 January 2014 20:36 To: General Linux-HA mailing list Subject: Re: [Linux-HA] heartbeat failover
On 22/01/14 10:44 AM, bjoern.bec...@easycash.de wrote: Hello, I got a drbd+nfs+heartbeat setup and in general it's working. But it takes too long to fail over and I try to tune this. When node 1 is active and I shutdown node 2, then node 1 tries to activate the cluster. The problem is, node 1 already has the primary role, and when re-activating it takes time again, and during this the NFS share isn't available. Is it possible to disable this? Node 1 doesn't have to do anything if it's already in the primary role and the second node is not available. Best regards Björn
If this is a new project, I strongly recommend switching out heartbeat for corosync/pacemaker. Heartbeat is deprecated, hasn't been developed in a long time, and there are no plans to restart development in the future. Everything (even RH) is standardizing on the corosync+pacemaker stack, so it has the most vibrant community as well. -- Adfinis SyGroup AG Lukas Grossar, System Engineer Keltenstrasse 98 | CH-3018 Bern Tel.
031 550 31 11 | Direkt 031 550 31 06
Re: [Linux-HA] heartbeat failover
Hi Lukas, thank you. Well, I have to wait for some firewall changes for 5405 UDP. But I'm not sure if it's correct what I'm doing.
Node1:
interface {
    member {
        memberaddr: 10.128.61.60 # node 1
    }
    member {
        memberaddr: 10.128.62.60 # node 2
    }
    # The following values need to be set based on your environment
    ringnumber: 0
    bindnetaddr: 10.128.61.0
    mcastport: 5405
}
transport: udpu
Node2:
interface {
    member {
        memberaddr: 10.128.61.60
    }
    member {
        memberaddr: 10.128.62.60
    }
    # The following values need to be set based on your environment
    ringnumber: 0
    bindnetaddr: 10.128.62.0
    mcastport: 5405
}
transport: udpu
Something definitely seems to be wrong. My firewall was on very high load... Best regards Björn
-----Original Message----- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On behalf of Lukas Grossar Sent: Thursday, 23 January 2014 16:54 To: linux-ha@lists.linux-ha.org Subject: Re: [Linux-HA] heartbeat failover
Hi Björn Here is an example of how you can set up corosync to use unicast UDP: https://github.com/fghaas/corosync/blob/master/conf/corosync.conf.example.udpu The important parts are transport: udpu and that you need to configure every member manually using memberaddr: 10.16.35.115. Best regards Lukas
On Thu, 23 Jan 2014 13:36:22 + bjoern.bec...@easycash.de wrote: Hello, thanks a lot! I didn't know that heartbeat is almost deprecated. I'll try corosync and pacemaker, but I read that corosync needs to run over multicast. Unfortunately, I can't use multicast in my network. Do you know any other possibility? I can't find anything about whether corosync can run without multicast. Best regards Björn
-----Original Message----- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On behalf of Digimer Sent: Wednesday, 22.
January 2014 20:36 To: General Linux-HA mailing list Subject: Re: [Linux-HA] heartbeat failover On 22/01/14 10:44 AM, bjoern.bec...@easycash.de wrote: Hello, I got a drbd+nfs+heartbeat setup and in general it's working. But it takes too long to fail over and I try to tune this. When node 1 is active and I shutdown node 2, then node 1 tries to activate the cluster. The problem is, node 1 already has the primary role, and when re-activating it takes time again, and during this the NFS share isn't available. Is it possible to disable this? Node 1 doesn't have to do anything if it's already in the primary role and the second node is not available. Best regards Björn If this is a new project, I strongly recommend switching out heartbeat for corosync/pacemaker. Heartbeat is deprecated, hasn't been developed in a long time, and there are no plans to restart development in the future. Everything (even RH) is standardizing on the corosync+pacemaker stack, so it has the most vibrant community as well. -- Adfinis SyGroup AG Lukas Grossar, System Engineer Keltenstrasse 98 | CH-3018 Bern Tel. 031 550 31 11 | Direkt 031 550 31 06
Re: [Linux-HA] heartbeat failover
Uhhh.. I got the same configuration as the example config you sent me now. But I cause high CPU load on our Cisco ASA firewall.. I guess this traffic is not normal?
root@node01:/etc/corosync# tcpdump dst port 5405
tcpdump: verbose output suppressed, use -v or -vv for full protocol decode
listening on eth0, link-type EN10MB (Ethernet), capture size 65535 bytes
17:41:06.093140 IP node01.5405 > node02.5405: UDP, length 70
17:41:06.097327 IP node02.5405 > node01.5405: UDP, length 70
17:41:06.113418 IP node01.52580 > node02.5405: UDP, length 82
17:41:06.286517 IP node01.5405 > node02.5405: UDP, length 70
17:41:06.291095 IP node02.5405 > node01.5405: UDP, length 70
17:41:06.480221 IP node01.5405 > node02.5405: UDP, length 70
17:41:06.484520 IP node02.5405 > node01.5405: UDP, length 70
17:41:06.500608 IP node01.52580 > node02.5405: UDP, length 82
17:41:06.673721 IP node01.5405 > node02.5405: UDP, length 70
17:41:06.678654 IP node02.5405 > node01.5405: UDP, length 70
17:41:06.867757 IP node01.5405 > node02.5405: UDP, length 70
17:41:06.872492 IP node02.5405 > node01.5405: UDP, length 70
17:41:06.888576 IP node01.52580 > node02.5405: UDP, length 82
17:41:07.061664 IP node01.5405 > node02.5405: UDP, length 70
17:41:07.066304 IP node02.5405 > node01.5405: UDP, length 70
17:41:07.255409 IP node01.5405 > node02.5405: UDP, length 70
17:41:07.260512 IP node02.5405 > node01.5405: UDP, length 70
17:41:07.275601 IP node01.52580 > node02.5405: UDP, length 82
Best regards Björn
-----Original Message----- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On behalf of Becker, Björn Sent: Thursday, 23 January 2014 17:28 To: linux-ha@lists.linux-ha.org Subject: Re: [Linux-HA] heartbeat failover
Hi Lukas, thank you. Well, I have to wait for some firewall changes for 5405 UDP. But I'm not sure if it's correct what I'm doing.
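For perspective, the capture above shows roughly 15 packets per second of 70 to 82 bytes each. A back-of-envelope estimate (the 15 packets/s figure is read off the ~1.2 s capture window, so treat it as approximate) shows how small this traffic actually is:

```shell
# Rough corosync heartbeat bandwidth estimate from the capture above.
# Assumption: ~15 packets/s total, worst case 82 bytes of UDP payload each.
pkts_per_sec=15
bytes_per_pkt=82
echo "$(( pkts_per_sec * bytes_per_pkt * 8 )) bits/s"   # -> 9840 bits/s
```

Under 10 kbit/s, so the packet *rate*, not the bandwidth, is what a stateful firewall inspecting every datagram ends up paying for.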
Node1:
interface {
    member {
        memberaddr: 10.128.61.60 # node 1
    }
    member {
        memberaddr: 10.128.62.60 # node 2
    }
    # The following values need to be set based on your environment
    ringnumber: 0
    bindnetaddr: 10.128.61.0
    mcastport: 5405
}
transport: udpu
Node2:
interface {
    member {
        memberaddr: 10.128.61.60
    }
    member {
        memberaddr: 10.128.62.60
    }
    # The following values need to be set based on your environment
    ringnumber: 0
    bindnetaddr: 10.128.62.0
    mcastport: 5405
}
transport: udpu
Something definitely seems to be wrong. My firewall was on very high load... Best regards Björn
-----Original Message----- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On behalf of Lukas Grossar Sent: Thursday, 23 January 2014 16:54 To: linux-ha@lists.linux-ha.org Subject: Re: [Linux-HA] heartbeat failover
Hi Björn Here is an example of how you can set up corosync to use unicast UDP: https://github.com/fghaas/corosync/blob/master/conf/corosync.conf.example.udpu The important parts are transport: udpu and that you need to configure every member manually using memberaddr: 10.16.35.115. Best regards Lukas
On Thu, 23 Jan 2014 13:36:22 + bjoern.bec...@easycash.de wrote: Hello, thanks a lot! I didn't know that heartbeat is almost deprecated. I'll try corosync and pacemaker, but I read that corosync needs to run over multicast. Unfortunately, I can't use multicast in my network. Do you know any other possibility? I can't find anything about whether corosync can run without multicast. Best regards Björn
-----Original Message----- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On behalf of Digimer Sent: Wednesday, 22 January 2014 20:36 To: General Linux-HA mailing list Subject: Re: [Linux-HA] heartbeat failover
On 22/01/14 10:44 AM, bjoern.bec...@easycash.de wrote: Hello, I got a drbd+nfs+heartbeat setup and in general it's working. But it takes too long to fail over and I try to tune this.
When node 1 is active and I shutdown node 2, then node 1 tries to activate the cluster. The problem is, node 1 already has the primary role, and when re-activating it takes time again, and during this the NFS share isn't available. Is it possible to disable this? Node 1 doesn't have to do anything if it's already in the primary role and the second node is not available. Best regards Björn If this is a new project, I strongly recommend switching out heartbeat for corosync/pacemaker. Heartbeat is deprecated, hasn't been developed in a long time and there are no plans to restart
Re: [Linux-HA] heartbeat failover
On 22/01/14 10:44 AM, bjoern.bec...@easycash.de wrote: Hello, I got a drbd+nfs+heartbeat setup and in general it's working. But it takes too long to fail over and I try to tune this. When node 1 is active and I shutdown node 2, then node 1 tries to activate the cluster. The problem is, node 1 already has the primary role, and when re-activating it takes time again, and during this the NFS share isn't available. Is it possible to disable this? Node 1 doesn't have to do anything if it's already in the primary role and the second node is not available. Best regards Björn
If this is a new project, I strongly recommend switching out heartbeat for corosync/pacemaker. Heartbeat is deprecated, hasn't been developed in a long time, and there are no plans to restart development in the future. Everything (even RH) is standardizing on the corosync+pacemaker stack, so it has the most vibrant community as well. -- Digimer Papers and Projects: https://alteeve.ca/w/ What if the cure for cancer is trapped in the mind of a person without access to education?
Re: [Linux-HA] Heartbeat errors related to Gmain_timeout_dispatch at low traffic
Hi Lars, We observed one pattern with these errors: in most cases, on both VMs, these errors came at the same time. We suspect either a network issue (but in that case only late heartbeat errors would come, not Gmain_timeout_dispatch related errors, right?) or that the VM is getting paused for some time for some reason, and when it is resumed the Gmain_timeout_dispatch/late heartbeat errors appear. We are investigating more on this. @heartbeat 3: for this issue, most of the time the advice given was to upgrade. But we are using the same heartbeat version in other setups also, and it is working fine there. What do you think? Regards, Savita
On Tue, Nov 19, 2013 at 4:23 PM, Lars Ellenberg lars.ellenb...@linbit.com wrote: On Thu, Nov 14, 2013 at 04:46:16PM +0530, Savita Kulkarni wrote: Hi, Recently we are seeing lots of heartbeat errors related to Gmain_timeout_dispatch on our system. I checked the mailing list archives to see if other people have faced this issue. There are a few email threads about it, but people are seeing this issue in case of high load. On our system very low/no load is present. We are running heartbeat on guest VMs, using VMware ESXi 5.0. We have heartbeat-2.1.3-4. It is working fine without any issues on other setups; the issue is coming only on this setup. The following types of errors are present in /var/log/messages:
Nov 12 09:58:43 heartbeat: [23036]: WARN: Gmain_timeout_dispatch: Dispatch function for send local status was delayed 15270 ms (> 1010 ms) before being called (GSource: 0x138926b8)
Nov 12 09:59:00 heartbeat: [23036]: info: Gmain_timeout_dispatch: started at 583294569 should have started at 583293042
Nov 12 09:59:00 heartbeat: [23036]: WARN: Gmain_timeout_dispatch: Dispatch function for update msgfree count was delayed 33960 ms (> 1 ms) before being called (GSource: 0x13892f58)
Can anyone tell me what the issue can be? Can it be a hardware issue? Could be many things, even that, yes. Could be that upgrading to recent heartbeat 3 helps.
Could be that there is too little load, and your virtualization just stops scheduling the VM itself, because it thinks it is underutilized... Does it recover if you kill/restart heartbeat? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
Re: [Linux-HA] Heartbeat errors related to Gmain_timeout_dispatch at low traffic
On Thu, Nov 14, 2013 at 04:46:16PM +0530, Savita Kulkarni wrote: Hi, Recently we are seeing lots of heartbeat errors related to Gmain_timeout_dispatch on our system. I checked the mailing list archives to see if other people have faced this issue. There are a few email threads about it, but people are seeing this issue in case of high load. On our system very low/no load is present. We are running heartbeat on guest VMs, using VMware ESXi 5.0. We have heartbeat-2.1.3-4. It is working fine without any issues on other setups; the issue is coming only on this setup. The following types of errors are present in /var/log/messages:
Nov 12 09:58:43 heartbeat: [23036]: WARN: Gmain_timeout_dispatch: Dispatch function for send local status was delayed 15270 ms (> 1010 ms) before being called (GSource: 0x138926b8)
Nov 12 09:59:00 heartbeat: [23036]: info: Gmain_timeout_dispatch: started at 583294569 should have started at 583293042
Nov 12 09:59:00 heartbeat: [23036]: WARN: Gmain_timeout_dispatch: Dispatch function for update msgfree count was delayed 33960 ms (> 1 ms) before being called (GSource: 0x13892f58)
Can anyone tell me what the issue can be? Can it be a hardware issue? Could be many things, even that, yes. Could be that upgrading to recent heartbeat 3 helps. Could be that there is too little load, and your virtualization just stops scheduling the VM itself, because it thinks it is underutilized... Does it recover if you kill/restart heartbeat? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
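To quantify how long the VM is being stalled, the delay values can be pulled straight out of those warnings. A quick sketch, run here against one sample line from the excerpt above (on a real system, feed /var/log/messages through the same pipeline and sort -n to find the worst stall):

```shell
# Sample heartbeat warning, taken from the log excerpt above
line='Nov 12 09:58:43 heartbeat: [23036]: WARN: Gmain_timeout_dispatch: Dispatch function for send local status was delayed 15270 ms'

# Extract the delay in milliseconds from the "delayed N ms" phrase
echo "$line" | grep -o 'delayed [0-9]* ms' | awk '{print $2}'   # -> 15270
```

A 15-second stall on an otherwise idle VM points at the hypervisor descheduling the guest, which matches Lars's suggestion above.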
Re: [Linux-HA] Heartbeat v1 and stonith/stonith_host ipmilan
On Wed, Jul 17, 2013 at 6:03 PM, Martin Langhoff martin.langh...@gmail.com wrote: But the 'stonith' script/binary and the scripts that the old documentation indicates aren't there anymore (when I install on RHEL 6.4). Configuring stonith_host external foo bar baz led me in the right direction. heartbeat knows what to do, but on RHEL/CentOS/SL 6.x cluster-glue no longer includes stonith agents. Some info at http://www.gossamer-threads.com/lists/linuxha/pacemaker/74487 So I rebuilt the RPMs for cluster-glue, reversing that removal. It is a dicey proposition, of course, to set up a cluster that I expect to be long-lived based on software that folks are running to deprecate. But I have played with corosync + pacemaker extensively, and TBH they are way overkill for a simple setup. Is there a _simple_ setup guide for a two-node cluster? Y'know, LVM, a couple of mountpoints, one server daemon (mysql)? I am not afraid of complexity; but I like to pick where to invest in complexity :-) cheers, m -- martin.langh...@gmail.com - ask interesting questions - don't get distracted with shiny stuff - working code first ~ http://docs.moodle.org/en/User:Martin_Langhoff
Re: [Linux-HA] Heartbeat v1 and stonith/stonith_host ipmilan
On 17/07/13 20:43, Martin Langhoff wrote: On Wed, Jul 17, 2013 at 6:03 PM, Martin Langhoff martin.langh...@gmail.com wrote: But the 'stonith' script/binary and the scripts that the old documentation indicates aren't there anymore (when I install on RHEL 6.4). Configuring stonith_host external foo bar baz led me in the right direction. heartbeat knows what to do, but on RHEL/CentOS/SL 6.x cluster-glue no longer includes stonith agents. Some info at http://www.gossamer-threads.com/lists/linuxha/pacemaker/74487 So I rebuilt the RPMs for cluster-glue, reversing that removal. It is a dicey proposition, of course, to set up a cluster that I expect to be long-lived based on software that folks are running to deprecate. But I have played with corosync + pacemaker extensively, and TBH they are way overkill for a simple setup. Is there a _simple_ setup guide for a two-node cluster? Y'know, LVM, a couple of mountpoints, one server daemon (mysql)? I am not afraid of complexity; but I like to pick where to invest in complexity :-) cheers,
The easiest, native way under RHEL/CentOS is to use corosync + cman + rgmanager. The configuration you are describing will be simple, will be properly supported until 2020 (at least), and will not need hacks. If you're interested in this approach, I can help. Here or on #linux-cluster on freenode's IRC. digimer -- Digimer Papers and Projects: https://alteeve.ca/w/ What if the cure for cancer is trapped in the mind of a person without access to education?
Re: [Linux-HA] Heartbeat v1 and stonith/stonith_host ipmilan
On Wed, Jul 17, 2013 at 9:34 PM, Digimer li...@alteeve.ca wrote: The easiest, native way under RHEL/CentOS is to use corosync + cman + rgmanager. The configuration you are describing will be simple and will be properly supported until 2020 (at least), and not need hacks. If you're interested in this approach, I can help. Here or on #linux-cluster on freenode's IRC. Thanks for the offer to help. Is there any clear setup guide you can point me to? My TZ is EDT, so midnight (bedtime!) now. I won't be awake and on email/irc until tomorrow morning. m -- martin.langh...@gmail.com - ask interesting questions - don't get distracted with shiny stuff - working code first ~ http://docs.moodle.org/en/User:Martin_Langhoff
Re: [Linux-HA] Heartbeat v1 and stonith/stonith_host ipmilan
On 18/07/13 00:12, Martin Langhoff wrote: On Wed, Jul 17, 2013 at 9:34 PM, Digimer li...@alteeve.ca wrote: The easiest, native way under RHEL/CentOS is to use corosync + cman + rgmanager. The configuration you are describing will be simple and will be properly supported until 2020 (at least), and not need hacks. If you're interested in this approach, I can help. Here or on #linux-cluster on freenode's IRC. Thanks for the offer to help. Is there any clear setup guide you can point me to? My TZ is EDT, so midnight (bedtime!) now. I won't be awake and on email/irc until tomorrow morning.
Heh, same timezone, but I'm more of a night owl. :) I have a tutorial that was written for people who want to host highly-available VMs on a two-node Red Hat cluster. It goes into a lot of detail that you may not be interested in, but I think it's pretty comprehensive (I tried to assume no prior knowledge of HA). So perhaps you can tease out the parts you're interested in. https://alteeve.ca/w/2-Node_Red_Hat_KVM_Cluster_Tutorial
Your configuration would need, basically:
* Node definitions with fence methods defined
* A resource section covering your storage and daemon
* A failover domain to control which node is primary for a given service and which is the backup
The tutorial covers clustered LVM and uses the GFS2 clustered file system. So it anticipates a somewhat complex setup. If you are looking for simple failover, you can skip all of that. You could even dump LVM altogether, if your goal is simply to support MySQL's data storage. So the config, in this case, would be:
* The cluster name is foo
* This is a two-node cluster (disable quorum)
** Node 1 is this, and here is how you fence it
** Node 2 is this, and here is how you fence it
* Resources:
** I have a file system resource called X mounted at Y
** I have a script resource that controls daemon Z
* Failover domain:
** I have an ordered domain that says run on node 1 when possible, node 2 otherwise.
if you fail over to node 2, stay there when node 1 returns
* Service:
** Create an ordered service that follows the rules set in the failover domain. This service requires the FS to mount before the daemon service starts. Stop in the reverse order.
That's it. It might seem a little overwhelming at first, but it really is pretty simple. You already understand the concept of fencing, which trips up most people, so you're more than half-way there. So long as your switch handles multicast, you're golden. If not, no big deal, just add the configuration option that forces unicast mode. hope this helps -- Digimer Papers and Projects: https://alteeve.ca/w/ What if the cure for cancer is trapped in the mind of a person without access to education?
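The outline Digimer describes could be sketched as a cluster.conf along the following lines. This is untested and every name, device, and fence parameter below is a placeholder; validate any real config against your own environment before relying on it:

```xml
<?xml version="1.0"?>
<cluster name="foo" config_version="1">
  <!-- two-node cluster: disable normal quorum rules -->
  <cman two_node="1" expected_votes="1"/>
  <clusternodes>
    <clusternode name="node1.example.com" nodeid="1">
      <fence><method name="ipmi"><device name="fence_n1"/></method></fence>
    </clusternode>
    <clusternode name="node2.example.com" nodeid="2">
      <fence><method name="ipmi"><device name="fence_n2"/></method></fence>
    </clusternode>
  </clusternodes>
  <fencedevices>
    <fencedevice name="fence_n1" agent="fence_ipmilan" ipaddr="10.0.0.1" login="admin" passwd="secret"/>
    <fencedevice name="fence_n2" agent="fence_ipmilan" ipaddr="10.0.0.2" login="admin" passwd="secret"/>
  </fencedevices>
  <rm>
    <failoverdomains>
      <!-- ordered: prefer node 1; nofailback: stay on node 2 after a failover -->
      <failoverdomain name="primary_n1" ordered="1" nofailback="1">
        <failoverdomainnode name="node1.example.com" priority="1"/>
        <failoverdomainnode name="node2.example.com" priority="2"/>
      </failoverdomain>
    </failoverdomains>
    <resources>
      <fs name="data" device="/dev/sda1" mountpoint="/srv/mysql" fstype="ext4"/>
      <script name="mysqld" file="/etc/init.d/mysqld"/>
    </resources>
    <!-- nesting the script inside the fs enforces mount-then-start, stop in reverse -->
    <service name="db" domain="primary_n1" recovery="relocate">
      <fs ref="data">
        <script ref="mysqld"/>
      </fs>
    </service>
  </rm>
</cluster>
```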
Re: [Linux-HA] Heartbeat haresources with IPv6
Hi Thiago, Heartbeat is deprecated and has not been developed in some time. There are no plans to restart development, either. It is _strongly_ advised that new setups use corosync + pacemaker. You can use the IPv6 resource agents with it, too. The best place to look is clusterlabs.org's Cluster from Scratch tutorial. It covers, as the first example, setting up an (IPv4) virtual IP address. It should be easy to adapt that to your IPv6 implementation. You will see two versions: one for crmsh and one for pcs. I would recommend the crmsh version for Ubuntu. Cheers
On 06/17/2013 11:35 AM, lis...@adminlinux.com.br wrote: Hi, I'm using Ubuntu 12.04 + Heartbeat 3.0.5-3ubuntu2 to provide high availability for some IP addresses. I want to configure an IPv6 address in my haresources. I did this:
File /etc/heartbeat/haresources:
server.domain.com \
    192.168.2.62/32/eth1 \
    192.168.2.64/32/eth1 \
    192.168.2.72/32/eth1 \
    IPv6addr::2001:db8:38a5:8::2006/48/eth1 \
    MailTo::a...@domain.com
The IPv4 addresses work fine, but I'm not getting success with the IPv6 address. My logs show this message:
ResourceManager[22129]: info: Running /etc/ha.d/resource.d/IPv6addr 2001:db8:38a5:8 2006/48/eth1 start
ResourceManager[22129]: CRIT: Giving up resources due to failure of IPv6addr::2001:db8:38a5:8::2006/48/eth1
ResourceManager[22129]: info: Running /etc/ha.d/resource.d/IPv6addr 2001:db8:38a5:8 2006/48/eth1 stop
ResourceManager[22129]: info: Retrying failed stop operation [IPv6addr::2001:db8:38a5:8::2006/48/eth1]
Apparently there is a conflict between the characters '::' inside the IPv6 address and the separator '::' used in the haresources. But I would not like to have to expand the IPv6 address. Does anyone know a way to avoid this conflict? Thanks!
-- Thiago Henrique www.adminlinux.com.br
-- Digimer Papers and Projects: https://alteeve.ca/w/ What if the cure for cancer is trapped in the mind of a person without access to education?
Re: [Linux-HA] Heartbeat haresources with IPv6
On Fri, Jun 14, 2013 at 03:29:49PM -0300, lis...@adminlinux.com.br wrote: Hi, I'm using Ubuntu 12.04 + Heartbeat 3.0.5-3ubuntu2 to provide high availability for some IP addresses. I want to configure an IPv6 address in my haresources. I did this:
File /etc/heartbeat/haresources:
server.domain.com \
    192.168.2.62/32/eth1 \
    192.168.2.64/32/eth1 \
    192.168.2.72/32/eth1 \
    IPv6addr::2001:db8:38a5:8::2006/48/eth1 \
    MailTo::a...@domain.com
The IPv4 addresses work fine, but I'm not getting success with the IPv6 address. My logs show this message:
ResourceManager[22129]: info: Running /etc/ha.d/resource.d/IPv6addr 2001:db8:38a5:8 2006/48/eth1 start
ResourceManager[22129]: CRIT: Giving up resources due to failure of IPv6addr::2001:db8:38a5:8::2006/48/eth1
ResourceManager[22129]: info: Running /etc/ha.d/resource.d/IPv6addr 2001:db8:38a5:8 2006/48/eth1 stop
ResourceManager[22129]: info: Retrying failed stop operation [IPv6addr::2001:db8:38a5:8::2006/48/eth1]
Apparently there is a conflict between the characters '::' inside the IPv6 address and the separator '::' used in the haresources. But I would not like to have to expand the IPv6 address. Does anyone know a way to avoid this conflict?
You can't have it all ;-) I see several options.
- use 2001:db8:38a5:8:0:0:0:2006/48/eth1
- abandon haresources
- hack the ResourceManager script of heartbeat, allow for escaping, or special-case IPv6addr or similar... it's plain shell after all
- hack the resource.d/IPv6addr *wrapper* script only, to mangle the input parameters.
The last two options would look something like below. You need only *one* of these, though using both would not hurt.
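The splitting can be reproduced by hand: running the failing haresources entry through the same sed pipeline that ResourceManager's resource2arg case uses shows the address coming out in two pieces, exactly as in the logs above:

```shell
# haresources separates arguments with '::'; applied to an entry whose
# argument itself contains '::', the second sed splits the address.
res='IPv6addr::2001:db8:38a5:8::2006/48/eth1'
echo "$res" | sed 's%[^:]*::%%' | sed 's%::% %g'
# -> 2001:db8:38a5:8 2006/48/eth1
```

The first sed strips the resource name up to the first '::'; the second, global one then turns the compressed-zeros '::' inside the address into a space, so the agent is invoked with two mangled arguments instead of one address.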
Untested, and likely whitespace mangled ;-)

--- ResourceManager
+++ ResourceManager
@@ -167,6 +167,11 @@ resource2script() {
 # multiple arguments are separated by :: delimiters
 resource2arg() {
 	case `canonname $1` in
+	IPv6addr::*)
+		# special case: there is only one argument,
+		# and it contains ::
+		echo $1 | sed 's%[^:]*::%%'
+		;;
 	*::*)
 		echo $1 | sed 's%[^:]*::%%' | sed 's%::% %g'
 		;;
 	esac

--- IPv6addr
+++ IPv6addr
@@ -17,6 +17,8 @@ usage() {
 	exit 1
 }
 
+[ $# = 3 ] && set -- $1::$2 $3
+
 if [ $# != 2 ]; then
 	usage
 fi

Cheers,

-- 
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com

DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.

___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
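The delimiter clash itself is easy to reproduce outside heartbeat. A hedged standalone sketch (my own, not part of Lars's patch) of what the stock resource2arg sed pipeline does to an abbreviated IPv6 spec, versus the special-cased variant:

```shell
#!/bin/sh
# The haresources entry as ResourceManager sees it (resource name,
# then arguments, all joined with '::').
spec='IPv6addr::2001:db8:38a5:8::2006/48/eth1'

# Stock resource2arg: strip the leading 'name::', then turn every
# remaining '::' into an argument separator -- which cuts the
# abbreviated IPv6 address in two.
stock=$(echo "$spec" | sed 's%[^:]*::%%' | sed 's%::% %g')
echo "stock:   $stock"    # 2001:db8:38a5:8 2006/48/eth1  (two args, broken)

# Special-cased: strip only the leading 'IPv6addr::' and keep the
# rest intact, matching the failed 'start' seen in the logs vs. the
# behavior the ResourceManager hunk would give.
fixed=$(echo "$spec" | sed 's%[^:]*::%%')
echo "special: $fixed"    # 2001:db8:38a5:8::2006/48/eth1  (one arg, intact)
```

The first output matches the mangled arguments visible in the poster's logs; the second preserves the address so the agent can parse it.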
Re: [Linux-HA] heartbeat 'ERROR' messages
I know it's tacky to reply to myself, but I can answer one of my questions after another 15 minutes or so of poring through logs:

On Tue, 2013-05-28 at 10:37 -0600, Greg Woods wrote:
> The questions are what do these messages actually mean, why is one
> cluster logging them and not the other, and is this something I should
> be worried about?

The answer to the last one is that this is definitely a problem, because after nearly half an hour, this is logged:

May 25 16:17:44 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[4] : [src=vmx1.ucar.edu]
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[5] : [(1)srcuuid=0x5ceb390(36 27)]
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[6] : [seq=3a4]
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[7] : [hg=4c97c17a]
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[8] : [ts=51a13888]
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[9] : [ld=0.50 0.33 0.28 3/316 13859]
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[10] : [ttl=3]
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[11] : [auth=1 feb94da356847a538290ea75f27423c996c0a595]
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5689]: ERROR: write_child: Exiting due to persistent errors: No such device
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5683]: WARN: Managed HBWRITE process 5689 exited with return code 1.
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5683]: ERROR: HBWRITE process died. Beginning communications restart process for comm channel 1.
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5683]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth4 - Status: 1
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5683]: WARN: Managed HBREAD process 5690 killed by signal 9 [SIGKILL - Kill, unblockable].
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5683]: ERROR: Both comm processes for channel 1 have died. Restarting.
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5683]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth4
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5683]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth4 - Status: 1
May 25 16:17:44 vmx1.ucar.edu heartbeat: [5683]: info: Communications restart succeeded.
May 25 16:17:45 vmx1.ucar.edu heartbeat: [5683]: info: Link vmx2.ucar.edu:eth4 up.

And VMs stop being reachable, etc. The only way to stabilize things is to not start heartbeat on one of the nodes (vmx1, arbitrarily chosen) and run all resources on a single node (vmx2 in this case).

--Greg

___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] heartbeat 'ERROR' messages
On 29/05/2013, at 2:37 AM, Greg Woods wo...@ucar.edu wrote:
> I have two clusters that are both running CentOS 5.6 and
> heartbeat-3.0.3-2.3.el5 (from the clusterlabs repo). They are running
> slightly different pacemaker versions (pacemaker-1.0.9.1-1.15.el5 on
> the first one and pacemaker-1.0.12-1.el5 on the other). They both have
> identical ha.cf files except that the bcast device names are different
> (and they are correct for each case, I checked), like this:
>
>   udpport 694
>   bcast eth2
>   bcast eth1
>   use_logd off
>   logfile /var/log/halog
>   debugfile /var/log/hadebug
>   debug 1
>   keepalive 2
>   deadtime 15
>   initdead 60
>   node vmd1.ucar.edu
>   node vmd2.ucar.edu
>   auto_failback off
>   respawn hacluster /usr/lib64/heartbeat/ipfail
>   crm respawn

I don't know about the rest, but definitely do not use both ipfail and crm. Pick one :)

> On one of them (which may or may not coincidentally be having some
> problems), I get these messages logged about every 2 seconds in
> /var/log/halog; on the other I don't see them:
>
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG: Dumping message with 10 fields
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[0] : [t=NS_ackmsg]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[1] : [dest=vmx2.ucar.edu]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[2] : [ackseq=3a0]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[3] : [(1)destuuid=0x5ceb280(37 28)]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[4] : [src=vmx1.ucar.edu]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[5] : [(1)srcuuid=0x5ceb390(36 27)]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[6] : [hg=4c97c17a]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[7] : [ts=51a13435]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[8] : [ttl=3]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[9] : [auth=1 23b556bcb61a08abecf87cb6411c62e62cf99f0d]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG: Dumping message with 12 fields
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[0] : [t=status]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[1] : [st=active]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[2] : [dt=3a98]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[3] : [protocol=1]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[4] : [src=vmx1.ucar.edu]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[5] : [(1)srcuuid=0x5ceb390(36 27)]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[6] : [seq=17b]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[7] : [hg=4c97c17a]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[8] : [ts=51a13435]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[9] : [ld=0.27 0.41 0.26 1/315 19183]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[10] : [ttl=3]
>   May 25 15:59:17 vmx1.ucar.edu heartbeat: [5689]: ERROR: MSG[11] : [auth=1 3d3da4df831636f7c274395041ffb49bbf215170]
>
> The questions are what do these messages actually mean, why is one
> cluster logging them and not the other, and is this something I should
> be worried about?
>
> Thanks for any info,
> --Greg

___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
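For reference, the quoted ha.cf with Andrew's advice applied (the ipfail respawn dropped, crm kept) would look something like this. A hedged sketch only, not a tested configuration; node names and interfaces are of course site-specific:

```
udpport 694
bcast eth2
bcast eth1
use_logd off
keepalive 2
deadtime 15
initdead 60
node vmd1.ucar.edu
node vmd2.ucar.edu
auto_failback off
crm respawn
```

With crm enabled, connectivity-based failover is handled by Pacemaker (e.g. ocf:pacemaker:ping) rather than by ipfail.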
Re: [Linux-HA] heartbeat 'ERROR' messages
On Wed, 2013-05-29 at 07:50 +1000, Andrew Beekhof wrote: respawn hacluster /usr/lib64/heartbeat/ipfail crm respawn I don't know about the rest, but definitely do not use both ipfail and crm. Pick one :) I guess I will have to look into what ipfail really does. I have a half dozen clusters that have virtually the same ha.cf files and they have been running for 2+ years with it specified this way. --Greg ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] heartbeat 'ERROR' messages
On 29/05/2013, at 8:05 AM, Greg Woods wo...@ucar.edu wrote: On Wed, 2013-05-29 at 07:50 +1000, Andrew Beekhof wrote: respawn hacluster /usr/lib64/heartbeat/ipfail crm respawn I don't know about the rest, but definitely do not use both ipfail and crm. Pick one :) I guess I will have to look into what ipfail really does. With crm enabled, nothing. Try http://clusterlabs.org/doc/en-US/Pacemaker/1.1-plugin/html/Pacemaker_Explained/_moving_resources_due_to_connectivity_changes.html I have a half dozen clusters that have virtually the same ha.cf files and they have been running for 2+ years with it specified this way. --Greg ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat IPv6addr OCF
Hi Nick, Could you tell us which version of resource-agents you're using? Prior to 3.9.2, IPv6addr requires a static IPv6 address with exactly the same prefix in order to find an appropriate nic; so you should have statically assigned 2600:3c00::34:c003/116 on eth0, for example. As of 3.9.3, this has been relaxed and the specified nic is always used even if the prefix does not match, so it should just work. (At least it works for me.) Alternatively, as of 3.9.5, you can also use IPaddr2 for managing a virtual IPv6 address, which is brand new, and I would prefer it because it uses the standard ip command. Thanks, 2013/3/25 Nick Walke tubaguy50...@gmail.com: This the correct place to report bugs? https://github.com/ClusterLabs/resource-agents Nick On Sun, Mar 24, 2013 at 10:45 PM, Thomas Glanzmann tho...@glanzmann.de wrote: Hello Nick, I shouldn't be able to do that if the IPv6 module wasn't loaded, correct? that is correct. I tried modifying my netmask to copy yours. And I get the same error you do: ipv6test_start_0 (node=node-62, call=6, rc=1, status=complete): unknown error So probably a bug in the resource agent.
Manually adding and removing works: (node-62) [~] ip -6 addr add 2a01:4f8:bb:400::2/116 dev eth0 (node-62) [~] ip -6 addr show dev eth0 2: eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qlen 1000 inet6 2a01:4f8:bb:400::2/116 scope global valid_lft forever preferred_lft forever inet6 2a01:4f8:bb:400:225:90ff:fe97:dbb0/64 scope global dynamic valid_lft 2591887sec preferred_lft 604687sec inet6 fe80::225:90ff:fe97:dbb0/64 scope link valid_lft forever preferred_lft forever (node-62) [~] ip -6 addr del 2a01:4f8:bb:400::2/116 dev eth0 Nick, you can do the following things to resolve this: - Hunt down the bug and fix it or let someone else do it for you - Use another netmask, if possible (fighting the symptoms instead of resolving the root cause) - Write your own resource agent (fighting the symptoms instead of resolving the root cause) Cheers, Thomas ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems -- Keisuke MORI ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
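Keisuke's IPaddr2 suggestion would look roughly like this in crm shell syntax. A hedged sketch only, assuming resource-agents >= 3.9.5 (where IPaddr2 accepts an IPv6 address); the resource name, address, prefix, and nic below are illustrative, not from the thread:

```
# Hypothetical IPv6 virtual IP managed by IPaddr2 (resource-agents >= 3.9.5),
# which drives the standard 'ip' command rather than IPv6addr's own logic.
primitive vip6 ocf:heartbeat:IPaddr2 \
    params ip=2001:db8::10 cidr_netmask=64 nic=eth0 \
    op monitor interval=15s
```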
Re: [Linux-HA] Heartbeat IPv6addr OCF
Looks like 3.9.2-5. So I need to statically assign the address I want to use before using it with IPv6addr? On Mar 25, 2013 3:44 AM, Keisuke MORI keisuke.mori...@gmail.com wrote: Hi Nick, Could you privide which version of resource-agents you're using? Prior to 3.9.2, IPv6addr requires a static IPv6 address with the exactly same prefix to find out an apropriate nic; so you should have statically assigned 2600:3c00::34:c003/116 on eth0 for example. As of 3.9.3, it has relaxed and the specified nic is always used no matter if the prefix does not match; so it should just work. (at least it works for me) Alternatively, as of 3.9.5, you can also use IPaddr2 for managing a virtual IPv6 address, which is brand new and I would prefer this because it uses the standard ip command. Thanks, 2013/3/25 Nick Walke tubaguy50...@gmail.com: This the correct place to report bugs? https://github.com/ClusterLabs/resource-agents Nick On Sun, Mar 24, 2013 at 10:45 PM, Thomas Glanzmann tho...@glanzmann.de wrote: Hello Nick, I shouldn't be able to do that if the IPv6 module wasn't loaded, correct? that is correct. I tried modifying my netmask to copy yours. And I get the same error, you do: ipv6test_start_0 (node=node-62, call=6, rc=1, status=complete): unknown error So probably a bug in the resource agent. 
Manually adding and removing works: (node-62) [~] ip -6 addr add 2a01:4f8:bb:400::2/116 dev eth0 (node-62) [~] ip -6 addr show dev eth0 2: eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qlen 1000 inet6 2a01:4f8:bb:400::2/116 scope global valid_lft forever preferred_lft forever inet6 2a01:4f8:bb:400:225:90ff:fe97:dbb0/64 scope global dynamic valid_lft 2591887sec preferred_lft 604687sec inet6 fe80::225:90ff:fe97:dbb0/64 scope link valid_lft forever preferred_lft forever (node-62) [~] ip -6 addr del 2a01:4f8:bb:400::2/116 dev eth0 Nick, you can do the following things to resolve this: - Hunt down the bug and fix it or let someone else do it for you - Use another netmask, if possible (fighting the symptoms instead of resolving the root cause) - Write your own resource agent (fighting the symptoms instead of resolving the root cause) Cheers, Thomas ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems -- Keisuke MORI ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat IPv6addr OCF
2013/3/25 Nick Walke tubaguy50...@gmail.com: Looks like 3.9.2-5. So I need to statically assign the address I want to use before using it with IPv6addr? Yes. On Mar 25, 2013 3:44 AM, Keisuke MORI keisuke.mori...@gmail.com wrote: Hi Nick, Could you privide which version of resource-agents you're using? Prior to 3.9.2, IPv6addr requires a static IPv6 address with the exactly same prefix to find out an apropriate nic; so you should have statically assigned 2600:3c00::34:c003/116 on eth0 for example. As of 3.9.3, it has relaxed and the specified nic is always used no matter if the prefix does not match; so it should just work. (at least it works for me) Alternatively, as of 3.9.5, you can also use IPaddr2 for managing a virtual IPv6 address, which is brand new and I would prefer this because it uses the standard ip command. Thanks, 2013/3/25 Nick Walke tubaguy50...@gmail.com: This the correct place to report bugs? https://github.com/ClusterLabs/resource-agents Nick On Sun, Mar 24, 2013 at 10:45 PM, Thomas Glanzmann tho...@glanzmann.de wrote: Hello Nick, I shouldn't be able to do that if the IPv6 module wasn't loaded, correct? that is correct. I tried modifying my netmask to copy yours. And I get the same error, you do: ipv6test_start_0 (node=node-62, call=6, rc=1, status=complete): unknown error So probably a bug in the resource agent. 
Manually adding and removing works: (node-62) [~] ip -6 addr add 2a01:4f8:bb:400::2/116 dev eth0 (node-62) [~] ip -6 addr show dev eth0 2: eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qlen 1000 inet6 2a01:4f8:bb:400::2/116 scope global valid_lft forever preferred_lft forever inet6 2a01:4f8:bb:400:225:90ff:fe97:dbb0/64 scope global dynamic valid_lft 2591887sec preferred_lft 604687sec inet6 fe80::225:90ff:fe97:dbb0/64 scope link valid_lft forever preferred_lft forever (node-62) [~] ip -6 addr del 2a01:4f8:bb:400::2/116 dev eth0 Nick, you can do the following things to resolve this: - Hunt down the bug and fix it or let someone else do it for you - Use another netmask, if possible (fighting the symptoms instead of resolving the root cause) - Write your own resource agent (fighting the symptoms instead of resolving the root cause) Cheers, Thomas ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems -- Keisuke MORI ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems -- Keisuke MORI ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat IPv6addr OCF
That works. Thanks! Nick On Mon, Mar 25, 2013 at 4:22 AM, Keisuke MORI keisuke.mori...@gmail.comwrote: 2013/3/25 Nick Walke tubaguy50...@gmail.com: Looks like 3.9.2-5. So I need to statically assign the address I want to use before using it with IPv6addr? Yes. On Mar 25, 2013 3:44 AM, Keisuke MORI keisuke.mori...@gmail.com wrote: Hi Nick, Could you privide which version of resource-agents you're using? Prior to 3.9.2, IPv6addr requires a static IPv6 address with the exactly same prefix to find out an apropriate nic; so you should have statically assigned 2600:3c00::34:c003/116 on eth0 for example. As of 3.9.3, it has relaxed and the specified nic is always used no matter if the prefix does not match; so it should just work. (at least it works for me) Alternatively, as of 3.9.5, you can also use IPaddr2 for managing a virtual IPv6 address, which is brand new and I would prefer this because it uses the standard ip command. Thanks, 2013/3/25 Nick Walke tubaguy50...@gmail.com: This the correct place to report bugs? https://github.com/ClusterLabs/resource-agents Nick On Sun, Mar 24, 2013 at 10:45 PM, Thomas Glanzmann tho...@glanzmann.de wrote: Hello Nick, I shouldn't be able to do that if the IPv6 module wasn't loaded, correct? that is correct. I tried modifying my netmask to copy yours. And I get the same error, you do: ipv6test_start_0 (node=node-62, call=6, rc=1, status=complete): unknown error So probably a bug in the resource agent. 
Manually adding and removing works: (node-62) [~] ip -6 addr add 2a01:4f8:bb:400::2/116 dev eth0 (node-62) [~] ip -6 addr show dev eth0 2: eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qlen 1000 inet6 2a01:4f8:bb:400::2/116 scope global valid_lft forever preferred_lft forever inet6 2a01:4f8:bb:400:225:90ff:fe97:dbb0/64 scope global dynamic valid_lft 2591887sec preferred_lft 604687sec inet6 fe80::225:90ff:fe97:dbb0/64 scope link valid_lft forever preferred_lft forever (node-62) [~] ip -6 addr del 2a01:4f8:bb:400::2/116 dev eth0 Nick, you can do the following things to resolve this: - Hunt down the bug and fix it or let someone else do it for you - Use another netmask, if possible (fighting the symptoms instead of resolving the root cause) - Write your own resource agent (fighting the symptoms instead of resolving the root cause) Cheers, Thomas ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems -- Keisuke MORI ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems -- Keisuke MORI ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat IPv6addr OCF
Hello,

> ipv6addr=2600:3c00::0034:c007

From the manpage of ocf_heartbeat_IPv6addr it looks like you have to specify the netmask, so try:

ipv6addr=2600:3c00::0034:c007/64

assuming that you're in a /64.

Cheers,
Thomas

___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat IPv6addr OCF
Thanks for the tip, however, it did not work. That's actually a /116. So I put in 2600:3c00::0034:c007/116 and am getting the same error. I requested that it restart the resource as well, just to make sure it wasn't the previous error.

Nick

On Sun, Mar 24, 2013 at 3:55 AM, Thomas Glanzmann tho...@glanzmann.de wrote:
> Hello,
> ipv6addr=2600:3c00::0034:c007
> From the manpage of ocf_heartbeat_IPv6addr it looks like you have to
> specify the netmask, so try:
> ipv6addr=2600:3c00::0034:c007/64
> assuming that you're in a /64.
> Cheers, Thomas

___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat IPv6addr OCF
Hello Nick Try to use nic=eth0 instead of nic=eth0:3 thanks 2013/3/24 Nick Walke tubaguy50...@gmail.com Thanks for the tip, however, it did not work. That's actually a /116. So I put in 2600:3c00::0034:c007/116 and am getting the same error. I requested that it restart the resource as well, just to make sure it wasn't the previous error. Nick On Sun, Mar 24, 2013 at 3:55 AM, Thomas Glanzmann tho...@glanzmann.de wrote: Hello, ipv6addr=2600:3c00::0034:c007 from the manpage of ocf_heartbeat_IPv6addr it looks like that you have to specify the netmask so try: ipv6addr=2600:3c00::0034:c007/64 assuiming that you're in a /64. Cheers, Thomas ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems -- esta es mi vida e me la vivo hasta que dios quiera ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat IPv6addr OCF
Hello Nick,

> Thanks for the tip, however, it did not work. That's actually a /116.
> So I put in 2600:3c00::0034:c007/116 and am getting the same error. I
> requested that it restart the resource as well, just to make sure it
> wasn't the previous error.

now, I had to try it:

node $id=9d9b62d2-405d-459a-a724-cb2643d7d9a1 node-62
primitive ipv6test ocf:heartbeat:IPv6addr \
    params ipv6addr=2a01:4f8:bb:400::2/64 \
    op monitor interval=15 timeout=15 \
    meta target-role=Started
property $id=cib-bootstrap-options \
    dc-version=1.1.7-ee0730e13d124c3d58f00016c3376a1de5323cff \
    cluster-infrastructure=Heartbeat \
    stonith-enabled=false

And it works:

(node-62) [~] ifconfig
eth0      Link encap:Ethernet  HWaddr 00:25:90:97:db:b0
          inet addr:10.100.4.62  Bcast:10.100.255.255  Mask:255.255.0.0
          inet6 addr: 2a01:4f8:bb:400:225:90ff:fe97:dbb0/64 Scope:Global
          inet6 addr: fe80::225:90ff:fe97:dbb0/64 Scope:Link
          inet6 addr: 2a01:4f8:bb:400::2/64 Scope:Global
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:40345 errors:0 dropped:0 overruns:0 frame:0
          TX packets:10270 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:52540127 (50.1 MiB)  TX bytes:1127817 (1.0 MiB)
          Memory:fb58-fb60

(infra) [~] traceroute 2a01:4f8:bb:400::2
traceroute to 2a01:4f8:bb:400::2 (2a01:4f8:bb:400::2), 30 hops max, 80 byte packets
 1  merlin.glanzmann.de (2a01:4f8:bb:4ff::1)  1.413 ms  1.550 ms  1.791 ms
 2  2a01:4f8:bb:400::2 (2a01:4f8:bb:400::2)  0.204 ms  0.202 ms  0.270 ms

Cheers,
Thomas

___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat IPv6addr OCF
On Sun, 2013-03-24 at 01:36 -0700, tubaguy50035 wrote: params ipv6addr=2600:3c00::0034:c007 nic=eth0:3 \ Are you sure that's a valid IPV6 address? I get headaches every time I look at these, but it seems a valid address is 8 groups, and you've got 5 there. Maybe you mean 2600:3c00::0034:c007? --Greg ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
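(An aside, not from the thread: the '::' shorthand is what makes only five groups appear, so the abbreviated form is in fact legal. A quick check with Python's stdlib shows the expansion.)

```shell
# '::' stands in for however many all-zero groups are missing, so the
# five visible groups expand to the full eight-group address.
python3 -c 'import ipaddress; print(ipaddress.IPv6Address("2600:3c00::0034:c007").exploded)'
# prints: 2600:3c00:0000:0000:0000:0000:0034:c007
```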
Re: [Linux-HA] Heartbeat IPv6addr OCF
I don't know what I'm doing wrong then. I copied exactly what you put in and now I'm getting these errors:

ipv6test_start_0 (node=tek-lin-lb1, call=25, rc=1, status=complete): unknown error
ipv6test_start_0 (node=tek-lin-lb2, call=20, rc=1, status=complete): unknown error

Looking in my syslog I see:

Mar 24 14:37:13 tek-lin-lb2 IPv6addr: [8038]: ERROR: no valid mecahnisms
Mar 24 14:37:13 tek-lin-lb2 lrmd: [3005]: info: operation start[18] on ipv6test for client 3008: pid 8038 exited with return code 1
Mar 24 14:37:13 tek-lin-lb2 crmd: [3008]: info: process_lrm_event: LRM operation ipv6test_start_0 (call=18, rc=1, cib-update=65, confirmed=true) unknown error

Anything I need to do to allow IPv6... or something?

Nick

On Sun, Mar 24, 2013 at 4:29 AM, Thomas Glanzmann tho...@glanzmann.de wrote: Hello Nick, Thanks for the tip, however, it did not work. That's actually a /116. So I put in 2600:3c00::0034:c007/116 and am getting the same error. I requested that it restart the resource as well, just to make sure it wasn't the previous error.
now, I had to try it: node $id=9d9b62d2-405d-459a-a724-cb2643d7d9a1 node-62 primitive ipv6test ocf:heartbeat:IPv6addr \ params ipv6addr=2a01:4f8:bb:400::2/64 \ op monitor interval=15 timeout=15 \ meta target-role=Started property $id=cib-bootstrap-options \ dc-version=1.1.7-ee0730e13d124c3d58f00016c3376a1de5323cff \ cluster-infrastructure=Heartbeat \ stonith-enabled=false And it works: (node-62) [~] ifconfig eth0 Link encap:Ethernet HWaddr 00:25:90:97:db:b0 inet addr:10.100.4.62 Bcast:10.100.255.255 Mask:255.255.0.0 inet6 addr: 2a01:4f8:bb:400:225:90ff:fe97:dbb0/64 Scope:Global inet6 addr: fe80::225:90ff:fe97:dbb0/64 Scope:Link inet6 addr: 2a01:4f8:bb:400::2/64 Scope:Global UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:40345 errors:0 dropped:0 overruns:0 frame:0 TX packets:10270 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:52540127 (50.1 MiB) TX bytes:1127817 (1.0 MiB) Memory:fb58-fb60 (infra) [~] traceroute 2a01:4f8:bb:400::2 traceroute to 2a01:4f8:bb:400::2 (2a01:4f8:bb:400::2), 30 hops max, 80 byte packets 1 merlin.glanzmann.de (2a01:4f8:bb:4ff::1) 1.413 ms 1.550 ms 1.791 ms 2 2a01:4f8:bb:400::2 (2a01:4f8:bb:400::2) 0.204 ms 0.202 ms 0.270 ms Cheers, Thomas ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat IPv6addr OCF
Hello Nick, Anything I need to do to allow IPv6... or something? I agree with Greg here. Have you tried setting the address manually? ip -6 addr add ip/cidr dev eth0 ip -6 addr show dev eth0 ip -6 addr del ip/cidr dev eth0 ip -6 addr show dev eth0 (node-62) [~] ip -6 addr add 2a01:4f8:bb:400::3/64 dev eth0 (node-62) [~] ip -6 addr show dev eth0 2: eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qlen 1000 inet6 2a01:4f8:bb:400::3/64 scope global valid_lft forever preferred_lft forever inet6 2a01:4f8:bb:400::2/64 scope global valid_lft forever preferred_lft forever inet6 2a01:4f8:bb:400:225:90ff:fe97:dbb0/64 scope global dynamic valid_lft 2591998sec preferred_lft 604798sec inet6 fe80::225:90ff:fe97:dbb0/64 scope link valid_lft forever preferred_lft forever (node-62) [~] ip -6 addr del 2a01:4f8:bb:400::3/64 dev eth0 (node-62) [~] ip -6 addr show dev eth0 2: eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qlen 1000 inet6 2a01:4f8:bb:400::2/64 scope global valid_lft forever preferred_lft forever inet6 2a01:4f8:bb:400:225:90ff:fe97:dbb0/64 scope global dynamic valid_lft 2591990sec preferred_lft 604790sec inet6 fe80::225:90ff:fe97:dbb0/64 scope link valid_lft forever preferred_lft forever Do you see a link local address on your eth0? A link local address is one that starts with fe80:: otherwise try loading the ipv6 module: modprobe ipv6 # Don't know if that is the right module name, all my # kernels have ipv6 build in (Debian wheezy / squeeze / backports) Cheers, Thomas ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat IPv6addr OCF
From the first node: nick@tek-lin-lb1:~$ sudo ip -6 addr add 2600:3c00::34:c007/116 dev eth0 nick@tek-lin-lb1:~$ sudo ip -6 addr show dev eth0 3: eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qlen 1000 inet6 2600:3c00::34:c007/116 scope global valid_lft forever preferred_lft forever inet6 2600:3c00::f03c:91ff:fe70:7541/64 scope global dynamic valid_lft 43200sec preferred_lft 43200sec inet6 2600:3c00::34:c003/64 scope global valid_lft forever preferred_lft forever inet6 fe80::f03c:91ff:fe70:7541/64 scope link valid_lft forever preferred_lft forever nick@tek-lin-lb1:~$ sudo ip -6 addr del 2600:3c00::34:c007/116 dev eth0 nick@tek-lin-lb1:~$ sudo ip -6 addr show dev eth0 3: eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qlen 1000 inet6 2600:3c00::f03c:91ff:fe70:7541/64 scope global dynamic valid_lft 43200sec preferred_lft 43200sec inet6 2600:3c00::34:c003/64 scope global valid_lft forever preferred_lft forever inet6 fe80::f03c:91ff:fe70:7541/64 scope link valid_lft forever preferred_lft forever From the second node: nick@tek-lin-lb2:~$ sudo ip -6 addr add 2600:3c00::34:c007/116 dev eth0 nick@tek-lin-lb2:~$ sudo ip -6 addr show dev eth0 3: eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qlen 1000 inet6 2600:3c00::34:c007/116 scope global valid_lft forever preferred_lft forever inet6 2600:3c00::f03c:91ff:fe70:f0a4/64 scope global dynamic valid_lft 43190sec preferred_lft 43190sec inet6 2600:3c00::34:c005/64 scope global valid_lft forever preferred_lft forever inet6 fe80::f03c:91ff:fe70:f0a4/64 scope link valid_lft forever preferred_lft forever nick@tek-lin-lb2:~$ sudo ip -6 addr del 2600:3c00::34:c007/116 dev eth0 nick@tek-lin-lb2:~$ sudo ip -6 addr show dev eth0 3: eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qlen 1000 inet6 2600:3c00::f03c:91ff:fe70:f0a4/64 scope global dynamic valid_lft 43197sec preferred_lft 43197sec inet6 2600:3c00::34:c005/64 scope global valid_lft forever preferred_lft forever inet6 fe80::f03c:91ff:fe70:f0a4/64 scope link valid_lft forever 
preferred_lft forever I shouldn't be able to do that if the IPv6 module wasn't loaded, correct? So it seems like it is. Nick On Sun, Mar 24, 2013 at 3:16 PM, Thomas Glanzmann tho...@glanzmann.dewrote: Hello Nick, Anything I need to do to allow IPv6... or something? I agree with Greg here. Have you tried setting the address manually? ip -6 addr add ip/cidr dev eth0 ip -6 addr show dev eth0 ip -6 addr del ip/cidr dev eth0 ip -6 addr show dev eth0 (node-62) [~] ip -6 addr add 2a01:4f8:bb:400::3/64 dev eth0 (node-62) [~] ip -6 addr show dev eth0 2: eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qlen 1000 inet6 2a01:4f8:bb:400::3/64 scope global valid_lft forever preferred_lft forever inet6 2a01:4f8:bb:400::2/64 scope global valid_lft forever preferred_lft forever inet6 2a01:4f8:bb:400:225:90ff:fe97:dbb0/64 scope global dynamic valid_lft 2591998sec preferred_lft 604798sec inet6 fe80::225:90ff:fe97:dbb0/64 scope link valid_lft forever preferred_lft forever (node-62) [~] ip -6 addr del 2a01:4f8:bb:400::3/64 dev eth0 (node-62) [~] ip -6 addr show dev eth0 2: eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qlen 1000 inet6 2a01:4f8:bb:400::2/64 scope global valid_lft forever preferred_lft forever inet6 2a01:4f8:bb:400:225:90ff:fe97:dbb0/64 scope global dynamic valid_lft 2591990sec preferred_lft 604790sec inet6 fe80::225:90ff:fe97:dbb0/64 scope link valid_lft forever preferred_lft forever Do you see a link local address on your eth0? A link local address is one that starts with fe80:: otherwise try loading the ipv6 module: modprobe ipv6 # Don't know if that is the right module name, all my # kernels have ipv6 build in (Debian wheezy / squeeze / backports) Cheers, Thomas ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat IPv6addr OCF
Hello Nick,

> I shouldn't be able to do that if the IPv6 module wasn't loaded, correct?

That is correct. I tried modifying my netmask to copy yours, and I get the same error you do:

ipv6test_start_0 (node=node-62, call=6, rc=1, status=complete): unknown error

So it is probably a bug in the resource agent. Manually adding and removing works:

(node-62) [~] ip -6 addr add 2a01:4f8:bb:400::2/116 dev eth0
(node-62) [~] ip -6 addr show dev eth0
2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qlen 1000
    inet6 2a01:4f8:bb:400::2/116 scope global valid_lft forever preferred_lft forever
    inet6 2a01:4f8:bb:400:225:90ff:fe97:dbb0/64 scope global dynamic valid_lft 2591887sec preferred_lft 604687sec
    inet6 fe80::225:90ff:fe97:dbb0/64 scope link valid_lft forever preferred_lft forever
(node-62) [~] ip -6 addr del 2a01:4f8:bb:400::2/116 dev eth0

Nick, you can do the following things to resolve this:

- Hunt down the bug and fix it, or let someone else do it for you
- Use another netmask, if possible (fighting the symptoms instead of resolving the root cause)
- Write your own resource agent (fighting the symptoms instead of resolving the root cause)

Cheers,
        Thomas
Re: [Linux-HA] Heartbeat IPv6addr OCF
Is this the correct place to report bugs? https://github.com/ClusterLabs/resource-agents

Nick

On Sun, Mar 24, 2013 at 10:45 PM, Thomas Glanzmann tho...@glanzmann.de wrote:
> [full quote of the previous message snipped]
Re: [Linux-HA] Heartbeat/Pacemaker resource agent incorrect
On Thu, Dec 13, 2012 at 12:41:29AM +0100, Lars Marowsky-Bree wrote:
> On 2012-12-13T10:31:55, Andrew Beekhof and...@beekhof.net wrote:
> > > We once moved the ocf-shellfuncs file, which didn't work out here when
> > I thought we never did this sort of thing, because we don't know how
> > people are using our stuff externally.
> We did it in a backwards-compatible manner; or at least if the
> packagers choose to, they could symlink the old location to the new
> one. (That is the default for the included spec files, I think.)

Right. Whichever way some other RA may have used the shellfuncs, it would continue to work with the new package. That obviously needed to be supported. The old filenames started with '.', which set a precedent and was not well received by all distributions.

Thanks, Dejan

> So yes, we try hard to never break updating, and to provide migration
> over several releases. None of the functions changed names, all the
> variables are still there, we don't drop agent attributes, that kind of
> stuff. But copying in a new agent that has the new path embedded
> obviously doesn't work in the old environment.
>
> If you were trying to be snarky, I think this failed. ;-)
>
> Regards,
>     Lars
>
> -- Architect Storage/HA, SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
> "Experience is the name everyone gives to their mistakes." -- Oscar Wilde
Re: [Linux-HA] Heartbeat/Pacemaker resource agent incorrect
Am Montag, 10. Dezember 2012, 15:59:16 schrieb codey koble:
> To anyone who could possibly help. My current setup: 2 Ubuntu 10.04 LTS
> servers running heartbeat, pacemaker, apache, and mysql. Heartbeat and
> pacemaker are running great for my needs with one exception: currently
> both nodes are showing mysql as slaves. I have mysql configured in a
> master/slave setup and that is working great on its own. I noticed when
> I tried to promote one of the servers that an error occurred stating
> that ocf:heartbeat:mysql did not support the feature. I evaluated the
> script and realized it was an older version that did not contain any of
> the promote/demote code. I found the newest code for the script in the
> github repo and replaced the entire mysql file with the new code. Upon
> doing this it then gave an error stating that the ocf:heartbeat:mysql
> resource agent was not installed. My question would be: is there a
> simple way to update the script instead of manually replacing it like I
> did, or is there a way to get the code I changed working? Thanks in
> advance for any help!

It seems that you have three options:

1) Go back to the old script and use it as a primitive resource, not a Master/Slave resource.
2) Keep the new script and debug why it does not work in your environment. Perhaps some PATH is set wrong or some packages are not installed.
3) Upgrade to 12.04 LTS. This version should reflect recent developments in the cluster software.

Perhaps try option 2) first, but in the mid term go for option 3).

-- Dr. Michael Schwartzkopff, Guardinistr. 63, 81375 München, Tel: (0163) 172 50 98
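For option 2), a quick first step is to check whether the replaced agent actually advertises the promote/demote actions. A rough sketch; the agent path below is the typical resource-agents location but may differ on your system, and the RA variable is only there to make the check easy to point elsewhere:

```shell
# Look for promote/demote actions in the installed mysql resource agent.
RA="${RA:-/usr/lib/ocf/resource.d/heartbeat/mysql}"
if grep -qE 'promote|demote' "$RA" 2>/dev/null; then
    echo "agent advertises promote/demote"
else
    echo "no promote/demote found (old agent, wrong path, or unreadable)"
fi
```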
Re: [Linux-HA] Heartbeat/Pacemaker resource agent incorrect
On 12/10/2012 10:59 PM, codey koble wrote:
> [quote of the original problem description snipped]
> Upon doing this it then gave an error stating that the
> ocf:heartbeat:mysql resource agent was not installed.

Could you send the error message more precisely? Does the cluster tell you the RA is not installed (check path and file permissions), or does the LRM tell you that the RA itself returned an exit code of "not installed" (this would mean the RA does not find your mysql binaries/config/or whatever)?

> My question would be: is there a simple way to update the script instead
> of manually replacing it like I did, or is there a way to get the code I
> changed working? Thanks in advance for any help!
Re: [Linux-HA] Heartbeat/Pacemaker resource agent incorrect
On 2012-12-12T13:58:25, Fabian Herschel fabian.hersc...@arcor.de wrote:
> [quote of the original problem description and Fabian's questions snipped]

We once moved the ocf-shellfuncs file, which didn't work out here when only a single script is updated and not the whole package. I suggest upgrading the whole package and then investigating.

Mit freundlichen Grüßen,
    Lars

-- Architect Storage/HA, SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde
Re: [Linux-HA] Heartbeat/Pacemaker resource agent incorrect
On Thu, Dec 13, 2012 at 12:24 AM, Lars Marowsky-Bree l...@suse.com wrote:
> [earlier quotes snipped]
> We once moved the ocf-shellfuncs file, which didn't work out here when

I thought we never did this sort of thing, because we don't know how people are using our stuff externally.

> only a single script is updated and not the whole package. I suggest
> upgrading the whole package and then investigating.
>
> Mit freundlichen Grüßen,
>     Lars
>
> -- Architect Storage/HA, SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
> "Experience is the name everyone gives to their mistakes." -- Oscar Wilde
Re: [Linux-HA] Heartbeat/Pacemaker resource agent incorrect
On 2012-12-13T10:31:55, Andrew Beekhof and...@beekhof.net wrote:
> > We once moved the ocf-shellfuncs file, which didn't work out here when
> I thought we never did this sort of thing, because we don't know how
> people are using our stuff externally.

We did it in a backwards-compatible manner; or at least if the packagers choose to, they could symlink the old location to the new one. (That is the default for the included spec files, I think.)

So yes, we try hard to never break updating, and to provide migration over several releases. None of the functions changed names, all the variables are still there, we don't drop agent attributes, that kind of stuff. But copying in a new agent that has the new path embedded obviously doesn't work in the old environment.

If you were trying to be snarky, I think this failed. ;-)

Regards,
    Lars

-- Architect Storage/HA, SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde
Re: [Linux-HA] Heartbeat with Oracle's ASM
On 11/15/2012 05:00 AM, Hill Fang wrote:
> Hi friend: I want to know, does heartbeat support Oracle ASM now?

The heartbeat project has been deprecated for some time. There are no plans to continue its development. I am unsure of its supported state on Oracle, but regardless, I would advise you to plan on using corosync.

-- Digimer
Papers and Projects: https://alteeve.ca/w/
"What if the cure for cancer is trapped in the mind of a person without access to education?"
Re: [Linux-HA] Heartbeat with Oracle's ASM
There is an RA for Oracle that can be used with Pacemaker. Generally ASM behaves like a regular Oracle instance, so you can try it.

On Nov 15, 2012 8:57 AM, Hill Fang hill.f...@ericsson.com wrote:
> Hi friend: I want to know, does heartbeat support Oracle ASM now?
>
> HILL FANG, Engineer, Guangzhou Ericsson Communication Services Co., Ltd. (GTC), SI Support
> 2/F, NO. 1025 Gaopu Road, Tianhe Software Park, Tianhe District, Guangzhou, 510663, PR China
> Phone +86 020-85117631, Fax +86 020-29002699, SMS/MMS 15813329521
> hill.f...@ericsson.com, www.ericsson.com
Re: [Linux-HA] Heartbeat with Oracle's ASM
On 2012-11-15T10:00:21, Hill Fang hill.f...@ericsson.com wrote:
> Hi friend: I want to know, does heartbeat support Oracle ASM now?

No - and yes.

Oracle RAC (I assume that's the context for ASM?) does not tolerate any cluster solution except itself. That is not supported together with Pacemaker.

Pacemaker with the Oracle resource agent can manage a single-instance fail-over for Oracle, yes. That is supported. Postgres/MySQL too.

Regards,
    Lars

-- Architect Storage/HA, SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde
Re: [Linux-HA] Heartbeat not starting when both nodes are down
El 08/10/2012 20:56, Andreas Kurz escribió:
> On 10/08/2012 09:42 PM, Nicolás wrote:
> > [original problem description snipped]
> For a new cluster use Corosync and not Heartbeat, disable the DRBD init
> script and configure DRBD as a Pacemaker master-slave resource.

Thanks for this! Once I disabled the DRBD init script it worked as it should.

Regards,
Nicolás
Re: [Linux-HA] Heartbeat not starting when both nodes are down
El 28/09/2012 20:42, Nicolás escribió:
> Hi all! I'm new to this list. I've been looking for some info about
> this but I haven't found anything, so I'm trying this way. I've
> successfully configured a 2-node cluster with DRBD + Heartbeat +
> Pacemaker. It works as expected. The problem comes when both nodes are
> down. In that case, after powering on one of the nodes, I can see it
> configuring the network, but after this I never see the console for
> this machine. So I connect via SSH and realize that Heartbeat is not
> running. After I run it manually I can see the console for this node.
> This only happens when BOTH nodes are down. When just one is,
> everything goes right, as Heartbeat starts automatically on the
> powering-on node. I see nothing relevant in the logs; my conf is as
> follows:
>
> root@cluster1:~# cat /etc/ha.d/ha.cf | grep -e '^[^#]'
> logfacility local0
> ucast eth1 192.168.0.91
> ucast eth0 192.168.20.51
> auto_failback on
> node cluster1.gamez.es cluster2.gamez.es
> use_logd yes
> crm on
> autojoin none
>
> Any ideas on what I am doing wrong? Thanks a lot in advance.
>
> Nicolás

Any ideas with this? Thanks!
Re: [Linux-HA] Heartbeat not starting when both nodes are down
On 10/08/2012 09:42 PM, Nicolás wrote:
> [original problem description and ha.cf snipped]
> Any ideas on what I am doing wrong?

Looks like an enabled DRBD init script with default startup-timeout parameters... that script blocks until the peer is connected, or until a timeout -- by default forever (depending on some configuration parameters) -- or until manual confirmation on the console... and as heartbeat is typically last in the boot process, it is not (yet) started.

For a new cluster use Corosync and not Heartbeat, disable the DRBD init script and configure DRBD as a Pacemaker master-slave resource.

Regards,
Andreas

-- Need help with Pacemaker? http://www.hastexo.com/now

> Thanks a lot in advance.
> Nicolás
> Any ideas with this? Thanks!
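The advice above ("disable the DRBD init script and configure DRBD as a Pacemaker master-slave resource") could look roughly like this in the crm shell. A sketch only: the DRBD resource name r0, the resource IDs, and the monitor intervals are hypothetical and must be adapted to the actual setup.

```
# Stop the init script from handling DRBD at boot (Debian/Ubuntu style):
#   update-rc.d -f drbd remove
# Then let Pacemaker manage DRBD (crm shell):
primitive p_drbd_r0 ocf:linbit:drbd \
    params drbd_resource="r0" \
    op monitor interval="29s" role="Master" \
    op monitor interval="31s" role="Slave"
ms ms_drbd_r0 p_drbd_r0 \
    meta master-max="1" master-node-max="1" \
         clone-max="2" clone-node-max="1" notify="true"
```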
Re: [Linux-HA] heartbeat and n-to1 clusters
On Tue, Aug 7, 2012 at 1:42 AM, Andy Furtado awf...@yahoo.com wrote:
> Hello, is it possible to set up an n-to-1 cluster configuration and have
> heartbeat manage a different VIP for each virtual pair? The n-to-1
> configuration would have a single slave node, able to take over for any
> one of the failed N masters at a time.

You'll want pacemaker on top of heartbeat for that. http://www.clusterlabs.org

> In this configuration each master node would have a static ip addr and a
> VIP. When the master fails, the VIP for that master is configured on the
> slave node, and the slave acts as that master. Once the slave node is
> acting as a master, it remains in this state and cannot take over for
> another failed master until the original master node is restored and the
> original slave transitions back to the slave state.
>
> Example configuration:
>
> masternodeA: static ip 10.1.1.1, VIP 10.1.1.101
> masternodeB: static ip 10.1.1.2, VIP 10.1.1.102
> slavenode:   static ip 10.1.1.3
>
> If masternodeA fails, slavenode becomes active as masternodeA and is
> configured with VIP 10.1.1.101. If masternodeB then fails, there is no
> failover available, since slavenode is currently acting as masternodeA.
> When masternodeA is restored, slavenode releases VIP 10.1.1.101 and is
> ready again to take over for either masternodeA or masternodeB.
>
> I understand this is not an ideal failover solution, but one I must live
> with until further design can be done. I've searched the internet and
> the HA mailing lists without much success. Any info or input would be
> appreciated.
>
> Best Regards,
> Andy
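With pacemaker on top, the per-master VIPs described above could be expressed roughly as below (crm shell syntax). This is a sketch under assumptions: all resource IDs and scores are hypothetical, and the "only one takeover at a time" rule would still need additional constraints or utilization attributes on slavenode.

```
primitive vip_a ocf:heartbeat:IPaddr2 \
    params ip="10.1.1.101" cidr_netmask="24" \
    op monitor interval="10s"
primitive vip_b ocf:heartbeat:IPaddr2 \
    params ip="10.1.1.102" cidr_netmask="24" \
    op monitor interval="10s"
# Each VIP prefers its own master, with slavenode as the common fallback:
location l_vip_a_home vip_a 100: masternodeA
location l_vip_a_fb   vip_a  50: slavenode
location l_vip_b_home vip_b 100: masternodeB
location l_vip_b_fb   vip_b  50: slavenode
```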
Re: [Linux-HA] Heartbeat Error
On Fri, Aug 3, 2012 at 5:18 PM, Yount, William D yount.will...@menloworldwide.com wrote:
> I am using pacemaker and corosync. For some reason I keep getting this
> error in my messages log:
>
> ERROR: Cannot chdir to [/var/lib/heartbeat/cores/root]: No such file or directory
>
> Should I not worry about that, since I am using corosync and not heartbeat?

Pacemaker (until a few days ago) used these directories even when used with corosync. Best to create it.

> William
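"Best to create it" means something like the following sketch, using the path from the error message. The CORES variable is only there so the commands are easy to try outside /var/lib; the real path needs root.

```shell
# Create the per-user core-dump directory that older Pacemaker builds
# chdir into, even when running on corosync.
CORES="${CORES:-/var/lib/heartbeat/cores}"
mkdir -p "$CORES/root"
ls -ld "$CORES/root"
```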
Re: [Linux-HA] Heartbeat Error [Solved]
More recent versions will create the leaf directory for you when pacemaker starts.

On Fri, Aug 3, 2012 at 5:39 PM, Yount, William D yount.will...@menloworldwide.com wrote:
> I was able to fix the error by creating the directory manually.
> /var/lib/heartbeat/cores was already there; I just added root. Kind of
> an odd problem, though.
>
> [quote of the original error report snipped]
Re: [Linux-HA] Heartbeat Error [Solved]
I was able to fix the error by creating the directory manually. /var/lib/heartbeat/cores was already there; I just added root. Kind of an odd problem, though.

-----Original Message-----
From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On Behalf Of Yount, William D
Sent: Friday, August 03, 2012 2:18 AM
To: linux-ha@lists.linux-ha.org
Subject: [Linux-HA] Heartbeat Error

I am using pacemaker and corosync. For some reason I keep getting this error in my messages log:

ERROR: Cannot chdir to [/var/lib/heartbeat/cores/root]: No such file or directory

Should I not worry about that, since I am using corosync and not heartbeat?

William
Re: [Linux-HA] Heartbeat isn't switching to the 2nd node when Httpd is down!
On Tue, Jul 24, 2012 at 04:01:40PM +0100, Aboubakr Seddik Ouahabi wrote:
> Hey there, I've created a thread somewhere, but I guess this is the
> right place to seek help, so here is my issue as stated there:
>
> Ok guys, that was very much appreciated and I thank you again. For now
> I just want to get heartbeat to function as it should, and I don't want
> to create a whole new thread for it. As I said before, I have one
> public IP to access the server, and 2 nodes with 2 internal IPs, both
> connected using eth0. What I want exactly is: if either one of httpd or
> MySQL goes down, the second node should take control and the virtual IP
> shall be assigned to it until everything is in sync again; then the
> primary (or favored) node should take over again.
>
> Heartbeat is starting just fine, detecting the 2 nodes. Then I tried to
> shut down one of them and see what it would say:
>
> cl_status nodestatus node02
> dead
>
> And it found it was dead, but the failover isn't happening. I've tried:
>
> service httpd stop
>
> on node01, but it didn't switch anything to anything. So what have I
> been missing in my config? Here is the config I've tried in my ha.cf:
>
> # Logging
> debug 1
> use_logd true
> logfacility daemon
> # Misc Options
> traditional_compression off
> compression bz2
> coredumps true
> # Communications
> udpport 21xxx
> bcast eth0
> ucast eth0 10.25.45.81
> ucast eth0 10.25.45.82
> autojoin any
> # Thresholds (in seconds)
> keepalive 1
> warntime 6
> deadtime 10
> initdead 15
> crm respawn
> node node01
> node node02
>
> And I've tried 2 combinations for my cib.xml:

learn to use the crm shell, so much easier on the eyes...

> 1: Code: cib configuration

I think you are missing no-quorum-policy=ignore

-- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com
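Setting the property Lars mentions is a one-liner in the crm shell. A sketch: this is only appropriate for two-node clusters, where quorum can never be regained after one node fails, so without it the survivor refuses to run resources.

```
crm configure property no-quorum-policy=ignore
```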
Re: [Linux-HA] Heartbeat isn't switching to the 2nd node when Httpd is down!
On Wed, Jul 25, 2012 at 1:01 AM, Aboubakr Seddik Ouahabi ouaha...@gmail.com wrote:
> [problem description snipped: httpd/MySQL failover with one public IP,
> two nodes, and a virtual IP; heartbeat sees node02 as dead but no
> failover happens]

Is apache and mysql intended to be running on both machines at the same time?

Btw. haresources is not used for crm/pacemaker clusters.

> And here is the config I've tried in my ha.cf: [ha.cf snipped]
>
> And I've tried 2 combinations for my cib.xml:
>
> 1:
>
> <cib>
>   <configuration>
>     <crm_config/>
>     <nodes/>
>     <resources>
>       <group id="group_apache">
>         <primitive id="ipaddr" class="ocf" type="IPaddr" provider="heartbeat">
>           <instance_attributes id="ia_ipaddr">
>             <attributes>
>               <nvpair id="ia_ipaddr_ip" name="ip" value="91.xxx.xxx.xx"/>
>               <nvpair id="ia_ipaddr_nic" name="nic" value="eth0"/>
>               <nvpair id="ia_ipaddr_netmask" name="netmask" value="24"/>
>             </attributes>
>           </instance_attributes>
>         </primitive>
>         <primitive id="apache" class="ocf" type="apache" provider="heartbeat">
>           <instance_attributes id="ia_apache">
>             <attributes>
>               <nvpair id="ia_apache_configfile" name="configfile" value="/etc/httpd/conf/httpd.conf"/>
>             </attributes>
>           </instance_attributes>
>         </primitive>
>       </group>
>
> 2:
>
>       <group id="node01">
>         <primitive class="ocf" id="IP1" provider="heartbeat" type="IPaddr">
>           <operations>
>             <op id="IP1_mon" interval="10s" name="monitor" timeout="5s"/>
>           </operations>
>           <instance_attributes id="IP1_inst_attr">
>             <attributes>
>               <nvpair id="IP1_attr_0" name="ip" value="10.25.45.81"/>
>               <nvpair id="IP1_attr_1" name="netmask" value="255.255.255.0"/>
>               <nvpair id="IP1_attr_2" name="nic" value="eth0"/>
>             </attributes>
>           </instance_attributes>
>         </primitive>
>         <primitive class="lsb" id="httpd1" provider="heartbeat" type="httpd">
>           <operations>
>             <op id="jboss1_mon" interval="30s" name="monitor" timeout="20s"/>
>           </operations>
>         </primitive>
>       </group>
>       <group id="node02">
>         <primitive class="ocf" id="IP2" provider="heartbeat" type="IPaddr">
>           <operations>
>             <op id="IP2_mon" interval="10s" name="monitor" timeout="5s"/>
>           </operations>
>           <instance_attributes id="IP2_inst_attr">
>             <attributes>
>               <nvpair id="IP2_attr_0" name="ip" value="10.25.45.82"/>
>               <nvpair id="IP2_attr_1" name="netmask" value="255.255.255.0"/>
>               <nvpair id="IP2_attr_2" name="nic" value="eth0"/>
>             </attributes>
>           </instance_attributes>
>         </primitive>
>         <primitive class="lsb" id="httpd2" provider="heartbeat" type="httpd">
>           <operations>
>             <op id="jboss2_mon" interval="30s" name="monitor" timeout="20s"/>
>           </operations>
>         </primitive>
>       </group>
>     </resources>
>     <constraints>
>       <rsc_location id="location_server1" rsc="node01">
>         <rule id="best_location_server1" score="100">
>           <expression attribute="node01" id="best_location_server1_expr" operation="eq" value="10.25.45.81"/>
>         </rule>
>       </rsc_location>
>       <rsc_location id="location_server2" rsc="node02">
>         <rule id="best_location_server2" score="100">
>           <expression attribute="node02" id="best_location_server2_expr" operation="eq"
Re: [Linux-HA] Heartbeat over VPN
Hi, On Wed, Jul 11, 2012 at 04:24:42AM +0700, Nanang Purnomo wrote: I want to implement a failover cluster server with heartbeat, but the problem is that I use a VPN network. Can heartbeat be run through two different networks? Sure. Just make sure that the port is open and that the various parameters fit your network. Now, if it's a two-node cluster, you need a stonith solution which runs over another, independent medium. If that's not possible, you'll need an arbitrator at a third site. Thanks, Dejan I hope you can give me a solution, please. Best Regards, Nanang ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
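Dejan's points (open the heartbeat port; make the timing parameters fit the network) might look like the following ha.cf fragment. This is only a sketch: the tunnel interface name, peer addresses, port, and timings are assumptions, not values from the thread.

```
# heartbeat over a VPN: unicast to the peer's VPN address
udpport 694
ucast tun0 10.8.0.1    # node A's VPN address (ignored by its owner)
ucast tun0 10.8.0.2    # node B's VPN address (ignored by its owner)
# VPN links add latency and jitter; keep the timeouts generous
keepalive 2
warntime 10
deadtime 30
initdead 60
```

The deadtime/warntime values in particular should be tuned after watching the actual round-trip behavior of the VPN under load.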
Re: [Linux-HA] Heartbeat question about multiple services
On Friday, 20 April 2012 12:42:16, sgm wrote: Hi, I have a question about heartbeat: if I have three services, apache, mysql and sendmail, and apache is down, heartbeat will switch all the services to the standby server, right? That depends on the configuration - it is certainly possible ... If so, how do I configure heartbeat to avoid this? You can configure your 2 services (mysql and sendmail, for example) with colocation constraints, or as a group - there are many possibilities. Did you already RTFM (read the f... manuals)? Very Appreciated. gm HTH Nikita Michalko ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
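Nikita's two options can be sketched in CRM shell syntax. This assumes a Pacemaker-style CRM; the resource names are invented for illustration.

```
# Option 1: a group - members start in order and always move together,
# so a failure of apache takes mysql and sendmail along on failover:
group allservices apache mysql sendmail

# Option 2: separate primitives - with no constraints between them, a
# failure of apache moves only apache; to tie just two services
# together, a single colocation constraint is enough:
colocation mysql-with-sendmail inf: mysql sendmail
```

To get the behavior sgm asked about (apache failing over without dragging the other services along), the key is simply not to group or colocate apache with mysql and sendmail.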
Re: [Linux-HA] Heartbeat question about multiple services
On Fri, 20 Apr 2012 12:42:16 CEST, sgm wrote: Hi, I have a question about heartbeat: if I have three services, apache, mysql and sendmail, and apache is down, heartbeat will switch all the services to the standby server, right? If so, how to configure heartbeat to avoid this happening? Very Appreciated. gm You may want to start from here: http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratch/ -- RaSca Mia Mamma Usa Linux: Niente è impossibile da capire, se lo spieghi bene! ("Nothing is impossible to understand, if you explain it well!") ra...@miamammausalinux.org http://www.miamammausalinux.org ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat question about multiple services
On 4/20/2012 at 05:42 AM, sgm sgm...@yahoo.com.cn wrote: Hi, I have a question about heartbeat, if I have three services, apache, mysql and sendmail,if apache is down, heartbeat will switch all the services to the standby server, right? Maybe. It depends on how you have built and configured your cluster. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] heartbeat strange behavior
Thanks Lars.. problem solved. I changed the asterisk init script to be idempotent. Regards, Douglas On Wed, May 2, 2012 at 9:25 AM, Lars Ellenberg lars.ellenb...@linbit.comwrote: On Mon, Apr 30, 2012 at 01:52:05PM -0300, Douglas Pasqua wrote: Hi friends, I create a linux ha solution using 2 nodes: node-a and node-b. My /etc/ha.d/ha.cf: use_logd yes keepalive 1 deadtime 90 warntime 5 initdead 120 bcast eth6 node node-a node node-b crm off auto_failback off My /etc/ha.d/haresources node-a x.x.x.x/24 x.x.x.x/24 x.x.x.x/24 service1 service2 service3 I booted the two nodes together. node-a become master and node-b become slave. After, I booted the node-a. Then node-b become master. When node-a return from boot, it become slave, because *auto_failback is off* i think. All as expected until here. As the node-a as a slave, I decide to halt the node-a (using halt command). Then heartbeat in node-b go standby and my cluster was down. The virtual ips was down too. I expected the node-b stay on. Why did this happen ? Some log from node2: Apr 30 00:02:57 node-b heartbeat: [3082]: info: Received shutdown notice from 'node-a'. Apr 30 00:02:57 node-b heartbeat: [3082]: info: Resources being acquired from node-a. Apr 30 00:02:57 node-b heartbeat: [4414]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL Apr 30 00:02:57 node-b harc[4414]: [4428]: info: Running /etc/ha.d/rc.d/status status Apr 30 00:02:57 node-b heartbeat: [4416]: info: No local resources [/usr/share/heartbeat/ResourceManager listkeys node-b] to acquire. Apr 30 00:02:57 node-b heartbeat: [3082]: debug: StartNextRemoteRscReq(): child count 1 Apr 30 00:02:58 node-b ResourceManager[4462]: [4657]: debug: /etc/init.d/asterisk start done. RC=1 Apr 30 00:02:58 node-b ResourceManager[4462]: [4658]: ERROR: Return code 1 from /etc/init.d/asterisk Apr 30 00:02:58 node-b ResourceManager[4462]: [4659]: CRIT: Giving up resources due to failure of asterisk Because of the above error when starting asterisk. 
Maybe your asterisk init script is simply not idempotent. Maybe it is broken, or maybe there really was some problem trying to start asterisk. Apr 30 00:02:58 node-b ResourceManager[4462]: [4660]: info: Releasing resource group: node-a x.x.x.x/24 x.x.x.x/24 x.x.x.x/24 asterisk sincronismo notificacao Apr 30 00:02:58 node-b ResourceManager[4462]: [4670]: info: Running /etc/init.d/notificacao stop Apr 30 00:02:58 node-b ResourceManager[4462]: [4671]: debug: Starting /etc/init.d/notificacao stop Apr 30 00:02:58 node-b ResourceManager[4462]: [4694]: debug: /etc/init.d/notificacao stop done. RC=0 Apr 30 00:02:58 node-b ResourceManager[4462]: [4704]: info: Running /etc/init.d/sincronismo stop Apr 30 00:02:58 node-b ResourceManager[4462]: [4705]: debug: Starting /etc/init.d/sincronismo stop Apr 30 00:02:58 node-b ResourceManager[4462]: [4711]: debug: /etc/init.d/sincronismo stop done. RC=0 Apr 30 00:02:58 node-b ResourceManager[4462]: [4720]: info: Running /etc/init.d/asterisk stop Apr 30 00:02:58 node-b ResourceManager[4462]: [4721]: debug: Starting /etc/init.d/asterisk stop Apr 30 00:02:58 node-b ResourceManager[4462]: [4725]: debug: /etc/init.d/asterisk stop done. RC=0 Apr 30 00:02:58 node-b ResourceManager[4462]: [4741]: info: Running /etc/ha.d/resource.d/IPaddr x.x.x.x/24 stop Apr 30 00:02:58 node-b ResourceManager[4462]: [4742]: debug: Starting /etc/ha.d/resource.d/IPaddr x.x.x.x/24 stop Apr 30 00:03:29 node-b heartbeat: [3082]: info: node-b wants to go standby [foreign] Apr 30 00:03:39 node-b heartbeat: [3082]: WARN: No reply to standby request. Standby request cancelled. Apr 30 00:04:29 node-b heartbeat: [3082]: WARN: node node-a: is dead Apr 30 00:04:29 node-b heartbeat: [3082]: info: Dead node node-a gave up resources. Apr 30 00:04:29 node-b heartbeat: [3082]: info: Link node-a:eth6 dead. 
-- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
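Douglas fixed this by making the asterisk init script idempotent. A minimal sketch of what an idempotent "start" action means for a haresources-managed LSB script is below; the pid-file path and the daemon launch step are placeholders, not taken from the thread.

```shell
#!/bin/sh
# Sketch: an idempotent "start" action. Starting a service that is
# already running must succeed (exit 0) - otherwise heartbeat's
# ResourceManager sees RC != 0 and gives up the whole resource group,
# exactly as in the log above.
PIDFILE=${PIDFILE:-/var/run/mydaemon.pid}

start() {
    if [ -f "$PIDFILE" ] && kill -0 "$(cat "$PIDFILE")" 2>/dev/null; then
        echo "already running"
        return 0          # not an error: the desired state is reached
    fi
    # ... launch the real daemon here; this sketch records its own pid ...
    echo $$ > "$PIDFILE"
    echo "started"
}
```

The same rule applies to "stop": stopping an already-stopped service should also exit 0.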
Re: [Linux-HA] Heartbeat Failover Configuration Question
On 23 Apr 2012, at 02:23, Net Warrior wrote: auto_failback on No. As far as I'm aware this is to control what happens when your initial node recovers. If you have 2 nodes, a and b, and a is active, but then fails, b will take over, but when a is fixed and recovers, heartbeat will 'fail back' to a automatically if this property is on. You might want this if a is a faster/better server. Marcus -- Marcus Bointon Synchromedia Limited: Creators of http://www.smartmessages.net/ UK info@hand CRM solutions mar...@synchromedia.co.uk | http://www.synchromedia.co.uk/ ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat Failover Configuration Question
Hi, Net Warrior! What version of HA/Pacemaker do you use? Did you already RTFM - e.g. http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Pacemaker_Explained - or: http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratch HTH Nikita Michalko On Monday, 23 April 2012 02:23:20, Net Warrior wrote: Hi There I configured heartbeat to fail over an IP address; if I, for example, shut down one node, the other takes its IP address - so far so good. Now my doubt is whether there is a way to configure it not to make the failover automatically and have someone run the failover manually. Can you provide any configuration example, please? Is this stanza the one that does the magic? auto_failback on Thanks for your time and support Best regards ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat Failover Configuration Question
Hi Nikita This is the version heartbeat-3.0.0-0.7 My aim is: if node1 is powered off or loses its ethernet connection, node2 won't make the failover automatically - I want to make it manually, but I could not find how to accomplish that. Thanks for your time and support Best regards 2012/4/23, Nikita Michalko michalko.sys...@a-i-p.com: Hi, Net Warrior! What version of HA/Pacemaker do you use? Did you already RTFM - e.g. http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Pacemaker_Explained - or: http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratch HTH Nikita Michalko On Monday, 23 April 2012 02:23:20, Net Warrior wrote: Hi There I configured heartbeat to fail over an IP address; if I, for example, shut down one node, the other takes its IP address - so far so good. Now my doubt is whether there is a way to configure it not to make the failover automatically and have someone run the failover manually. Can you provide any configuration example, please? Is this stanza the one that does the magic? auto_failback on Thanks for your time and support Best regards ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat Failover Configuration Question
Why even use heartbeat then - just manually ifconfig the interface. On 4/23/12 7:39 AM, Net Warrior wrote: Hi Nikita This is the version heartbeat-3.0.0-0.7 My aim is: if node1 is powered off or loses its ethernet connection, node2 won't make the failover automatically - I want to make it manually, but I could not find how to accomplish that. Thanks for your time and support Best regards 2012/4/23, Nikita Michalko michalko.sys...@a-i-p.com: Hi, Net Warrior! What version of HA/Pacemaker do you use? Did you already RTFM - e.g. http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Pacemaker_Explained - or: http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratch HTH Nikita Michalko On Monday, 23 April 2012 02:23:20, Net Warrior wrote: Hi There I configured heartbeat to fail over an IP address; if I, for example, shut down one node, the other takes its IP address - so far so good. Now my doubt is whether there is a way to configure it not to make the failover automatically and have someone run the failover manually. Can you provide any configuration example, please? Is this stanza the one that does the magic? auto_failback on Thanks for your time and support Best regards ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat Failover Configuration Question
True, but even the most expensive software, like Veritas Cluster or Red Hat Cluster, lets me configure how I want to fail over the resources (auto or manual); hence my curiosity about accomplishing the same here. Thanks for your time Best Regards 2012/4/23, David Coulson da...@davidcoulson.net: Why even use heartbeat then - just manually ifconfig the interface. On 4/23/12 7:39 AM, Net Warrior wrote: Hi Nikita This is the version heartbeat-3.0.0-0.7 My aim is: if node1 is powered off or loses its ethernet connection, node2 won't make the failover automatically - I want to make it manually, but I could not find how to accomplish that. Thanks for your time and support Best regards 2012/4/23, Nikita Michalko michalko.sys...@a-i-p.com: Hi, Net Warrior! What version of HA/Pacemaker do you use? Did you already RTFM - e.g. http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Pacemaker_Explained - or: http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratch HTH Nikita Michalko On Monday, 23 April 2012 02:23:20, Net Warrior wrote: Hi There I configured heartbeat to fail over an IP address; if I, for example, shut down one node, the other takes its IP address - so far so good. Now my doubt is whether there is a way to configure it not to make the failover automatically and have someone run the failover manually. Can you provide any configuration example, please? Is this stanza the one that does the magic?
auto_failback on Thanks for your time and support Best regards ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat Failover Configuration Question
On 04/23/2012 01:47 PM, Net Warrior wrote: True, but even the most expensive software, like Veritas Cluster or Red Hat Cluster, lets me configure how I want to fail over the resources (auto or manual); hence my curiosity about accomplishing the same here. With the help of the meatware stonith plugin, a manual acknowledgement of the failover process is required. Regards, Andreas -- Need help with Pacemaker? http://www.hastexo.com/now Thanks for your time Best Regards 2012/4/23, David Coulson da...@davidcoulson.net: Why even use heartbeat then - just manually ifconfig the interface. On 4/23/12 7:39 AM, Net Warrior wrote: Hi Nikita This is the version heartbeat-3.0.0-0.7 My aim is: if node1 is powered off or loses its ethernet connection, node2 won't make the failover automatically - I want to make it manually, but I could not find how to accomplish that. Thanks for your time and support Best regards 2012/4/23, Nikita Michalko michalko.sys...@a-i-p.com: Hi, Net Warrior! What version of HA/Pacemaker do you use? Did you already RTFM - e.g. http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Pacemaker_Explained - or: http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratch HTH Nikita Michalko On Monday, 23 April 2012 02:23:20, Net Warrior wrote: Hi There I configured heartbeat to fail over an IP address; if I, for example, shut down one node, the other takes its IP address - so far so good. Now my doubt is whether there is a way to configure it not to make the failover automatically and have someone run the failover manually. Can you provide any configuration example, please? Is this stanza the one that does the magic?
auto_failback on Thanks for your time and support Best regards ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
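Andreas's suggestion can be sketched in CRM shell syntax. This is an illustration only: the resource name and hostlist are invented. The meatware plugin ships with cluster-glue; a fencing request through it blocks until an operator confirms it manually with meatclient, which provides the manual acknowledgement step.

```
# Meatware stonith resource: fencing waits for a human, who confirms
# with e.g.:  meatclient -c node1
primitive st-meat stonith:meatware \
    params hostlist="node1 node2"
clone fencing-clone st-meat
```

Note that this gates recovery (fencing) on an operator, which is the closest heartbeat/Pacemaker comes to the "manual failover" mode of Veritas or Red Hat Cluster.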
Re: [Linux-HA] heartbeat doesnt create the socket /var/run/heartbeat/register
The heartbeat I install is from debian packages. dpkg -l | grep heartbeat ii heartbeat 1:3.0.3-2~bpo50+1 Subsystem for High-Availability Linux ii libheartbeat2 1:3.0.3-2~bpo50+1 Subsystem for High-Availability Linux (libraries) version 3.0.2 I install the same packages and builds on all devices. I have an automatic installation. Some devices are installed ok and some suffers from the problem that the socket isn't created. Is there a way I can create the socket from outside heartbeat (from perl or bash)? I have a watchdog and I wish to create the socket automatically in case the socket doesn't exist. -Original Message- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On Behalf Of Lars Ellenberg Sent: Friday, January 20, 2012 8:48 PM To: linux-ha@lists.linux-ha.org Subject: Re: [Linux-HA] heartbeat doesnt create the socket /var/run/heartbeat/register On Thu, Jan 19, 2012 at 02:18:53PM +, Efrat Lefeber wrote: Hi, I am using linux-ha heartbeat on a two simple nodes cluster. For some reason which I can't figure out, the socket /var/run/heartbeat/register is not created though the directory /var/run/heartbeat/ exist: ll /var/run/heartbeat/ total 24 drwxr-x--- 6 hacluster haclient 4096 2012-01-19 14:30 . drwxr-xr-x 16 root root 4096 2012-01-19 14:30 .. drwxr-x--- 2 hacluster haclient 4096 2012-01-19 14:30 ccm drwxr-x--- 2 hacluster haclient 4096 2012-01-19 14:30 crm drwxr-x--- 2 hacluster haclient 4096 2012-01-19 14:30 dopd drwxr-xr-t 2 root root 4096 2012-01-19 14:30 rsctmp /etc/init.d/heartbeat status heartbeat OK [pid 14685 et al] is running on vs-158 [vs-158]... cl_status hbstatus Heartbeat is stopped on this machine. I ran cl_status with strace and I saw this error: connect(3, {sa_family=AF_FILE, path=/var/run/heartbeat/register...}, 110) = -1 ENOENT (No such file or directory) Who created this socket? 
That's one of the first things the heartbeat binary does when it starts; if it cannot create that socket, heartbeat will not even start up. Of course, in theory someone may remove that socket after it was created. If so, make sure that does not happen again ;) How can I find out why the socket isn't created? Where did you get your packages/binaries? Double-check your build? lsof -n -p <pid of your heartbeat master control process>? Is there a workaround I can do to create the socket? Fix your installation. This problem doesn't happen all the time. I have another node with the same configuration and the socket was created there. Same packages and build? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
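As Lars points out, the register socket is created by heartbeat itself and cannot usefully be created from outside it, so the most a watchdog can do is detect that the socket is missing and restart heartbeat. A shell sketch (the socket path is from the report; the restart action in the comment is a hypothetical example):

```shell
# Check whether heartbeat's registration socket exists and really is
# a socket (not a leftover regular file).
REGISTER_SOCK=${REGISTER_SOCK:-/var/run/heartbeat/register}

register_socket_ok() {
    # test -S is true only for a socket
    [ -S "$REGISTER_SOCK" ]
}

# Hypothetical watchdog use, e.g. from cron:
#   register_socket_ok || /etc/init.d/heartbeat restart
```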
Re: [Linux-HA] [Heartbeat][Pacemaker] VIP doesn't swith to other server
Hello Mathieu, On 11/17/2011 07:22 PM, SEILLIER Mathieu wrote: Hi all, I have to use Heartbeat with Pacemaker for High Availability between 2 Tomcat 5.5 servers under Linux RedHat 5.4. The first server is active, the other one is passive. The master is called servappli01, with IP address 186.20.100.81; the slave is called servappli02, with IP address 186.20.100.82. I configured a virtual IP 186.20.100.83. Tomcat is not launched when a server starts; it is Heartbeat that starts Tomcat when it is running. All seems to be OK, each server sees the other as active, and the crm_mon command shows this:

Last updated: Thu Nov 17 19:03:34 2011
Stack: Heartbeat
Current DC: servappli01 (bf8e9a46-8691-4838-82d9-942a13aeedca) - partition with quorum
Version: 1.0.11-1554a83db0d3c3e546cfd3aaff6af1184f79ee87
2 Nodes configured, 2 expected votes
2 Resources configured.

Online: [ servappli01 servappli02 ]

Clone Set: ClusterIPClone (unique)
    ClusterIP:0 (ocf::heartbeat:IPaddr2): Started servappli01
    ClusterIP:1 (ocf::heartbeat:IPaddr2): Started servappli02

You did not configure just a simple VIP but a cluster IP, which acts like a simple static load balancer ... man iptables ... search for CLUSTERIP. If this was not your intention, simply don't clone it. If you do want a cluster IP, you have to choose the correct meta attributes:

clone ClusterIPClone ClusterIP \
    meta globally-unique="true" clone-node-max="2" interleave="true"

Clone Set: TomcatClone (unique)
    Tomcat:0 (ocf::heartbeat:tomcat): Started servappli01
    Tomcat:1 (ocf::heartbeat:tomcat): Started servappli02

The 2 Tomcat servers are identical, and the same webapps are deployed on each server in order to be able to access the webapps on the other server if one is down. By default, requests from clients are processed by the first server because it's the master. My problem is that when I crash the Tomcat on the first server, requests from clients are not redirected to the second server.
For a while, requests are not processed; then Heartbeat restarts Tomcat itself and requests are processed again by the first server. Requests are never forwarded to the second Tomcat if the first is down. The default behavior on monitoring errors is a local restart. If you always test from the same IP, I would expect your requests to fail while Tomcat is not running on the node you are redirected to ... so if you choose the clusterip_hash sourceip-sourceport, your chance should be 50/50 to get redirected ... if you want a real load balancer, you might want to integrate a service like ldirectord with real-server checks to remove a non-working service from the load balancing. ... use "ip addr show" or define a label to see your VIP ... Regards, Andreas -- Need help with Pacemaker? http://www.hastexo.com/now Here is my configuration: ha.cf file (the same on each server):

crm respawn
logfacility local0
logfile /var/log/ha-log
debugfile /var/log/ha-debug
warntime 10
deadtime 20
initdead 120
keepalive 2
autojoin none
node servappli01
node servappli02
ucast eth0 186.20.100.81 # ignored by node1 (owner of ip)
ucast eth0 186.20.100.82 # ignored by node2 (owner of ip)

cib.xml file (the same on each server):

<?xml version="1.0" ?>
<cib admin_epoch="0" crm_feature_set="3.0.1" dc-uuid="bf8e9a46-8691-4838-82d9-942a13aeedca" epoch="127" have-quorum="1" num_updates="51" validate-with="pacemaker-1.0">
  <configuration>
    <crm_config>
      <cluster_property_set id="cib-bootstrap-options">
        <nvpair id="cib-bootstrap-options-dc-version" name="dc-version" value="1.0.11-1554a83db0d3c3e546cfd3aaff6af1184f79ee87"/>
        <nvpair id="cib-bootstrap-options-cluster-infrastructure" name="cluster-infrastructure" value="Heartbeat"/>
        <nvpair id="cib-bootstrap-options-expected-quorum-votes" name="expected-quorum-votes" value="2"/>
        <nvpair id="cib-bootstrap-options-no-quorum-policy" name="no-quorum-policy" value="ignore"/>
        <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
      </cluster_property_set>
    </crm_config>
    <nodes>
      <node id="489a0305-862a-4280-bce5-6defa329df3f" type="normal" uname="servappli01"/>
      <node id="bf8e9a46-8691-4838-82d9-942a13aeedca" type="normal" uname="servappli02"/>
    </nodes>
    <resources>
      <clone id="TomcatClone">
        <meta_attributes id="TomcatClone-meta_attributes">
          <nvpair id="TomcatClone-meta_attributes-globally-unique" name="globally-unique" value="true"/>
        </meta_attributes>
        <primitive class="ocf" id="Tomcat" provider="heartbeat" type="tomcat">
          <instance_attributes id="Tomcat-instance_attributes">
            <nvpair id="Tomcat-instance_attributes-tomcat_name" name="tomcat_name" value="TomcatSBNG"/>
            <nvpair
Re: [Linux-HA] heartbeat and squid
Hi, On Thu, Sep 01, 2011 at 06:30:46PM +0200, Nicolas Repentin wrote: Hi all, I've got a question about heartbeat. How can I do this: if squid stops or is killed on node1, how do I make node2 become master? Currently, node2 becomes master only when node1 is down, or when the heartbeat service on node1 is down; but if I kill squid, nothing happens. I'm using CentOS 6 and the latest heartbeat version. Using just heartbeat and no pacemaker? Only pacemaker has service monitoring. Thanks, Dejan Thanks a lot for your responses! -- Nicolas ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
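Dejan's point - only Pacemaker monitors services - would look roughly like this in CRM shell syntax. This is a sketch: the resource name and timings are invented, and it assumes squid is managed through an LSB init script at /etc/init.d/squid.

```
# A monitored squid resource: the recurring monitor notices a dead
# squid and triggers recovery (local restart first, then failover on
# repeated failure, depending on migration-threshold).
primitive p_squid lsb:squid \
    op monitor interval="30s" timeout="20s"
```

Plain heartbeat (haresources mode) only watches node and link liveness, which matches the behavior Nicolas observed: killing squid changes nothing.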
Re: [Linux-HA] Heartbeat Restart is not same as Stop and Start
Mike, I checked the permissions and those are fine. If you look at the restart script I have given below, it does not touch the heartbeat lock file (*touch $LOCKDIR/$SUBSYS*) when heartbeat is restarted, and I believe that is the problem. Isn't it? Btw, we have a product for a web application, and as part of it we allow administrators to configure servers as redundant servers; underneath we use Linux-HA to set up the redundancy. Rahul ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat Restart is not same as Stop and Start
Permission problem perhaps? Not really sure what you're doing, but the fact that you have users configuring the cluster (why do you do this, btw?) may be pointing to a permission issue. -mgb On 11-08-03 06:57 PM, Rahul Kanna wrote: Hi, Our system setup: Heartbeat 3.0.3, DRBD (to manage the file system; it is one of the resources managed by the CRM), Red Hat Linux, Pacemaker. We have built an application on top of Linux-HA for users to configure the cluster by giving the IP addresses of the nodes and to perform operations like restarting the system, changing host names, resolving split-brain scenarios, etc. In our application, we ran into a problem when we do a heartbeat restart for some operation and the user then does Restart System, which internally runs the command shutdown -r now. I believe this is due to the heartbeat LSB script, and I have explained the scenario below. Problem: In the heartbeat LSB script, restart neither removes nor touches the heartbeat lock file. On heartbeat start, the LSB script starts heartbeat and touches the lock file /var/lock/subsys/heartbeat. On heartbeat stop, the LSB script stops heartbeat and removes the lock file /var/lock/subsys/heartbeat. On heartbeat restart, the LSB script stops heartbeat and starts heartbeat, but DOES NOT remove or touch the lock file. We call heartbeat restart instead of heartbeat start from our script because we are not sure whether heartbeat is already running. So when heartbeat restart is called while heartbeat is NOT running, the LSB script tries to stop it (a no-op, since it is not running) and then just starts it, BUT after starting, the lock file is not touched (because restart skips that step). So now heartbeat is running (you can verify this by looking for the heartbeat process or with the heartbeat status command) but there is no /var/lock/subsys/heartbeat lock file. This lock file is what the init system uses to know which services it has to stop when the machine shuts down (shutdown -r now). 
When we run shutdown -r now, the init system thinks heartbeat is not running (because there is no lock file) and does not stop heartbeat properly. When the node comes back up, heartbeat is started but its state is not correct (because it was not stopped properly). Because of this, the node identifies itself as Primary even though the erstwhile Secondary node has become Primary in the meantime, and this causes split-brain. So I believe heartbeat restart should do exactly what heartbeat stop followed by heartbeat start does, which is not the case now. Can you please let me know if my understanding is correct and whether this is a bug in the heartbeat LSB script? Thanks for looking into it. I have given the relevant code from the heartbeat LSB script below.

File: /etc/init.d/heartbeat

  start)
        RunStartStop pre-start
        StartHA
        RC=$?
        echo
        if [ $RC -eq 0 ]
        then
                [ ! -d $LOCKDIR ] && mkdir -p $LOCKDIR
                touch $LOCKDIR/$SUBSYS
        fi
        RunStartStop post-start $RC
        ;;
  stop)
        RunStartStop pre-stop
        StopHA
        RC=$?
        echo
        if [ $RC -eq 0 ]
        then
                rm -f $LOCKDIR/$SUBSYS
        fi
        RunStartStop post-stop $RC
        ;;
  restart)
        sleeptime=`ha_parameter deadtime`
        StopHA
        echo
        echo -n "Waiting to allow resource takeover to complete:"
        sleep $sleeptime
        sleep 10 # allow resource takeover to complete (hopefully).
        echo_success
        echo
        StartHA
        echo
        ;;

___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
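One way to fix the behaviour described above (a sketch of the idea, not the shipped init script) is to make the restart case reuse the same stop/start paths, so the subsys lock file is removed and re-created exactly as it would be by separate stop and start calls. StartHA/StopHA are stubbed here, and /tmp/demo-lock stands in for /var/lock/subsys:

```shell
#!/bin/sh
# Sketch: restart implemented as stop-then-start so the lock file is
# always managed. StartHA/StopHA are stubs; the real script launches
# and stops the heartbeat daemons there.
LOCKDIR=/tmp/demo-lock
SUBSYS=heartbeat

StartHA() { :; }  # stub for the real start logic
StopHA()  { :; }  # stub for the real stop logic

do_start() {
    StartHA || return 1
    [ -d "$LOCKDIR" ] || mkdir -p "$LOCKDIR"
    touch "$LOCKDIR/$SUBSYS"       # lock file always created on start
}

do_stop() {
    StopHA || return 1
    rm -f "$LOCKDIR/$SUBSYS"       # lock file always removed on stop
}

do_restart() {
    do_stop
    sleep 1  # stand-in for the deadtime-based takeover wait in the original
    do_start
}

do_restart
ls -l "$LOCKDIR/$SUBSYS"
```

With this structure, calling restart while heartbeat is not running still leaves the lock file in place afterwards, so shutdown -r now stops heartbeat cleanly.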
Re: [Linux-HA] Heartbeat 3.0.3 stable version + RHEL 6.1: restart network will make heartbeat not send broadcasts
Hi: I'm using the Heartbeat 3.0.3 stable version on the RHEL 6.1 x64 platform, and found the following issue: if I restart the network service, heartbeat stops sending broadcast packets from port 694. That means the node never gets a chance to join the HA cluster again, short of restarting heartbeat. Details for setting up the cluster: 1. Compile heartbeat 3.0.3 from source and install it on 2 RHEL 6.1 x64 nodes: installer001 and rhel61. 2. Compile pacemaker 1.0.9 from source and install it on both nodes. 3. Configure /etc/ha.d/ha.cf and make sure both nodes show as Online in crm status. 4. Run tcpdump -i eth0 port 694; both nodes can be seen sending heartbeat broadcast packets.

Details of the configuration file:

  [root@rhel61 ~]# cat /etc/ha.d/ha.cf
  autojoin none
  bcast eth0
  warntime 5
  deadtime 15
  initdead 60
  keepalive 2
  node installer001
  node rhel61
  crm respawn

Then I restarted the network service on the backup node installer001 (or just ran ifdown eth0; ifup eth0). Node rhel61 immediately detected installer001 as offline, and installer001 detected rhel61 as offline. Running tcpdump -i eth0 port 694 on installer001 again shows rhel61 still sending broadcast packets, but no broadcast packets coming from installer001, even though the eth0 network is fully recovered by then. I tried exactly the same case on RHEL 5.6 (heartbeat 3.0.3) and it works well: after a network restart, the node can still send out broadcast packets. Thanks for your comments. --Lei ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] heartbeat three node configuration
On Thu, Jun 9, 2011 at 11:54 PM, Ricardo F ri...@hotmail.com wrote: What is the configuration to create a three-node cluster? Essentially you need Pacemaker on top. haresources-based clusters were only designed for 2 nodes. I have this, but the servers bring up the shared IP at the same time:

  ha.cf
  logfacility local0
  keepalive 2
  deadtime 10
  warntime 5
  initdead 30
  auto_failback off
  ucast bond0 host1 host2 host3
  node host1
  node host2
  node host3

  haresources
  host1 192.168.1.10/24/bond0

I use heartbeat 3.0.3 on Debian squeeze on all of the nodes; they all have the others' IPs in /etc/hosts, and I can propagate the configuration with ha_propagate. Thanks ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
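For reference, a minimal sketch (my addition, not Ricardo's actual file) of what the ha.cf looks like when moving a three-node cluster to CRM mode, as the reply suggests; with crm respawn enabled, Pacemaker manages the resources and the haresources file is ignored:

```shell
# /etc/ha.d/ha.cf -- sketch for a three-node CRM-mode cluster
logfacility local0
keepalive 2
deadtime 10
warntime 5
initdead 30
node host1
node host2
node host3
crm respawn   # hand resource management to Pacemaker;
              # /etc/ha.d/haresources is then no longer consulted
```

The resources (the shared IP among them) are then configured in Pacemaker's CIB instead of haresources, which is what allows more than two nodes.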
Re: [Linux-HA] heartbeat step down after split brain scenario
Hi - thanks for the response. Dimitri Maziuk wrote: What do you mean by disconnecting: what's your failure scenario and how do you expect it to be handled? The disconnection is the loss of the intersite link, which interrupts heartbeat comms. In this case it's expected that both sites will acquire the resources and become active. However, what I want is for one of the sites to give up the resources again when it sees that the other site is back up. Dimitri Maziuk wrote: Running daemons are not guaranteed (arguably, expected) to notice when the network cable is unplugged. You have to monitor the link and restart all processes that bind()/listen() on the interface. If your nodes are at different sites, you need to also deal with the loss of link at the switch, gateway, etc., and figure out which one is still connected to the Internet -- and gets to keep the VIP. Which in general can't be done from the nodes themselves. Yes - in this case neither site has to be connected to the internet; this is more an internal load-balancing act between two connected sites in a customer's network. What I found is that by setting auto_failback on in ha.cf at both sites, the site/node listed in haresources keeps the resources when the link is re-established, and the other site releases them. This is the result I was looking for. Regards Jack -- View this message in context: http://old.nabble.com/heartbeat-step-down-after-split-brain-scenario-tp31858728p31884521.html Sent from the Linux-HA mailing list archive at Nabble.com. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
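The arrangement Jack describes can be sketched like this (an assumed fragment matching his description, not his actual files; the node name and address are placeholders):

```shell
# /etc/ha.d/ha.cf on both sites (sketch)
auto_failback on          # the node named first in haresources reclaims
                          # its resources once heartbeat comms return

# /etc/ha.d/haresources, identical on both sites (sketch)
# site-a-node is the preferred owner and keeps/reclaims the VIP
site-a-node IPaddr::192.0.2.10/24/eth0
```

After a split-brain both sites hold the resources, but once the link returns, the non-preferred site releases them because auto_failback on designates the haresources node as the owner.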
Re: [Linux-HA] heartbeat step down after split brain scenario
On 06/16/2011 04:28 AM, Jack Berg wrote: I have a two node cluster using heartbeat and haproxy. Unfortunately it is impossible to provide redundant heartbeat paths between the two nodes at different sites so it is possible for a failure to cause split brain. To evaluate the impact I tried disconnecting the two nodes and I found that both become active and both try to keep the VIPs after the link is restored. What do you mean by disconnecting: what's your failure scenario and how do you expect it to be handled? Running daemons are not guaranteed (arguably, expected) to notice when the network cable is unplugged. You have to monitor the link and restart all processes that bind()/listen() on the interface. If your nodes are at different sites, you need to also deal with the loss of link at the switch, gateway, etc., and figure out which one is still connected to the Internet -- and gets to keep the VIP. Which in general can't be done from the nodes themselves. Dima -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] heartbeat sends udp to whole network
Hi, On Mon, May 23, 2011 at 03:18:37PM -0700, Nulgor Wankevitch wrote: hi, heartbeat seems to be sending UDP on port 694 to the whole network segment, Do you use ucast or bcast? With the latter, which is broadcast, it's of course expected. If it happens with the former, then you must have gremlins in your network. Thanks, Dejan not just the linked host, and it is getting blocked by the firewall; how can I limit it? Firewall: *UDP_IN Blocked* IN=eth0 OUT= MAC=ff:ff:ff:ff:ff:ff:00:22:19:21:f1:75:08:00 SRC=192.168.1.190 DST=192.168.1.255 LEN=246 TOS=0x00 PREC=0x00 TTL=64 ID=0 DF PROTO=UDP SPT=42414 DPT=694 LEN=226 Any help appreciated, thank you, nulgor ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
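If bcast has to stay, the noise can at least be limited at each host's firewall: accept UDP/694 only when it comes from the peer node, and drop the rest. A sketch only; 192.168.1.190 is taken from the log above and stands in for the peer's real address, and the rules are written to a file here rather than applied (applying them needs root, e.g. one iptables -A call per line):

```shell
#!/bin/sh
# Sketch: generate firewall rules that restrict heartbeat's UDP/694
# traffic to the cluster peer. PEER is an assumed address; adjust per node.
PEER=192.168.1.190

cat <<EOF > /tmp/ha-udp694.rules
-A INPUT -i eth0 -p udp --dport 694 -s $PEER -j ACCEPT
-A INPUT -i eth0 -p udp --dport 694 -j DROP
EOF
cat /tmp/ha-udp694.rules
```

Other hosts on the segment would apply only the DROP rule, so the broadcast stops showing up in their logs while the two cluster nodes still hear each other.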
Re: [Linux-HA] heartbeat sends udp to whole network
Hi, thanks for the reply. When I use ucast things do not seem to work: the nodes are able to bring up the VIP but not any services. When using bcast things seem to work correctly, but there is that broadcast problem. I would like to firewall the broadcast and isolate it to the local machine and the 2nd node; however I do not want to cause additional problems. Please advise, thanks. nulgor On 5/24/2011 1:52 AM, Dejan Muhamedagic wrote: Hi, On Mon, May 23, 2011 at 03:18:37PM -0700, Nulgor Wankevitch wrote: hi, heartbeat seems to be sending UDP on port 694 to the whole network segment, Do you use ucast or bcast? With the latter, which is broadcast, it's of course expected. If it happens with the former, then you must have gremlins in your network. Thanks, Dejan not just the linked host, and it is getting blocked by the firewall; how can I limit it? Firewall: *UDP_IN Blocked* IN=eth0 OUT= MAC=ff:ff:ff:ff:ff:ff:00:22:19:21:f1:75:08:00 SRC=192.168.1.190 DST=192.168.1.255 LEN=246 TOS=0x00 PREC=0x00 TTL=64 ID=0 DF PROTO=UDP SPT=42414 DPT=694 LEN=226 any help, thank you, nulgor ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] heartbeat sends udp to whole network
Hi, On Tue, May 24, 2011 at 02:12:12AM -0700, Nulgor Wankevitch wrote: Hi, thanks for the reply; when I use ucast things do not seem to work: the nodes are able to bring up the VIP but not any services. When using bcast things seem to work correctly Wow! You really do have gremlins somewhere. ucast cannot fail in just the way you described: either the nodes can communicate or they can't. Did you set the right IP address of the peer? Otherwise there must be some kind of network setup issue. Thanks, Dejan but there is that broadcast problem; I would like to firewall the broadcast and isolate it to the local machine and the 2nd node, however I do not want to cause additional problems. Please advise, thanks. nulgor On 5/24/2011 1:52 AM, Dejan Muhamedagic wrote: Hi, On Mon, May 23, 2011 at 03:18:37PM -0700, Nulgor Wankevitch wrote: hi, heartbeat seems to be sending UDP on port 694 to the whole network segment, Do you use ucast or bcast? With the latter, which is broadcast, it's of course expected. If it happens with the former, then you must have gremlins in your network. Thanks, Dejan not just the linked host, and it is getting blocked by the firewall; how can I limit it? Firewall: *UDP_IN Blocked* IN=eth0 OUT= MAC=ff:ff:ff:ff:ff:ff:00:22:19:21:f1:75:08:00 SRC=192.168.1.190 DST=192.168.1.255 LEN=246 TOS=0x00 PREC=0x00 TTL=64 ID=0 DF PROTO=UDP SPT=42414 DPT=694 LEN=226 any help, thank you, nulgor ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] heartbeat sends udp to whole network
ya, gremlins, very reassuring, thanks. On 5/24/2011 2:42 AM, Dejan Muhamedagic wrote: Hi, On Tue, May 24, 2011 at 02:12:12AM -0700, Nulgor Wankevitch wrote: Hi, thanks for the reply; when I use ucast things do not seem to work: the nodes are able to bring up the VIP but not any services. When using bcast things seem to work correctly Wow! You really do have gremlins somewhere. ucast cannot fail in just the way you described: either the nodes can communicate or they can't. Did you set the right IP address of the peer? Otherwise there must be some kind of network setup issue. Thanks, Dejan but there is that broadcast problem; I would like to firewall the broadcast and isolate it to the local machine and the 2nd node, however I do not want to cause additional problems. Please advise, thanks. nulgor On 5/24/2011 1:52 AM, Dejan Muhamedagic wrote: Hi, On Mon, May 23, 2011 at 03:18:37PM -0700, Nulgor Wankevitch wrote: hi, heartbeat seems to be sending UDP on port 694 to the whole network segment, Do you use ucast or bcast? With the latter, which is broadcast, it's of course expected. If it happens with the former, then you must have gremlins in your network. Thanks, Dejan not just the linked host, and it is getting blocked by the firewall; how can I limit it? Firewall: *UDP_IN Blocked* IN=eth0 OUT= MAC=ff:ff:ff:ff:ff:ff:00:22:19:21:f1:75:08:00 SRC=192.168.1.190 DST=192.168.1.255 LEN=246 TOS=0x00 PREC=0x00 TTL=64 ID=0 DF PROTO=UDP SPT=42414 DPT=694 LEN=226 any help, thank you, nulgor ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] heartbeat sends udp to whole network
On 05/24/2011 05:48 AM, Nulgor Wankevitch wrote: ya, gremlins, very reassuring, thanks. If the broadcast packets from host A are seen by host B, and unicast packets from host A to host B are not seen by host B, then your universe is governed by laws of physics we here are completely unfamiliar with. Sometimes we call them gremlins. HTH Dima -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] heartbeat sends udp to whole network
I think you guys might have jumped the gun on me; why would you assume it is not seen? I reported that it brings up the VIP but not the services. nulgor On 5/24/2011 9:37 AM, Dimitri Maziuk wrote: On 05/24/2011 05:48 AM, Nulgor Wankevitch wrote: ya, gremlins, very reassuring, thanks. If the broadcast packets from host A are seen by host B, and unicast packets from host A to host B are not seen by host B, then your universe is governed by laws of physics we here are completely unfamiliar with. Sometimes we call them gremlins. HTH Dima ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] heartbeat sends udp to whole network
On 05/24/2011 02:56 PM, Nulgor Wankevitch wrote: I think you guys might have jumped the gun on me, why would you assume it is not seen? I reported it will bring up the VIP but not the services. The only way I can vaguely imagine that possibly happening is if cib isn't propagated to the other node(s) due to, indeed, a problem with comms channel. However, I can think of only one way to make that happen over unicast but not broadcast: unicasting to a wrong host. Dima -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
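Dima's "unicasting to a wrong host" hunch is easy to check mechanically: list the ucast targets in ha.cf and try to reach each one. A sketch only; it creates a sample ha.cf in /tmp for illustration (with 127.0.0.1 as a stand-in target), whereas on a real node you would point CF at /etc/ha.d/ha.cf:

```shell
#!/bin/sh
# Sketch: extract the ucast targets from an ha.cf and ping each one.
# CF points at a generated sample here; use /etc/ha.d/ha.cf on a real node.
CF=/tmp/sample-ha.cf
OUT=/tmp/ucast-check.out
cat <<'EOF' > "$CF"
ucast eth0 127.0.0.1
EOF
: > "$OUT"
# The third field of each "ucast" line is the peer address.
awk '$1 == "ucast" { print $3 }' "$CF" | while read -r ip; do
    if ping -c 1 -W 1 "$ip" >/dev/null 2>&1; then
        echo "$ip reachable" >> "$OUT"
    else
        echo "$ip NOT reachable" >> "$OUT"
    fi
done
cat "$OUT"
```

A target that is unreachable, or that is not actually the peer node's address, would explain heartbeat comms half-working over ucast.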
Re: [Linux-HA] heartbeat sends udp to whole network
It seems like the cib is on both nodes, as I am able to view it from crm_mon on both, and crm configure show shows the same info; am I correct? On 5/24/2011 2:02 PM, Dimitri Maziuk wrote: On 05/24/2011 02:56 PM, Nulgor Wankevitch wrote: I think you guys might have jumped the gun on me, why would you assume it is not seen? I reported it will bring up the VIP but not the services. The only way I can vaguely imagine that possibly happening is if the cib isn't propagated to the other node(s) due to, indeed, a problem with the comms channel. However, I can think of only one way to make that happen over unicast but not broadcast: unicasting to a wrong host. Dima ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] heartbeat sends udp to whole network
On Tue, May 24, 2011 at 02:10:25PM -0700, Nulgor Wankevitch wrote: it seems like cib is on both nodes as I am able to view both from crm_mon and crm configure show shows the same info, am I correct? This does not lead anywhere. You complained that broadcast broadcasts. Well, that's the nature of it. Then use unicast. But unicast does not work for me. Some talk about gremlins... Let's skip that. So: why does unicast seem not to work for you? Maybe provide logs? E.g. an hb_report from starting up the nodes configured with unicast, up to them bringing up some, but not all, of the resources? And then we go from there. BTW, you can ask heartbeat directly what it thinks about its comm channels:

  for node in $(cl_status listnodes); do
      for link in $(cl_status listhblinks $node); do
          linkstatus=$(cl_status hblinkstatus $node $link)
          printf "%s\t%s\t%s\n" "$node" "$link" "$linkstatus"
      done
  done

We should add a pretty-print-all-known-link-states option to cl_status... -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat kills itself
On 05/05/2011 11:45 AM, Lacoco, Joshua wrote: Hello, I have a 2-node cluster on RHEL 5.4. I am currently only running the heartbeat service on one node, because the heartbeat service kills itself and I'm trying to avoid downtime/split-brain issues. I've searched and found posts with similar problems. I am running heartbeat 3.0.2-1. Below are the same messages I am getting (from a different post). Does anyone know if this is a known issue, or can point me in the right direction? I'm stumped. Hello. I had a similar problem on RHEL 5.4 with heartbeat 2.3 and heartbeat 3 (I don't remember the exact software version); the only thing that fixed it was to download a recent kernel version and substitute it for the original one. Hope this helps. Andrea. -- Andrea Bertucci Aitek S.p.A. - Via della Crocetta, 15 - I-16122 Genova tel.: +39 010 846731 fax: +39 010 8467350 - e-mail: abertu...@aitek.it --- This e-mail and any files transmitted with it are confidential and intended solely for the use of the individual to whom it is addressed. If you have received this email in error please send it back. Unauthorized publication, use, disclosure, forwarding, printing or copying of this email and its associated attachments is strictly prohibited. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] [Heartbeat] my VIP doesn't work :(
On 11-04-22 06:25 AM, SEILLIER Mathieu wrote: Hi all, First, I'm French, so apologies in advance for my English... I have to use Heartbeat for high availability between 2 Tomcat 5.5 servers under Linux RedHat 5.3. The first server is active, the other one is passive. The master is called servappli01, with IP address 186.20.100.40; the slave is called servappli02, with IP address 186.20.100.39. I configured a virtual IP, 186.20.100.41. Tomcat is not launched when a server boots; it is Heartbeat that starts Tomcat when it is running. My problem is: when heartbeat is started on the first server, then on the second server, the VIP is assigned to both servers! Also, Tomcat is started on each server, and each node sees the other node as dead! Here is my configuration.

ha.cf file (the same on each server):

  logfile /var/log/ha-log
  debugfile /var/log/ha-debug
  logfacility none
  keepalive 2
  warntime 6
  deadtime 10
  initdead 90
  bcast eth0
  node servappli01 servappli02
  auto_failback yes
  respawn hacluster /usr/lib/heartbeat/ipfail
  apiauth ipfail gid=haclient uid=hacluster

haresources file (the same on each server):

  servappli01 IPaddr::186.20.100.41/24/eth0 tomcat

Result of the ifconfig command on the first server (servappli01):

  eth0   Link encap:Ethernet  HWaddr 00:1E:0B:BB:C2:38
         inet adr:186.20.100.40  Bcast:186.20.100.255  Masque:255.255.255.0
         adr inet6: fe80::21e:bff:febb:c238/64 Scope:Lien
         UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
         RX packets:14404996 errors:0 dropped:0 overruns:0 frame:0
         TX packets:6580505 errors:0 dropped:0 overruns:0 carrier:0
         collisions:0 lg file transmission:1000
         RX bytes:385833 (3.5 GiB)  TX bytes:2694953468 (2.5 GiB)
         Interruption:177 Memoire:fa00-fa012100

  eth0:0 Link encap:Ethernet  HWaddr 00:1E:0B:BB:C2:38
         inet adr:186.20.100.41  Bcast:186.20.100.255  Masque:255.255.255.0
         UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
         Interruption:177 Memoire:fa00-fa012100

Result of the ifconfig command on the second server (servappli02) at the same time:

  eth0   Link encap:Ethernet  HWaddr 00:1E:0B:77:C9:0C
         inet adr:186.20.100.39  Bcast:186.20.100.255  Masque:255.255.255.0
         adr inet6: fe80::21e:bff:fe77:c90c/64 Scope:Lien
         UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
         RX packets:23815049 errors:0 dropped:0 overruns:0 frame:0
         TX packets:17441845 errors:0 dropped:0 overruns:0 carrier:0
         collisions:0 lg file transmission:1000
         RX bytes:2620027933 (2.4 GiB)  TX bytes:3595896739 (3.3 GiB)
         Interruption:177 Memoire:fa00-fa012100

  eth0:0 Link encap:Ethernet  HWaddr 00:1E:0B:77:C9:0C
         inet adr:186.20.100.41  Bcast:186.20.100.255  Masque:255.255.255.0
         UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
         Interruption:177 Memoire:fa00-fa012100

Result of the /usr/bin/cl_status listnodes command (on each server): servappli02 servappli01. Result of /usr/bin/cl_status nodestatus servappli01 on servappli01: active. Result of /usr/bin/cl_status nodestatus servappli02 on servappli01: dead. Result of /usr/bin/cl_status nodestatus servappli01 on servappli02: dead. Result of /usr/bin/cl_status nodestatus servappli02 on servappli02: active. And of course, if I kill Tomcat on the master server, there is no switch to the second server (a call to a webapp using the VIP doesn't work). Can somebody help me please? I guess there is something wrong but I don't know what! Thanx Mathieu ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems It almost sounds like the nodes are unaware of each other. Could be a network thing maybe. Here are some things to try: Can you ssh or ping one node from the other? Bring up one node with the VIP running - leave the other node up but heartbeat down. Can you ping the VIP from the node NOT running HA? What happens when you look at the cluster when both nodes are running - use the crm_mon command and paste what you see in here. I'm thinking you have some sort of network issue. 
___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
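The checks suggested in the reply above can be collected into one script. A sketch only: the peer hostname and the VIP are the ones from this thread and stand in for your own, and a missing command or unreachable host is logged rather than aborting the run:

```shell
#!/bin/sh
# Sketch: run the suggested connectivity/cluster checks and log results.
# servappli02 and 186.20.100.41 are the thread's example peer and VIP.
LOG=/tmp/ha-checks.log
: > "$LOG"
run() { echo "== $*" >> "$LOG"; "$@" >> "$LOG" 2>&1 || echo "FAILED: $*" >> "$LOG"; }

run ping -c 1 -W 1 servappli02    # can this node reach its peer?
run ping -c 1 -W 1 186.20.100.41  # does the VIP answer?
run cl_status listnodes           # does heartbeat know both nodes?
run crm_mon -1                    # what does the cluster itself report?
cat "$LOG"
```

If the peer ping fails while both nodes are up, the "network thing" suspicion is confirmed before any heartbeat debugging is needed.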
Re: [Linux-HA] [Heartbeat] my VIP doesn't work :(
Have you generated the authkey with the corosync-keygen command on one node and then copied that file to the other node? -Original Message- From: linux-ha-boun...@lists.linux-ha.org [mailto:linux-ha-boun...@lists.linux-ha.org] On Behalf Of mike Sent: Tuesday, April 26, 2011 5:41 PM To: General Linux-HA mailing list Subject: Re: [Linux-HA] [Heartbeat] my VIP doesn't work :( On 11-04-22 06:25 AM, SEILLIER Mathieu wrote: Hi all, First I'm french so sorry in advance for my English... I have to use Heartbeat for High Availability between 2 Tomcat 5.5 servers under Linux RedHat 5.3. The first server is active, the other one is passive. The master is called servappli01, with IP address 186.20.100.40, the slave is called servappli02, with IP address 186.20.100.39. I configured a virtual IP 186.20.100.41. Each Tomcat is not launched when server is started, this is Heartbeat which starts Tomcat when it's running. My problem is : When heartbeat is started on the first server, then on the second server, the VIP is assigned to the 2 servers ! also, Tomcat is started on each server, and each node see the other node as dead ! 
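For heartbeat itself (as opposed to corosync, which the reply mentions), the shared authentication key lives in /etc/ha.d/authkeys; it must be identical on both nodes and readable only by root. A minimal sketch, with a placeholder secret:

```shell
# /etc/ha.d/authkeys -- must be mode 0600 and identical on both nodes
auth 1
1 sha1 SomeSharedSecret
```

If the authkeys files differ between the nodes, each node rejects the other's packets and both see each other as dead, which matches the symptom described in this thread.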
Here is my configuration : ha.cf file (the same on each server) : logfile /var/log/ha-log debugfile /var/log/ha-debug logfacility none keepalive 2 warntime 6 deadtime 10 initdead 90 bcast eth0 node servappli01 servappli02 auto_failback yes respawn hacluster /usr/lib/heartbeat/ipfail apiauth ipfail gid=haclient uid=hacluster haresources file (the same on each server) : servappli01 IPaddr::186.20.100.41/24/eth0 tomcat Result of ifconfig command on the first server (servappli01) : eth0 Link encap:Ethernet HWaddr 00:1E:0B:BB:C2:38 inet adr:186.20.100.40 Bcast:186.20.100.255 Masque:255.255.255.0 adr inet6: fe80::21e:bff:febb:c238/64 Scope:Lien UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:14404996 errors:0 dropped:0 overruns:0 frame:0 TX packets:6580505 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 lg file transmission:1000 RX bytes:385833 (3.5 GiB) TX bytes:2694953468 (2.5 GiB) Interruption:177 Memoire:fa00-fa012100 eth0:0Link encap:Ethernet HWaddr 00:1E:0B:BB:C2:38 inet adr:186.20.100.41 Bcast:186.20.100.255 Masque:255.255.255.0 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 Interruption:177 Memoire:fa00-fa012100 Result of ifconfig command on the second server (servappli02) at the same time : eth0 Link encap:Ethernet HWaddr 00:1E:0B:77:C9:0C inet adr:186.20.100.39 Bcast:186.20.100.255 Masque:255.255.255.0 adr inet6: fe80::21e:bff:fe77:c90c/64 Scope:Lien UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:23815049 errors:0 dropped:0 overruns:0 frame:0 TX packets:17441845 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 lg file transmission:1000 RX bytes:2620027933 (2.4 GiB) TX bytes:3595896739 (3.3 GiB) Interruption:177 Memoire:fa00-fa012100 eth0:0Link encap:Ethernet HWaddr 00:1E:0B:77:C9:0C inet adr:186.20.100.41 Bcast:186.20.100.255 Masque:255.255.255.0 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 Interruption:177 Memoire:fa00-fa012100 Result of /usr/bin/cl_status listnodes command (on each server) : servappli02 servappli01 
Result of /usr/bin/cl_status nodestatus servappli01 on servappli01: active. Result of /usr/bin/cl_status nodestatus servappli02 on servappli01: dead. Result of /usr/bin/cl_status nodestatus servappli01 on servappli02: dead. Result of /usr/bin/cl_status nodestatus servappli02 on servappli02: active. And of course, if I kill Tomcat on the master server, there is no switch to the second server (a call to a webapp using the VIP doesn't work). Can somebody help me please? I guess there is something wrong but I don't know what! Thanx Mathieu ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems It almost sounds like the nodes are unaware of each other. Could be a network thing maybe. Here are some things to try: Can you ssh or ping one node from the other? Bring up one node with the VIP running - leave the other node up but heartbeat down. Can you ping the VIP from the node NOT running HA? What happens when you look at the cluster when both nodes are running - use the crm_mon command and paste what you see in here. I'm thinking you have some sort of network issue. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http