Unless openais has some way to run without multicast, that's my only alternative.
Well the other-other alternative is to run the app without clustering and devise some sort of duct tape and hot glue HA system :-) -- Robert Borkowski On Mon, Dec 14, 2009 at 2:53 AM, Fabio M. Di Nitto <[email protected]>wrote: > Binding over tun devices might be useful, but be aware of several > different gotchas: > > - MTU is not ethernet size (and it愀 not constant. vtun uses 50 bytes > for its own header - irrelevant to corosync - others might use different > size. this could affect certain opeartions) > - tun implementation. vtun, for example, adds latency that could be > relevant for cluster operations (the amount depends on the plugins > loaded - crypto, compression and so on). > - queues handling. vtun for example, in certain conditions, will block > the application when writing to the network socket. I don愒 believe this > is desirable vs dropping packets (expected behaviour?). > > so is it really worth the troubles to be able to bind to tunnels? > > Just 2c... > > Fabio > > Steven Dake wrote: > > The binding code may not support binding to tuns without modification. > > > > I'll have a look this week. > > > > Regards > > -steve > > > > On Sun, 2009-12-13 at 12:08 -0500, Robert Borkowski wrote: > >> Hello, > >> > >> > >> Is there any way to get openais/corosync working on Amazon EC2? > >> Multicast is not permitted there... > >> What I'd like to set up is a two node cluster. > >> > >> > >> My current attempt to get this working is to set up vtun tunnels > >> between the two nodes. vtun is supposed to be able to tunnel > >> multicast. > >> The two nodes have 192.168.1.1 and 192.168.1.2 on their tun0 > >> interfaces respectively, and I'm able to pass traffic through the > >> tunnel. > >> > >> > >> This is failing right now because totem won't bind to the tun0 > >> address. > >> On the first node I tried setting bindnetaddr to 192.168.1.0 and > >> 192.168.1.1. In both cases debugging indicates 'network interface is > >> down' and totem binding to 127.0.0.1. > >> Strangely enough when I configure it to bind on 192.168.1.2 it does > >> bind, but obviously that's wrong and doesn't work. > >> > >> > >> The OS is Ubuntu hardy heron. I tried the openais out of the heron > >> repo (0.82-3ubuntu2), and built corosync from the karmic source repo > >> (1.0.0-5ubuntu1). > >> Both behave the same way. > >> > >> > >> Any pointers? > >> > >> > >> > >> > >> # ifconfig tun0 > >> tun0 Link encap:UNSPEC HWaddr > >> 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00 > >> inet addr:192.168.1.1 P-t-P:192.168.1.2 > >> Mask:255.255.255.255 > >> UP POINTOPOINT RUNNING NOARP MULTICAST MTU:1450 Metric:1 > >> RX packets:11 errors:0 dropped:0 overruns:0 frame:0 > >> TX packets:11 errors:0 dropped:0 overruns:0 carrier:0 > >> collisions:0 txqueuelen:500 > >> RX bytes:924 (924.0 B) TX bytes:924 (924.0 B) > >> > >> > >> # egrep -v '#|^$' /etc/corosync/corosync.conf > >> totem { > >> version: 2 > >> token: 3000 > >> token_retransmits_before_loss_const: 10 > >> join: 60 > >> consensus: 1500 > >> vsftype: none > >> max_messages: 20 > >> clear_node_high_bit: yes > >> secauth: off > >> threads: 0 > >> rrp_mode: none > >> interface { > >> ringnumber: 0 > >> bindnetaddr: 192.168.1.0 > >> mcastaddr: 226.94.1.1 > >> mcastport: 5405 > >> } > >> } > >> amf { > >> mode: disabled > >> } > >> service { > >> ver: 0 > >> name: pacemaker > >> } > >> aisexec { > >> user: root > >> group: root > >> } > >> logging { > >> fileline: off > >> to_stderr: yes > >> to_logfile: no > >> to_syslog: yes > >> syslog_facility: daemon > >> debug: on > >> timestamp: on > >> logger_subsys { > >> subsys: AMF > >> debug: on > >> tags: enter|leave|trace1|trace2|trace3|trace4|trace6 > >> } > >> } > >> > >> > >> > >> > >> > >> > >> # corosync -f > >> Dec 13 12:00:06 corosync [MAIN ] Corosync Cluster Engine ('trunk'): > >> started and ready to provide service. > >> Dec 13 12:00:06 corosync [MAIN ] Successfully read main configuration > >> file '/etc/corosync/corosync.conf'. > >> Dec 13 12:00:06 corosync [TOTEM ] Token Timeout (3000 ms) retransmit > >> timeout (294 ms) > >> Dec 13 12:00:06 corosync [TOTEM ] token hold (225 ms) retransmits > >> before loss (10 retrans) > >> Dec 13 12:00:06 corosync [TOTEM ] join (60 ms) send_join (0 ms) > >> consensus (1500 ms) merge (200 ms) > >> Dec 13 12:00:06 corosync [TOTEM ] downcheck (1000 ms) fail to recv > >> const (50 msgs) > >> Dec 13 12:00:06 corosync [TOTEM ] seqno unchanged const (30 rotations) > >> Maximum network MTU 1500 > >> Dec 13 12:00:06 corosync [TOTEM ] window size per rotation (50 > >> messages) maximum messages per rotation (20 messages) > >> Dec 13 12:00:06 corosync [TOTEM ] send threads (0 threads) > >> Dec 13 12:00:06 corosync [TOTEM ] RRP token expired timeout (294 ms) > >> Dec 13 12:00:06 corosync [TOTEM ] RRP token problem counter (2000 ms) > >> Dec 13 12:00:06 corosync [TOTEM ] RRP threshold (10 problem count) > >> Dec 13 12:00:06 corosync [TOTEM ] RRP mode set to none. > >> Dec 13 12:00:06 corosync [TOTEM ] heartbeat_failures_allowed (0) > >> Dec 13 12:00:06 corosync [TOTEM ] max_network_delay (50 ms) > >> Dec 13 12:00:06 corosync [TOTEM ] HeartBeat is Disabled. To enable set > >> heartbeat_failures_allowed > 0 > >> Dec 13 12:00:06 corosync [TOTEM ] Initializing transmit/receive > >> security: libtomcrypt SOBER128/SHA1HMAC (mode 0). > >> Dec 13 12:00:06 corosync [TOTEM ] Receive multicast socket recv buffer > >> size (288000 bytes). > >> Dec 13 12:00:06 corosync [TOTEM ] Transmit multicast socket send > >> buffer size (262142 bytes). > >> Dec 13 12:00:06 corosync [TOTEM ] The network interface is down. > >> Dec 13 12:00:06 corosync [TOTEM ] Created or loaded sequence id > >> 20.127.0.0.1 for this ring. > >> Dec 13 12:00:06 corosync [TOTEM ] entering GATHER state from 15. > >> Dec 13 12:00:06 corosync [SERV ] Service failed to load 'pacemaker'. > >> Dec 13 12:00:06 corosync [SERV ] Service initialized 'corosync > >> extended virtual synchrony service' > >> Dec 13 12:00:06 corosync [SERV ] Service initialized 'corosync > >> configuration service' > >> Dec 13 12:00:06 corosync [SERV ] Service initialized 'corosync > >> cluster closed process group service v1.01' > >> Dec 13 12:00:06 corosync [SERV ] Service initialized 'corosync > >> cluster config database access v1.01' > >> Dec 13 12:00:06 corosync [SERV ] Service initialized 'corosync > >> profile loading service' > >> Dec 13 12:00:06 corosync [MAIN ] Compatibility mode set to > >> whitetank. Using V1 and V2 of the synchronization engine. > >> Dec 13 12:00:06 corosync [TOTEM ] Creating commit token because I am > >> the rep. > >> Dec 13 12:00:06 corosync [TOTEM ] Saving state aru 0 high seq received > >> 0 > >> Dec 13 12:00:06 corosync [TOTEM ] Storing new sequence id for ring 18 > >> Dec 13 12:00:06 corosync [TOTEM ] entering COMMIT state. > >> Dec 13 12:00:06 corosync [TOTEM ] got commit token > >> Dec 13 12:00:06 corosync [TOTEM ] entering RECOVERY state. > >> Dec 13 12:00:06 corosync [TOTEM ] position [0] member 127.0.0.1: > >> Dec 13 12:00:06 corosync [TOTEM ] previous ring seq 20 rep 127.0.0.1 > >> Dec 13 12:00:06 corosync [TOTEM ] aru 0 high delivered 0 received flag > >> 1 > >> Dec 13 12:00:06 corosync [TOTEM ] Did not need to originate any > >> messages in recovery. > >> Dec 13 12:00:06 corosync [TOTEM ] got commit token > >> Dec 13 12:00:06 corosync [TOTEM ] Sending initial ORF token > >> Dec 13 12:00:06 corosync [TOTEM ] token retrans flag is 0 my set > >> retrans flag0 retrans queue empty 1 count 0, aru 0 > >> > >> > >> > >> > >> > >> > >> > >> > >> -- > >> Robert Borkowski > >> > >> _______________________________________________ > >> Openais mailing list > >> [email protected] > >> https://lists.linux-foundation.org/mailman/listinfo/openais > > > > _______________________________________________ > > Openais mailing list > > [email protected] > > https://lists.linux-foundation.org/mailman/listinfo/openais > >
_______________________________________________ Openais mailing list [email protected] https://lists.linux-foundation.org/mailman/listinfo/openais
