Re: [Linux-HA] Help: can't find IPv6addr in heartbeat 3.0.4 package

2017-03-24 Thread Dimitri Maziuk
PS. sorry, it's the resource-agents rpm, not EPEL, so keep in mind redhat may update it and overwrite your custom build if you're not careful. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] Heartbeat packages for Redhat-7

2015-04-04 Thread Dimitri Maziuk
On 2015-04-03 17:03, Lars Ellenberg wrote: You can use heartbeat 3.0.6 (if you only use haresources mode). You can google for ticket # but basically epel heartbeat maintainer replied to my rfa with I don't use heartbeat anymore so no. I meant to post that here but forgot. So there is no

Re: [Linux-HA] Announcing the Heartbeat 3.0.6 Release

2015-02-10 Thread Dimitri Maziuk
On 02/10/2015 03:24 PM, Lars Ellenberg wrote: ... After 3½ years since the last officially tagged release of Heartbeat, I have seen the need to do a new maintenance release. Yay! Thank you Lars. - heartbeat.service file for systemd platforms RFA submitted to EPEL-7. -- Dimitri Maziuk

Re: [Linux-HA] heartbeat

2015-01-20 Thread Dimitri Maziuk
the latter, I think you want pacemaker, not heartbeat. Dunno what pacemaker might have for monitoring an IB connection, with heartbeat R1 I'd do something like grep for LinkUp in the output of ibstat. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu

Re: [Linux-HA] Support for DRDB

2015-01-16 Thread Dimitri Maziuk
! replicated iscsi block device, so ... -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux

Re: [Linux-HA] Support for DRDB

2015-01-16 Thread Dimitri Maziuk
shine on top of a 10Gb network. Or better, three 10Gb networks. DRBD works just fine over a crossover piece of cat-5e. Again, my understanding only, I could be wrong. Oh, it's a pure speculation on my part. Any resemblance to the actual RedHat is purely coincidental and all that. ;) -- Dimitri

Re: [Linux-HA] Antw: Re: Q: ping (ocf:pacemaker:ping) from specific address?

2014-09-04 Thread Dimitri Maziuk
On 09/04/2014 09:23 AM, Ulrich Windl wrote: 2) You can use an IP address as interface for -I 3), but you cannot use a hostname (resolving to the same IP address as in 2)): ping: unknown iface hostname Uhmm... ping -I `host hostname | awk '{print $NF;}'` ? -- Dimitri Maziuk Programmer

Re: [Linux-HA] Application level HA using heartbeat.. ??

2014-08-01 Thread Dimitri Maziuk
scripts, use new corosync/pacemaker hotness instead of old and busted heartbeat/haresources. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA

Re: [Linux-HA] getting proper sources

2014-06-07 Thread Dimitri Maziuk
, please provide a counter-example to prove (in mathematical sense) they're full of it. I promise I won't feel slighted. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] getting proper sources

2014-06-02 Thread Dimitri Maziuk
On 06/02/2014 03:42 PM, Jay G. Scott wrote: On Thu, May 29, 2014 at 12:43:49PM -0500, Dimitri Maziuk wrote: What do you intend to run in HA mode? 3. bind/named/dns, possible some fortran programs. Uhmm... why not just run 2 nameservers? -- Dimitri Maziuk Programmer/sysadmin BioMagResBank

Re: [Linux-HA] getting proper sources

2014-05-29 Thread Dimitri Maziuk
. What do you intend to run in HA mode? -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http

Re: [Linux-HA] getting proper sources

2014-05-28 Thread Dimitri Maziuk
. Not sure what RH is doing with their channels this week, so... presumably yum list \*pace\* should work? -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] two node cluster with postfix - how to get system mails from both nodes

2014-01-23 Thread Dimitri Maziuk
On 01/23/2014 11:14 AM, Christian Richter wrote: Hello, i'm looking for the right way to integrate postfix in my 2 node cluster. The right way is don't. Read e.g. http://serverfault.com/questions/303554/how-to-build-a-high-availability-postfix-system -- Dimitri Maziuk Programmer

Re: [Linux-HA] does heartbeat 3.0.4 use IP aliases under CentOS 6.5?

2014-01-04 Thread Dimitri Maziuk
On 1/4/2014 10:39 AM, Lars Marowsky-Bree wrote: On 2014-01-03T20:56:42, Digimer li...@alteeve.ca wrote: causing a lot of reinvention of the wheel. In the last 5~6 years, both teams have been working hard to unify under one common open-source HA stack. Pacemaker + corosync v2+ is the result of

Re: [Linux-HA] does heartbeat 3.0.4 use IP aliases under CentOS 6.5?

2014-01-03 Thread Dimitri Maziuk
-effect of RedHat going all entrerprisey with OpenStack (RDO) and having to bite the pacemaker bullet as a result. I'm told heartbeat resource scripts have been just wrappers around pacemaker's resource-agents for some time -- but RedHat's packages haven't been touched in years. Until now. -- Dimitri

Re: [Linux-HA] FYI: resource-agents-3.9.2-40.el6.x86_64 kills heartbeat-3.0.4

2013-11-28 Thread Dimitri Maziuk
On 2013-11-27 20:15, Jefferson Ogata wrote: It's nicer, however, when Red Hat takes a conservative position with the Tech Preview. They could have shipped a minimal set of resource agents in the first place, so people would have a better idea what they had to provide on their own end, instead

[Linux-HA] FYI: resource-agents-3.9.2-40.el6.x86_64 kills heartbeat-3.0.4

2013-11-27 Thread Dimitri Maziuk
if I wait just a little bit longer I can upgrade to ceph and openstack -- or retire, whichever comes first ;) -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] FYI: resource-agents-3.9.2-40.el6.x86_64 kills heartbeat-3.0.4

2013-11-27 Thread Dimitri Maziuk
On 11/27/2013 06:29 PM, Jefferson Ogata wrote: On 2013-11-28 00:12, Dimitri Maziuk wrote: Hey, upgrading to pacemaker wouldn't necessarily help. Red Hat broke that last month by dropping most of the resource agents they'd initially shipped. (Don't you love Technology Previews?) Dunn about

Re: [Linux-HA] Installing Heartbeat 3.0.5 in RHEL 6.3

2013-09-18 Thread Dimitri Maziuk
6.4) -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux

Re: [Linux-HA] Master/Slave status check using crm_mon

2013-06-12 Thread Dimitri Maziuk
. Analogy: don't build a skyscraper, do a log cabin. If you have to do 2.1.4, use haresources mode. You'll need to do your own monitoring (use mon). -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] Master/Slave status check using crm_mon

2013-06-12 Thread Dimitri Maziuk
On 06/12/2013 05:27 PM, Andrew Beekhof wrote: Yes, but haresources doesn't support the master/slave concept does it. So not very good advice. You could argue master+slave is the only concept it does support. ;) -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http

Re: [Linux-HA] Antw: Re: Q: sbd using libxslt in SLES11 SP2?

2013-05-03 Thread Dimitri Maziuk
On 05/03/2013 12:24 AM, Ulrich Windl wrote: I wonder how they could fly to the moon without XML ;-) They couldn't. That's why they had to film the whole thing on a sekret base in Arizona. Kids these days... -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http

Re: [Linux-HA] DRBD NetworkFailure

2013-04-23 Thread Dimitri Maziuk
) pings going over the crossover cable as well, so I don't specifically monitor that link or drbd status. I do monitor eth0 (for 'link detected' in the output of ethtool). I also have nagios checking drbd for UpToDate/UpToDate, but that's not part of the cluster. HTH -- Dimitri Maziuk Programmer

Re: [Linux-HA] Multiple instances of heartbeat

2013-03-15 Thread Dimitri Maziuk
On 3/14/2013 11:15 AM, Alberto Alonso wrote: That's what I thought. The emails from 2009 seemed to indicate that it was possible to run multiple instances. I've always had difficulties with the concept: the way I see it if your hardware fails you want *all* your 200+ services moved. If you

Re: [Linux-HA] Multiple instances of heartbeat

2013-03-15 Thread Dimitri Maziuk
200 services independently sounds like a new project. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org

Re: [Linux-HA] Multiple instances of heartbeat

2013-03-15 Thread Dimitri Maziuk
are crashing all the time during normal operation. If I needed a solution for that, HA would be fairly low on my list of things to look at. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] Multiple instances of heartbeat

2013-03-15 Thread Dimitri Maziuk
to know. ;-) (I'd still argue that a full vm solution should have less maintenance overhead in the long run -- or at least it looks that way now.) -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] Multiple instances of heartbeat

2013-03-15 Thread Dimitri Maziuk
anything that can deal with i/o. I recently spent a couple of weeks poking at ceph, it doesn't live up to the sales brochure either... I expect if you can roll out a dedicated 10GBe network for your iscsis you might get usable i/o speeds. :( -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW

Re: [Linux-HA] Multiple instances of heartbeat

2013-03-14 Thread Dimitri Maziuk
On 3/14/2013 9:44 AM, GGS (linux ha) wrote: ... I always laugh when people talk about having hundreds or thousands of servers, because switching to a stack model and proper utilization of hardware resources can save a ton of money. The flip side is when caps on the mobo go dry you lose

Re: [Linux-HA] achieve failover time less then 1 sec

2013-03-06 Thread Dimitri Maziuk
(default)[1960]: info: Running /etc/ha.d/resource.d/IPaddr 144.92.167.233 start ... 15:53:52 cuttlefish /usr/lib/ocf/resource.d//heartbeat/IPaddr(IPaddr_144.92.167.233)[2267]: INFO: Success -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc

Re: [Linux-HA] 'Tie-breaker' facility and quorum/membership question

2013-01-23 Thread Dimitri Maziuk
On 1/22/2013 10:31 PM, Digimer wrote: A stray iptables rule would knock you out without dropping the link layer. Presumably it wasn't there when you configured, tested, and burned in the node. Tie-breaker ain't gonna help with gremlins living in the cluster, self-modifying firewalls, or

Re: [Linux-HA] 'Tie-breaker' facility and quorum/membership question

2013-01-23 Thread Dimitri Maziuk
On 1/22/2013 7:39 PM, David Lang wrote: I've also had network connectivity restored (switch got rebooted, someone noticed a loose cable and plugged it in, etc) What I would do would be to look into defining a ping node that could be used as the tie-breaker (but that ping node needs to be HA

Re: [Linux-HA] 'Tie-breaker' facility and quorum/membership question

2013-01-21 Thread Dimitri Maziuk
On 1/20/2013 7:09 PM, Alex Sudakar wrote: I'm setting up a very basic two-node active/passive cluster using Pacemaker 1.1.7 and Corosync 1.4.1 under Red Hat Enterprise Linux 6.3. The cluster is running a web application that needs to be accessed by our separate LAN of desktops. With only

Re: [Linux-HA] possibly silly configuration - comments please

2012-12-27 Thread Dimitri Maziuk
On 12/26/2012 10:17 PM, Miles Fidelman wrote: Does this make sense, or is it totally crazy? Simple stupid is usually the best. Twisty maze of little layers of indirection tends to be fragile and unmaintainable. Over here 95% of downtime is caused by maintenance reboots (kernel/libc upgrades)

Re: [Linux-HA] Antw: Re: Q: NFS cross mounting

2012-12-21 Thread Dimitri Maziuk
On 12/20/2012 2:31 PM, Lars Marowsky-Bree wrote: On 2012-12-20T10:50:11, Dimitri Maziuk dmaz...@bmrb.wisc.edu wrote: Well, they'll be wrong unless there's a way nfs-export an unmounted filesystem. A reasonable question would be why not symlink mount point - export point ;) That'd miss

Re: [Linux-HA] Antw: Re: Q: NFS cross mounting

2012-12-20 Thread Dimitri Maziuk
On 12/20/2012 1:55 AM, Ulrich Windl wrote: Lars Marowsky-Bree l...@suse.com schrieb am 19.12.2012 um 18:29 in Nachricht 20121219172929.gg29...@suse.de: On 2012-12-19T10:59:12, Ulrich Windl ulrich.wi...@rz.uni-regensburg.de wrote: Unfortunately there was an update to SLES11 SP2 release

Re: [Linux-HA] master/slave drbd resource STILL will not failover

2012-12-05 Thread Dimitri Maziuk
, but ISTR there were problems with using the same interface for drbd and pacemaker. You're not doing that by any chance? -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] master/slave drbd resource STILL will not failover

2012-12-05 Thread Dimitri Maziuk
ordering a full stop. But in the latter case you could probably find the stop action in the RA and replace it with (e.g.) logger 'AIE ***I did not want this***' and then see what gets logged. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu

Re: [Linux-HA] Some questions about High Availability

2012-12-03 Thread Dimitri Maziuk
On 12/03/2012 12:03 PM, Rodrigo Abrantes Antunes wrote: Simple setup: put it all on one filesystem and put that on DRBD. See http://www.drbd.org/users-guide-8.3/s-heartbeat-r1.html[1] Also recommended: drbdlinks. HTH -- Dimitri Maziuk Programmer/sysadminBioMagResBank

Re: [Linux-HA] Some questions about High Availability

2012-12-03 Thread Dimitri Maziuk
as part of the process, but that takes extra hardware. I was thinking about rsync trought heartbeat's network interface to sincronize them too, without drbd, what do you think? Drbd is easier and safer. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu

Re: [Linux-HA] Some questions about High Availability

2012-11-30 Thread Dimitri Maziuk
. HTH -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux

Re: [Linux-HA] Antw: Re: pcs or crmsh?

2012-11-16 Thread Dimitri Maziuk
: a copyright lawyer. IMO -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org

Re: [Linux-HA] Question about linux cluster choice for Nagios Mysql

2012-11-15 Thread Dimitri Maziuk
can go a lot with a few lines of perl to make sure you don't get double the e-mail. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing

Re: [Linux-HA] Question about linux cluster choice for Nagios Mysql

2012-11-15 Thread Dimitri Maziuk
On 11/15/2012 02:03 PM, Dimitri Maziuk wrote: Apologies for bad cun-n-paste: Where'd security products come from? Do you mean you nagios+mysql setup is doing some sort of security monitoring? The good thing about heartbeat is it's not being developed anymore. So what you've learned about

Re: [Linux-HA] Antw: Re: cib_replace failed?

2012-11-14 Thread Dimitri Maziuk
, all there is is localhost and localhost6. (And I think RHEL5 install scripts may or may not put it there dep. on the install mode: DVD vs netboot or something.) *Using the hostname* is against best practices. Reading it from /etc/hosts is an automatic F on unix network programming. -- Dimitri

Re: [Linux-HA] Antw: Re: cib_replace failed?

2012-11-14 Thread Dimitri Maziuk
and phase of the moon. None of my RH6 machines have the hostname in /etc/hosts anymore anyway, all there is is localhost and localhost6. (And I think RHEL5 install scripts may or may not put it there dep. on the install mode: DVD vs netboot or something.) -- Dimitri Maziuk I only have 30

Re: [Linux-HA] Antw: Re: cib_replace failed?

2012-11-14 Thread Dimitri Maziuk
On 11/14/2012 11:56 AM, Dimitri Maziuk wrote: I have 29 centos 6 servers and workstations and none of them has it. They're all a) installed off netinst, b) had valid dns records for their ips at install time, and c) I run system-config-network first thing (after 'selinux off' + reboot

Re: [Linux-HA] Antw: Re: pcs or crmsh?

2012-11-14 Thread Dimitri Maziuk
owners/operators but in surprisingly many respects aren't). -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux

Re: [Linux-HA] cib_replace failed?

2012-11-02 Thread Dimitri Maziuk
to wonder if the cause of problem is your systems are bogged down by iowait resulting from all that logging and are e.g. dropping packets. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] resource monitor timeout, Killing with signal SIGTERM (15).

2012-10-24 Thread Dimitri Maziuk
crm mode. The difference now is that haresources option is much harder to find. So we don't know what OP's doing and why. Again the right answer here is not seriously. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP

Re: [Linux-HA] resource monitor timeout, Killing with signal SIGTERM (15).

2012-10-24 Thread Dimitri Maziuk
PS. but for the most part, like you said: you *have* people stuck on 2.1.4 and you keep supporting them much as you hate it. I don't think OP had any indication of it being a new deployment or an old system they're stuck with, so, yes. Seriously. -- Dimitri Maziuk Programmer/sysadmin

Re: [Linux-HA] resource monitor timeout, Killing with signal SIGTERM (15).

2012-10-24 Thread Dimitri Maziuk
On 10/24/2012 02:22 PM, Lars Marowsky-Bree wrote: On 2012-10-24T13:23:09, Dimitri Maziuk dmaz...@bmrb.wisc.edu wrote: PS. but for the most part, like you said: you *have* people stuck on 2.1.4 and you keep supporting them much as you hate it. Yes, but on SLES10, that was an actually

Re: [Linux-HA] resource monitor timeout, Killing with signal SIGTERM (15).

2012-10-19 Thread Dimitri Maziuk
will be in production phase until 2017. People will be stuck with 2.1.4 until 2017. Get used to it already. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] OS System update in live cluster ?

2012-09-05 Thread Dimitri Maziuk
be sure your service will start on the recently upgraded node with the old data version. I think if your requirements are zero downtime and a database engine that's not compatible with itself, your only option is to not upgrade. Otherwise one or the other has to go. HTH -- Dimitri Maziuk

Re: [Linux-HA] Leave Apache running on both active and passive nodes?

2012-08-22 Thread Dimitri Maziuk
and obscure variables in them is one of the reasons I'm sticking to heartbeat-r1 in the first place. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] Leave Apache running on both active and passive nodes?

2012-08-22 Thread Dimitri Maziuk
here too. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo

Re: [Linux-HA] IP Clone

2012-08-21 Thread Dimitri Maziuk
is for when ldirectord is too hard. One is for when multiple servers won't all reply at once because only one of them is visible to the reachable network, the other has a fixed rule that decides which server answers which clients. And so on. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW

Re: [Linux-HA] Leave Apache running on both active and passive nodes?

2012-08-21 Thread Dimitri Maziuk
another ip address is added to the system. So once your ip's migrated you need to restart apache (kill -HUP may do it, I'm not sure). In general if you want to do that you also want to clone the ip -- currently being discussed in the other thread, take a look. -- Dimitri Maziuk Programmer/sysadmin

Re: [Linux-HA] Leave Apache running on both active and passive nodes?

2012-08-21 Thread Dimitri Maziuk
.) -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also

Re: [Linux-HA] Leave Apache running on both active and passive nodes?

2012-08-21 Thread Dimitri Maziuk
scenario easily. Me too, only s/most/all/. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http

Re: [Linux-HA] IP Clone

2012-08-20 Thread Dimitri Maziuk
On 08/20/2012 04:19 PM, Yount, William D wrote: No ideas? You lost me at I would like the IP address to run on both servers at the same time -- IME pacemaker not letting you do that is a feature. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu

Re: [Linux-HA] IP Clone

2012-08-20 Thread Dimitri Maziuk
really help you there. I don't understand how it could be called an Active/Active cluster if you aren't allowed to run the IP address on two servers at once. You are not allowed to run the IP address on two servers at once, full stop. Complain to Rob Kahn and Vint Cerf. -- Dimitri Maziuk

Re: [Linux-HA] IP Clone

2012-08-20 Thread Dimitri Maziuk
On 8/20/2012 7:32 PM, Andrew Beekhof wrote: On Tue, Aug 21, 2012 at 8:49 AM, Dimitri Maziuk dmaz...@bmrb.wisc.edu wrote: You are not allowed to run the IP address on two servers at once, full stop. Complain to Rob Kahn and Vint Cerf. Thats not strictly true. In the same way it's

Re: [Linux-HA] Antw: Re: open-iscsi won't automatically log in to set up SBD device

2012-08-09 Thread Dimitri Maziuk
are running inside messing with software installs and configs, then of course you don't want autostart. - If your main concern is hardware failure and main reason for reboot: a manual os update, then you probably want autostart. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http

Re: [Linux-HA] DRBD and automatic sync

2012-08-03 Thread Dimitri Maziuk
? If you're on 3.x why not use ceph and ditch the whole drbd/pacemaker thing altogether? It can't be worse than ext3 on dual-primary drbd using haresources mode. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital

Re: [Linux-HA] Fencing

2012-08-02 Thread Dimitri Maziuk
at the switch and let it sit until someone comes in to kick it. Assuming you don't have multiple redundant NICs, you do have the someone, and so on and so forth. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP

Re: [Linux-HA] Pacemaker and software RAID using shared storage.

2012-07-20 Thread Dimitri Maziuk
On 7/20/2012 4:42 AM, Caspar Smit wrote: Hi Dimitri, I got some test results for you. I built the setup as described (two servers with LSI 9200-8e cards, One Supermicro 847E26-RJBOD1, connected expander 1 to server 1 and expander 2 to server 2). When I plug in a SAS disk both servers see

Re: [Linux-HA] HA for NFS without DRBD

2012-07-11 Thread Dimitri Maziuk
-BEGIN PGP SIGNED MESSAGE- Hash: SHA256 dj mko...@gmail.com wrote: Greetings! I am new to setting up Linux Clusters. I am setting up a two node HA cluster (RHEL 5.8) for Postgres. We like to have NFS mount fail over from node 1 to 2. My understanding is that DRBD needs shared DISK we

Re: [Linux-HA] DRBD/Pacemaker/Heartbeat

2012-06-06 Thread Dimitri Maziuk
? In my R1 clusters I have a mon script that greps for link detected: yes in the output of ethtool. Obviously, if both go off (e.g. the switch loses power), that's gonna screw things up, but then I'll have bigger problems anyway. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http

Re: [Linux-HA] DRBD-8.4

2012-05-20 Thread Dimitri Maziuk
On 5/20/2012 7:54 AM, Willi Fehler wrote: Hi all, I would like to upgrade my cluster to DRBD-8.4.1. Currently my cluster is using DRBD-8.3.12 on CentOS-6.2. FWIW I've been using 8.4.0 from ATrpms for a while now without problems. Never tried upgrading a live cluster from 8.3 to 8.4, though.

Re: [Linux-HA] weird problem w/ R1

2012-05-18 Thread Dimitri Maziuk
On 05/09/2012 07:07 AM, Dejan Muhamedagic wrote: Hi, On Wed, May 02, 2012 at 06:25:55PM -0500, Dimitri Maziuk wrote: [heartbeat] doesn't seem to run that particular script: it starts pure_uploadscript from resource.d and mon from init.d, but not the one in between. What's weird is I now

[Linux-HA] weird problem w/ R1

2012-05-02 Thread Dimitri Maziuk
now have it happening on 2 clusters: centos 5 w/ heartbeat 2.1.4, and centos 6 w/ heartbeat 3.0.4. The only common thing is bacula version: 5. Any ideas? TIA -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital

Re: [Linux-HA] HA samba?

2012-04-30 Thread Dimitri Maziuk
.) (I can't be more specific because I'm using R1 configs here, not crm.) -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA

Re: [Linux-HA] HA samba?

2012-04-25 Thread Dimitri Maziuk
on one node at a time. That one's simple: use drbdlinks to keep everything incl. /etc/samba on the drbd filesystem and fire up smbd and nmbd after drbdlinks -- pretty much like any other daemon backed by drbd storage. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http

Re: [Linux-HA] HA samba?

2012-04-25 Thread Dimitri Maziuk
on failover.) -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo

Re: [Linux-HA] ocf:heartbeat:apache resource agent and timeouts

2012-04-06 Thread Dimitri Maziuk
On 04/06/2012 06:15 AM, Dejan Muhamedagic wrote: On Thu, Apr 05, 2012 at 11:54:44AM -0500, Dimitri Maziuk wrote: How well the server is performing implies that it's up and running, so it's a valid test -- in the same sense that counting ice cubes in the freezer compartment is a valid test

Re: [Linux-HA] Command execution prior to resource start

2012-03-17 Thread Dimitri Maziuk
On 3/17/2012 7:06 AM, Charles Williams wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Hey all, I have been looking all over for a way to do this and have yet to find anything. I have a symlink resource that links an nfs share to /var/spool/cron (/var/spool/cron -

Re: [Linux-HA] clvm/dlm/gfs2 hangs if a node crashes

2012-03-14 Thread Dimitri Maziuk
to my satisfaction. That may be true if you offer only completely stateless services over UDP on the cluster IP address. Or running some interesting network stack on top of IP. According to my (admittedly fading) memory of networking 101 TCP-based services don't quite work that way. -- Dimitri

Re: [Linux-HA] Active-passive cluster, best practice question

2012-02-08 Thread Dimitri Maziuk
/passive on 5 nodes? As far as rsync goes, the difference is between rsync x rsync://foo/x and for i in foo bar baz ; do rsync x rsync://${i}/x ; done -- i.e. a non-issue really. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc

Re: [Linux-HA] RA manipulation of iptables firewall?

2012-01-27 Thread Dimitri Maziuk
address. Now I'm trying to find a way to run this app in my cluster. So far, I'm not seeing an RA already set up to do this. Why not make it static? I think you might even be able to exclude them from dynamic pool via /proc/sys/net/ipv4, but don't quote me on this. -- Dimitri Maziuk Programmer

Re: [Linux-HA] RA manipulation of iptables firewall?

2012-01-27 Thread Dimitri Maziuk
On 01/27/2012 02:48 PM, David Gersic wrote: On 1/27/2012 at 02:37 PM, Dimitri Maziuk dmaz...@bmrb.wisc.edu wrote: Why not make it static? Yeah, I could, but I didn't want to. I wanted to make it part of the resource group so it'll even be there if I add a new cluster node and move

Re: [Linux-HA] Services does not react after IP Takeover

2012-01-23 Thread Dimitri Maziuk
forever. If you're seeing that with active/active, then either your other node isn't really active or routing's messed up or something else's broken. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] Single Point of Failure

2012-01-13 Thread Dimitri Maziuk
is although SMTP RFCs start with reliable and efficiently, e-mail delivery has always been best effort -- so I would not build a business plan on the assumption that e-mail is or will ever be reliable. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu

Re: [Linux-HA] Single Point of Failure

2012-01-12 Thread Dimitri Maziuk
for online sales and product delivery so our solution - what ever it be - must allow for one location to completely lose power and still deliver client emails. One solution is called backup generator and a big fuel tank. Another one is called gmail. HTH -- Dimitri Maziuk Programmer/sysadmin

Re: [Linux-HA] compiling cluster glue in solaris 10 getting error

2011-12-27 Thread Dimitri Maziuk
of 'Linux-HA' spells Solaris. I actually know how to build gnuware on solaris, so if my employer told me to do that I'd first try to talk them out of it. And if that failed, I'd seriously consider changing jobs. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http

Re: [Linux-HA] Failover from external access dosent' work

2011-12-13 Thread Dimitri Maziuk
will be forwarded from the public static IP 62.2.208.170 to the IP 10.0.0.3 so what else coud the problem? Did it work when 10.0.0.1 was up? Does it work from 10.0.0.1 now? -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP

Re: [Linux-HA] Failover from external access dosent' work

2011-12-13 Thread Dimitri Maziuk
On 12/13/2011 12:48 PM, Elvis Altherr wrote: Am 13.12.2011 19:05, schrieb Dimitri Maziuk: On 12/13/2011 09:38 AM, Elvis Altherr wrote: Node 10.0.0.2 takes over the ressources and reserves the virtual IP 10.0.0.3 now if i access the Website from external (p.a. via office) i't dosen't work i

Re: [Linux-HA] Failover from external access dosent' work

2011-12-13 Thread Dimitri Maziuk
On 12/13/2011 01:45 PM, Elvis Altherr wrote: ok i will check.. but i think this ins't the problem cause if a do a portforwarding (NAT) to the webserver (mail2 = Node1) it works fine Try this, then: http://lists.linux-ha.org/pipermail/linux-ha/2008-March/031612.html -- Dimitri Maziuk

Re: [Linux-HA] Antw: Re: Q: unmanaged MD-RAID auto-recovery

2011-11-30 Thread Dimitri Maziuk
On 11/30/2011 2:01 AM, Ulrich Windl wrote: Dimitri Maziukdmaz...@bmrb.wisc.edu schrieb am 29.11.2011 um 19:36 in Nachricht4ed52637.9080...@bmrb.wisc.edu: On 11/29/2011 07:49 AM, Lars Marowsky-Bree wrote: (But the mdadm operations the RA does also shouldn't cause data corruption. That

Re: [Linux-HA] Antw: Re: Q: unmanaged MD-RAID auto-recovery

2011-11-29 Thread Dimitri Maziuk
or later. It's only MD bug in a sense that MD can't actually stop you from shooting yourself. HTH -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA

Re: [Linux-HA] Antw: Re: Q: unmanaged MD-RAID auto-recovery

2011-11-28 Thread Dimitri Maziuk
of the unmanaged resource, /without/ monitoring. I'd of thunk the un in unmanaged means none of its business. (Would the cluster also like to know what my grandmother's maiden name was between 1939 and 1945?) -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu

Re: [Linux-HA] Does ANYTHING Work on RHEL6?

2011-11-01 Thread Dimitri Maziuk
On 11/01/2011 07:17 AM, Nick Khamis wrote: We will need to turn SELINUX back soon. You know you won't be able to ssh in as root if you do that, right? -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital

Re: [Linux-HA] Does ANYTHING Work on RHEL6?

2011-11-01 Thread Dimitri Maziuk
On 11/01/2011 01:45 PM, Nick Khamis wrote: Should be possible to sudo? As long as you don't need xauth -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature

Re: [Linux-HA] Does ANYTHING Work on RHEL6?

2011-11-01 Thread Dimitri Maziuk
and methods bit) -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman

Re: [Linux-HA] what if brain split happens

2011-10-31 Thread Dimitri Maziuk
On 10/31/2011 7:14 AM, Lars Ellenberg wrote: Yes, basically that is what happens: it kills everything it spawned, makes sure it sleeps for at least deadtime, then re-execs the itself as new master control process. Re-exec itself? Cute. I never seen that -- but that again, the only time I

Re: [Linux-HA] what if brain split happens

2011-10-28 Thread Dimitri Maziuk
On 10/28/2011 6:18 AM, Lars Ellenberg wrote: On Tue, Oct 25, 2011 at 02:29:55PM -0500, Dimitri Maziuk wrote: I guess he's confused or trolling because in v1 there's no way heartbeat can restart itself. No, absolutely not. ... Which is supposed to result in all started resources

Re: [Linux-HA] what if brain split happens

2011-10-25 Thread Dimitri Maziuk
On 10/24/2011 10:49 PM, Hai Tao wrote: In case heartbeat communication is lost, brain split then happened, both nodes (a two nodes cluster for a simple example) are having the vip and other resources. When the heartbeat commnication comes back, what will happen? 1. both nodes will still

Re: [Linux-HA] what if brain split happens

2011-10-25 Thread Dimitri Maziuk
because in v1 there's no way heartbeat can restart itself. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux

Re: [Linux-HA] HA-Linux for Exim

2011-10-19 Thread Dimitri Maziuk
those broken symlinks after every software update on the passive node. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA

Re: [Linux-HA] HA-Linux for Exim

2011-10-19 Thread Dimitri Maziuk
Straw poll: 1. how long did it take you to get your first corosync/pacemaker/drbd cluster up and running the way you wanted? 2. are you, or have you ever been a developer working on linux-ha projects? -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu

  1   2   3   >