[Linux-ha-dev] Patch: stonith plugin external/vcenter HOSTLIST separator
Hi all, recently I tried to use the STONITH plugin external/vcenter along with vCenter 5 (I doubt, that the version is significant). While using the stonith resource for each node separately, I had no problems, but using it in a clone resulted in failures like that one: Nov 14 08:53:57 shermcl1 external/vcenter(vfencing:0)[23236]: [23257]: ERROR: [reset shermcl2] Invalid target specified where the cluster consists of virtual machines SHERMCL1, SHERMCL2 and SHERMCL3, with their unames shermcl1, shermcl2 and shermcl3, accordingly. shermcl2 should be fenced, but the remaining cluster members were unable to kill that machine. The relevant portion of the cluster configuration is here: node shermcl1 node shermcl2 node shermcl3 primitive vfencing stonith:external/vcenter \ params VI_SERVER=virtualcenter.dom.ain VI_CREDSTORE=/root/.vmware/credstore/vicredentials.xml HOSTLIST=shermcl1=SHERMCL1;shermcl2=SHERMCL2;shermcl3=SHERMCL3 RESETPOWERON=0 \ op monitor interval=3600s clone Fencing vfencing location l-Fencing_shermcl1 Fencing 0: shermcl1 location l-Fencing_shermcl2 Fencing 0: shermcl2 location l-Fencing_shermcl3 Fencing 0: shermcl3 The location statements are needed, as the cluster itself is no symmetric. All machines are plain openSUSE 12.2 with corosync 1.4.3 and pacemaker 1.1.6. While running perfectly on the commandline with stonith -t external/vcenter VI_SERVER=virtualcenter.dom.ain \ VI_CREDSTORE=/root/.vmware/credstore/vicredentials.xml \ HOSTLIST=shermcl1=SHERMCL1;shermcl2=SHERMCL2;shermcl3=SHERMCL3 \ RESETPOWERON=0 -l and showing the names of the three virtual machines, I found, that called as resource inside the cluster only the first hostname until the first = is visible, perhaps caused by the handover as environment variable. Applying the attached trivial patch to use a colon (:) instead of the equal sign (=) the command line test stonith -t external/vcenter VI_SERVER=virtualcenter.dom.ain \ VI_CREDSTORE=/root/.vmware/credstore/vicredentials.xml \ HOSTLIST=shermcl1:SHERMCL1;shermcl2:SHERMCL2;shermcl3:SHERMCL3 \ RESETPOWERON=0 -l as well as fencing inside the cluster with primitive vfencing stonith:external/vcenter \ params VI_SERVER=virtualcenter.dom.ain VI_CREDSTORE=/root/.vmware/credstore/vicredentials.xml HOSTLIST=shermcl1:SHERMCL1;shermcl2:SHERMCL2;shermcl3:SHERMCL3 RESETPOWERON=0 \ op monitor interval=3600s succeeds. So a question around: Is anyone using the external/vcenter with the cloned resource successfully with the original syntax? If so, where is my problem? If not, the attached patch changes the syntax in the above described way. If there is no objection can it be applied? Greetings, Stefan PS: sorry for the line breaks in the code -- Stefan Botter listrea...@jsj.dyndns.org # HG changeset patch # User Stefan Botter j...@jsj.dyndns.org # Date 1352974761 -3600 # Node ID 3429be9596a95127e04706c38c5c4d82fb67e206 # Parent 0809ed6abeb7289f3a8f4229f537df8d509c0854 - trivial change to use : as hostname delimiter in HOSTLIST instead of = diff -r 0809ed6abeb7 -r 3429be9596a9 lib/plugins/stonith/external/vcenter --- a/lib/plugins/stonith/external/vcenter Mon Oct 22 17:35:17 2012 +0200 +++ b/lib/plugins/stonith/external/vcenter Thu Nov 15 11:19:21 2012 +0100 @@ -55,12 +55,12 @@ longdesc lang=en The list of hosts that the VMware vCenter STONITH device controls. Syntax is: - hostname1[=VirtualMachineName1];hostname2[=VirtualMachineName2] + hostname1[:VirtualMachineName1];hostname2[:VirtualMachineName2] -NOTE: omit =VirtualMachineName if hostname and virtual machine names are identical +NOTE: omit :VirtualMachineName if hostname and virtual machine names are identical Example: - cluster1=VMCL1;cluster2=VMCL2 + cluster1:VMCL1;cluster2:VMCL2 /longdesc /parameter parameter name=VI_SERVER @@ -128,7 +128,7 @@ my %host_to_vm = (); my %vm_to_host = (); foreach my $host (@hostlist) { - my @config = split(/=/, $host); + my @config = split(/:/, $host); my $key = $config[0]; my $value = $config[1]; if (!defined($value)) { $value = $config[0]; } $host_to_vm{$key} = $value; ___ Linux-HA-Dev: Linux-HA-Dev@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev Home Page: http://linux-ha.org/
Re: [Linux-ha-dev] Patch: stonith plugin external/vcenter HOSTLIST separator
Hi, On Thu, Nov 15, 2012 at 11:52:48AM +0100, Stefan Botter wrote: Hi all, recently I tried to use the STONITH plugin external/vcenter along with vCenter 5 (I doubt, that the version is significant). While using the stonith resource for each node separately, I had no problems, but using it in a clone resulted in failures like that one: Nov 14 08:53:57 shermcl1 external/vcenter(vfencing:0)[23236]: [23257]: ERROR: [reset shermcl2] Invalid target specified where the cluster consists of virtual machines SHERMCL1, SHERMCL2 and SHERMCL3, with their unames shermcl1, shermcl2 and shermcl3, accordingly. shermcl2 should be fenced, but the remaining cluster members were unable to kill that machine. The relevant portion of the cluster configuration is here: node shermcl1 node shermcl2 node shermcl3 primitive vfencing stonith:external/vcenter \ params VI_SERVER=virtualcenter.dom.ain VI_CREDSTORE=/root/.vmware/credstore/vicredentials.xml HOSTLIST=shermcl1=SHERMCL1;shermcl2=SHERMCL2;shermcl3=SHERMCL3 RESETPOWERON=0 \ op monitor interval=3600s clone Fencing vfencing It could be that the issue comes from the bug in fence_legacy, which has been resolved in the meantime. Can you try to edit that and replace the split command (line 86) with the following (i.e. just append , 2): ($name,$val)=split /\s*=\s*/, $opt, 2; The file location should be /usr/sbin/fence_legacy. Can you please see if that helps? Cheers, Dejan location l-Fencing_shermcl1 Fencing 0: shermcl1 location l-Fencing_shermcl2 Fencing 0: shermcl2 location l-Fencing_shermcl3 Fencing 0: shermcl3 The location statements are needed, as the cluster itself is no symmetric. All machines are plain openSUSE 12.2 with corosync 1.4.3 and pacemaker 1.1.6. While running perfectly on the commandline with stonith -t external/vcenter VI_SERVER=virtualcenter.dom.ain \ VI_CREDSTORE=/root/.vmware/credstore/vicredentials.xml \ HOSTLIST=shermcl1=SHERMCL1;shermcl2=SHERMCL2;shermcl3=SHERMCL3 \ RESETPOWERON=0 -l and showing the names of the three virtual machines, I found, that called as resource inside the cluster only the first hostname until the first = is visible, perhaps caused by the handover as environment variable. Applying the attached trivial patch to use a colon (:) instead of the equal sign (=) the command line test stonith -t external/vcenter VI_SERVER=virtualcenter.dom.ain \ VI_CREDSTORE=/root/.vmware/credstore/vicredentials.xml \ HOSTLIST=shermcl1:SHERMCL1;shermcl2:SHERMCL2;shermcl3:SHERMCL3 \ RESETPOWERON=0 -l as well as fencing inside the cluster with primitive vfencing stonith:external/vcenter \ params VI_SERVER=virtualcenter.dom.ain VI_CREDSTORE=/root/.vmware/credstore/vicredentials.xml HOSTLIST=shermcl1:SHERMCL1;shermcl2:SHERMCL2;shermcl3:SHERMCL3 RESETPOWERON=0 \ op monitor interval=3600s succeeds. So a question around: Is anyone using the external/vcenter with the cloned resource successfully with the original syntax? If so, where is my problem? If not, the attached patch changes the syntax in the above described way. If there is no objection can it be applied? Greetings, Stefan PS: sorry for the line breaks in the code -- Stefan Botter listrea...@jsj.dyndns.org # HG changeset patch # User Stefan Botter j...@jsj.dyndns.org # Date 1352974761 -3600 # Node ID 3429be9596a95127e04706c38c5c4d82fb67e206 # Parent 0809ed6abeb7289f3a8f4229f537df8d509c0854 - trivial change to use : as hostname delimiter in HOSTLIST instead of = diff -r 0809ed6abeb7 -r 3429be9596a9 lib/plugins/stonith/external/vcenter --- a/lib/plugins/stonith/external/vcenterMon Oct 22 17:35:17 2012 +0200 +++ b/lib/plugins/stonith/external/vcenterThu Nov 15 11:19:21 2012 +0100 @@ -55,12 +55,12 @@ longdesc lang=en The list of hosts that the VMware vCenter STONITH device controls. Syntax is: - hostname1[=VirtualMachineName1];hostname2[=VirtualMachineName2] + hostname1[:VirtualMachineName1];hostname2[:VirtualMachineName2] -NOTE: omit =VirtualMachineName if hostname and virtual machine names are identical +NOTE: omit :VirtualMachineName if hostname and virtual machine names are identical Example: - cluster1=VMCL1;cluster2=VMCL2 + cluster1:VMCL1;cluster2:VMCL2 /longdesc /parameter parameter name=VI_SERVER @@ -128,7 +128,7 @@ my %host_to_vm = (); my %vm_to_host = (); foreach my $host (@hostlist) { - my @config = split(/=/, $host); + my @config = split(/:/, $host); my $key = $config[0]; my $value = $config[1]; if (!defined($value)) { $value = $config[0]; } $host_to_vm{$key} = $value; ___ Linux-HA-Dev: Linux-HA-Dev@lists.linux-ha.org
Re: [Linux-ha-dev] Patch: stonith plugin external/vcenter HOSTLIST separator
Hi Dejan, On Thursday, November 15, 2012 01:10:27 PM Dejan Muhamedagic wrote: It could be that the issue comes from the bug in fence_legacy, which has been resolved in the meantime. Can you try to edit that and replace the split command (line 86) with the following (i.e. just append , 2): ($name,$val)=split /\s*=\s*/, $opt, 2; The file location should be /usr/sbin/fence_legacy. Can you please see if that helps? Yes, I can confirm! It works that way. Okay, thank you, I will try to build my own fixed packages for my in- house production use. Greetings, Stefan -- Stefan Botter listrea...@jsj.dyndns.org ___ Linux-HA-Dev: Linux-HA-Dev@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev Home Page: http://linux-ha.org/
Re: [Linux-HA] Antw: Re: pcs or crmsh?
clusterlabs.org/doc is as good as i can do for docs. i try to keep it up-to-date and version specific (so that documenting corosync 2.x doesn't obliterate the cman/plugin stuff). packages are mostly in the hands of the distros though. building the entire stack (and keeping it up-to-date) on all the major distros is a massive job - i'd never get any actual work done. and typically not easy unless you work for the enterprise distro you're building for. In theory you should at most need scientific linux (even clusterlabs.org/rpm-next is somewhat optional) + your choice of shell/gui. It's a good site, and it is easy to see that a lot of work has gone into it. For me, though, it seems there are a few gaps where historical knowledge is assumed that a newbie does not necessarily have. (Case in point: I have 5 clusters in production and I have no idea what you mean by cman/plugin stuff.) There has been a lot of change and development over the years, and there is a whole lot of Google noise, some of which can seem fairly authoritative, but which is nevertheless dated. Also, for some reason, people who write articles often don't date them, so when you're reading something you Googled it is often not clear how current and applicable it is. This leads newbies down long and fruitless paths. There's also a lot of repos out there and it is not at all clear which is the best to use. (Case in point: I originally used the clusterlabs repo, but then I was told to use the clusterlabs-next repo--which seems to have worked--but now I guess I'm being told that I should no rmally want to use the scientific-linux repo? I may have misunderstood.) I would be thankful for a web page that said something along these lines: 1. No matter what else you may have read elsewhere, this is THE AUTHORITATIVE source for the latest up-to-date information and downloads. 2. Here's the right repo. 3. Here's links to the sources. 4. Here's a glossary of terms, with identification of ones that are outdated/deprecated and what they were replaced with. 5, Here's the most common pitfalls that newbies experience. 6. Here's where you can get community and/or paid support. It's getting close to Christmas, so there's my wish list. :-) --Eric ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Antw: Re: pcs or crmsh?
On 11/15/2012 09:53 AM, Robinson, Eric wrote: clusterlabs.org/doc is as good as i can do for docs. i try to keep it up-to-date and version specific (so that documenting corosync 2.x doesn't obliterate the cman/plugin stuff). packages are mostly in the hands of the distros though. building the entire stack (and keeping it up-to-date) on all the major distros is a massive job - i'd never get any actual work done. and typically not easy unless you work for the enterprise distro you're building for. In theory you should at most need scientific linux (even clusterlabs.org/rpm-next is somewhat optional) + your choice of shell/gui. It's a good site, and it is easy to see that a lot of work has gone into it. For me, though, it seems there are a few gaps where historical knowledge is assumed that a newbie does not necessarily have. (Case in point: I have 5 clusters in production and I have no idea what you mean by cman/plugin stuff.) There has been a lot of change and development over the years, and there is a whole lot of Google noise, some of which can seem fairly authoritative, but which is nevertheless dated. Also, for some reason, people who write articles often don't date them, so when you're reading something you Googled it is often not clear how current and applicable it is. This leads newbies down long and fruitless paths. There's also a lot of repos out there and it is not at all clear which is the best to use. (Case in point: I originally used the clusterlabs repo, but then I was told to use the clusterlabs-next repo--which seems to have worked--but now I guess I'm being told that I should no rmally want to use the scientific-linux repo? I may have misunderstood.) The cman plugin provides support for some of the Red Hat cluster tools, like clustered LVM and gfs2 filesystem. I would be thankful for a web page that said something along these lines: 1. No matter what else you may have read elsewhere, this is THE AUTHORITATIVE source for the latest up-to-date information and downloads. 2. Here's the right repo. 3. Here's links to the sources. For pacemaker, clusterlabs *is* the authoritative source. For corosync, Red Hat is the authoritative source. Depends on what program you are looking at for the rest. 4. Here's a glossary of terms, with identification of ones that are outdated/deprecated and what they were replaced with. 5, Here's the most common pitfalls that newbies experience. 6. Here's where you can get community and/or paid support. This is stated on the clusterlabs page, for pacemaker. In short; Community: - This list - Pacemaker mailing list. - IRC on freenode.net at #linux-ha and #linux-cluster. It's getting close to Christmas, so there's my wish list. :-) --Eric I'd like a pony and an extra day per week, please. -- Digimer Papers and Projects: https://alteeve.ca/w/ What if the cure for cancer is trapped in the mind of a person without access to education? ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] Question about linux cluster choice for Nagios Mysql
Hello, I dont know if this email address is used for support, but I have not found any forum for HA I have to secure a homemade monitoring solution mainly based on Nagios 2.x and MySQL 5.1. I must deploy an active / passive cluster with automated switch of services. 2 servers will be located on two different datacentres and connected by an optical fiber (which will be channeled through the lifeline + cluster replication data). The base of this solution is a 32-bit Debian Squeeze. Tests were carried out with products DRBD (8.3.7) Heartbeat (3.0.3) using the official Debian mirrors. The solution works well despite some freeze occurred since deployment but I am not sure that it come from clustering products. I wanted to get your opinion on the various security products such cluster (HA / Pacemaker / Corosync / keepalived / OpenSVC ...) to point me towards the most efficient and adapted according to my needs. Thank you in advance for your feedback. Do not hesitate if you want more details about these configurations. Have a good day. Cordialement. _ Julien MARIETTE Adjoint au Responsable - Service Réseaux et Télécommunications Deputy Head - Networks and Telecommunications Departement Description : Description : Description : Description : MONTAGEsoget_microsoft SOGET S.A Docks Dombasle 4, rue des Lamaneurs CS 20858 76085 - LE HAVRE CEDEX FRANCE Tel : +33 (0)2 35 19 25 60 (poste 302) Fax : +33 (0)2 35 19 02 93 Mail : mailto:julien.marie...@soget.fr julien.marie...@soget.fr Site : http://www.soget.fr/ www.soget.fr _ Avant d'imprimer, pensez à l'environnement. Before printing, think green. Ce message électronique et tous les fichiers attachés quil contient sont confidentiels et destinés exclusivement à lusage de la personne ou du groupe auquel ils sont adressés. Si vous avez reçu ce message par erreur, merci de le retourner à son émetteur. La publication, lutilisation, la distribution, limpression ou la copie non autorisée de ce message et des fichiers qui lui sont attachés sont strictement interdits. Les idées et opinions présentées dans ce message sont celles de son auteur et ne représentent pas nécessairement celles de SOGET. SOGET décline toute responsabilité en cas de pertes ou de dommages résultant de tout virus transmis par ce message électronique. This email and any files attached with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. If you have received this email in error, please send it back to its issuer. The unauthorized disclosure, use, distribution, printing or copy of this email and its attached files are strictly prohibited. Any views or opinions presented in this email are solely those of the author and do not necessarily represent those of SOGET. SOGET accepts no liability for any loss or damage caused by any virus transmitted by this email. image001.jpg___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] Heartbeat with Oracle's ASM
Hi friend: I want know heartbeat is support oracle ASM now?? HILL FANG Engineer Guangzhou Ericsson Communication Services Co.,Ltd.(GTC) SI Support 2 /F, NO. 1025 Gaopu Road, Tianhe Software Park,Tianhe District, Guangzhou, 510663, PR China Phone +86 020-85117631 Fax +86 020-29002699 SMS/MMS 15813329521 hill.f...@ericsson.com www.ericsson.com [http://www.ericsson.com/]http://www.ericsson.com/ This Communication is Confidential. We only send and receive email on the basis of the terms set out at www.ericsson.com/email_disclaimerhttp://www.ericsson.com/email_disclaimer inline: Email_line.gifinline: Email.gif___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Antw: Re: pcs or crmsh?
On 11/14/2012 03:33 PM, Digimer wrote: Linux in general is all about choice, possibly to a fault. I see no reason why clustering shouldn't be the same. I really like linux and cluster frameworks to spent choice (I was even so near to miss-spell that as joice :) but on the other hand it does not make sense to change things like crm to pcs without having any problems with the integrated, stable, multiple-used, road capable solution we already had. Customers does not really like such changes as it shows that this cluster solution is still teenage and not grown. This is in my point of view a very bad message! Regards Fabian ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Antw: Re: pcs or crmsh?
On 11/15/2012 12:03 AM, Andrew Beekhof wrote: I can think of 3 tooling changes: - ptest/crm_simulate - hb_report/crm_report - standalone crmsh Thats not /too/ bad in 4 years. But completely un-needed. Where is the benefit on changing from crm to pcs? ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Antw: Re: pcs or crmsh?
On 11/14/2012 11:20 PM, Andrew Beekhof wrote: I sincerely hope SUSE does continue with crmsh but I _like_ that there are people trying something new. Yes I also like things which are going better. But what is the benefit on dropping CRM and introducing PCS to that procject? What is the benefit for all distributions which in the past did not say its only tech prev? ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] pcs or crmsh?
On 11/14/2012 05:10 PM, alain.mou...@bull.net wrote: Hi Just for information, I'm using cleanup and crm_mon very very very often with lots of ressources configured in Pacemaker and never had any problem like the problems you describe ... (on RHEL) Alain crm shell and tools like crm_mon are stable on SLES since years! I really like this story to go on and no silly changes which have 0-benefit. Changes should have a real benefit otherwise they just hurt the story of a cluster project. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] pcs or crmsh?
Hi, On Wed, Nov 14, 2012 at 04:00:43PM +, Robinson, Eric wrote: Should I be using pcs or crmsh? Neither one seems to work quite right. What doesn't work? I think that at this point of time, it's be easier to get crmsh going/fixed with pcmk 1.1.8. It's probably just some path somewhere. If really nothing works, you *must* use LCMC, Pacemaker GUI. :) Rasto crm_mon 'crashes' sometimes. If I have crm_mon running one one computer and I change the config from another computer, the one where crm_mon is running will sometimes flash a very quick message and then bail out to a shell prompt. It happens so fast that I have not been able to see the whole message, but it seems to say something about the configuration not being compatible, then it drops to a shell prompt. When I go back in, I can see my changes though. Also, 'crm resource cleanup' does not always work. It leaves failed actions on the screen. I'm wondering if maybe I got crmsh from the wrong place or something. Again, here is what I have installed. Everything except crmsh came from the clusterlabs-next repo. crm_mon is not part of crmsh, but pacemaker. Just to add a bit to the general confusion ;-) crmsh came from the ha-clustering repo. That repository is fine for crmsh. Good luck! Dejan [root@ha09a ~]# rpm -qa|egrep coro|pacem|crmsh|sort corosync-1.4.1-7.el6_3.1.x86_64 corosynclib-1.4.1-7.el6_3.1.x86_64 crmsh-1.2.1-45.2.x86_64 pacemaker-1.1.8-4.el6.x86_64 pacemaker-cli-1.1.8-4.el6.x86_64 pacemaker-cluster-libs-1.1.8-4.el6.x86_64 pacemaker-libs-1.1.8-4.el6.x86_64 --Eric Disclaimer - November 14, 2012 This email and any files transmitted with it are confidential and intended solely for 'General Linux-HA mailing list'. If you are not the named addressee you should not disseminate, distribute, copy or alter this email. Any views or opinions presented in this email are solely those of the author and might not represent those of Physicians' Managed Care or Physician Select Management. Warning: Although Physicians' Managed Care or Physician Select Management has taken reasonable precautions to ensure no viruses are present in this email, the company cannot accept responsibility for any loss or damage arising from the use of this email or attachments. This disclaimer was added by Policy Patrol: http://www.policypatrol.com/ ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat with Oracle's ASM
On 11/15/2012 05:00 AM, Hill Fang wrote: Hi friend: I want know heartbeat is support oracle ASM now?? The heartbeat project has been deprecated for some time. There are no plans to continue it's development. I am unsure of it's supported state on Oracle, but regardless, I would advice you plan to use corosync. -- Digimer Papers and Projects: https://alteeve.ca/w/ What if the cure for cancer is trapped in the mind of a person without access to education? ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Question about linux cluster choice for Nagios Mysql
On 11/15/2012 02:52 AM, julien.marie...@soget.fr wrote: Hello, I don’t know if this email address is used for support, but I have not found any forum for HA … This channel is for pretty much any open-source cluster program, so you are in the right place. I have to secure a homemade monitoring solution mainly based on Nagios 2.x and MySQL 5.1. I must deploy an active / passive cluster with automated switch of services. 2 servers will be located on two different datacentres and connected by an optical fiber (which will be channeled through the lifeline + cluster replication data). What you are trying to do is called a stretch cluster. If you want automatic failover, you will have some significant challenges. Mainly, when a node stops responding, it needs to be put into a known state to ensure that the same service isn't offered twice or that shared storage is not happening without coordination. This is done using fencing, and fencing only really useful when it uses an independent network path. So dual links are needed. Now that probability of failing both links at the same time is real (someone digs without looking, for example) would break the cluster's fencing, leaving the nodes hung until there is human intervention. Stretch clustering requires very careful planning and rarely is worth it. The base of this solution is a 32-bit Debian Squeeze. Tests were carried out with products DRBD (8.3.7) Heartbeat (3.0.3) using the official Debian mirrors. DRBD 8.3.7 is *very* old. Heartbeat is deprecated and has no future development planned. The solution works well despite some freeze occurred since deployment but I am not sure that it come from clustering products. I wanted to get your opinion on the various security products such cluster (HA / Pacemaker / Corosync / keepalived / OpenSVC ...) to point me towards the most efficient and adapted according to my needs. Thank you in advance for your feedback. Do not hesitate if you want more details about these configurations. The future of open source clustering is on corosync + pacemaker. I would start by learning more about them. cheers -- Digimer Papers and Projects: https://alteeve.ca/w/ What if the cure for cancer is trapped in the mind of a person without access to education? ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Antw: Re: pcs or crmsh?
On 11/15/2012 06:52 AM, Fabian Herschel wrote: On 11/14/2012 11:20 PM, Andrew Beekhof wrote: I sincerely hope SUSE does continue with crmsh but I _like_ that there are people trying something new. Yes I also like things which are going better. But what is the benefit on dropping CRM and introducing PCS to that procject? What is the benefit for all distributions which in the past did not say its only tech prev? CRM was not dropped. It's up to it's developers to maintain it, which I suspect they have all intentions of doing. PCS is simply another option now. The tech preview comment was specific to Red Hat. How other distros choose to support pacemaker (and any other application) is entirely their choice, of course. -- Digimer Papers and Projects: https://alteeve.ca/w/ What if the cure for cancer is trapped in the mind of a person without access to education? ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat with Oracle's ASM
There is an RA for Oracle that can be used with Pacemaker. Generally ASM behaves like a regular Oracle instance, so you can try it. On Nov 15, 2012 8:57 AM, Hill Fang hill.f...@ericsson.com wrote: Hi friend: I want know heartbeat is support oracle ASM now?? HILL FANG Engineer Guangzhou Ericsson Communication Services Co.,Ltd.(GTC) SI Support 2 /F, NO. 1025 Gaopu Road, Tianhe Software Park,Tianhe District, Guangzhou, 510663, PR China Phone +86 020-85117631 Fax +86 020-29002699 SMS/MMS 15813329521 hill.f...@ericsson.com www.ericsson.com [http://www.ericsson.com/]http://www.ericsson.com/ This Communication is Confidential. We only send and receive email on the basis of the terms set out at www.ericsson.com/email_disclaimer http://www.ericsson.com/email_disclaimer ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Heartbeat with Oracle's ASM
On 2012-11-15T10:00:21, Hill Fang hill.f...@ericsson.com wrote: Hi friend: I want know heartbeat is support oracle ASM now?? No - and yes. Oracle RAC (I assume that's the context for ASM?) does not tolerate any cluster solution except itself. This is not supported together with Pacemaker. Pacemaker with the Oracle resource agent can manage a single instance fail-over for Oracle, yes. That is supported. Postgres/MySQL too. Regards, Lars -- Architect Storage/HA SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg) Experience is the name everyone gives to their mistakes. -- Oscar Wilde ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Antw: Re: pcs or crmsh?
On 2012-11-15T09:20:44, Andrew Beekhof and...@beekhof.net wrote: LCMC and crmsh/hawk are at least conceptionally very very different; Conceptually LCMC and hawk are both web based GUIs, its the implementation that makes them so different. Not quite. LCMC is pretty heavily different from a deployment perspective - ssh from where the GUI runs into the server, at the time we looked. (I admit I'm not upto speed if that is still required.) And yes, technically Java is a web technology. But. ;-) hawk is a server-side running web interface on top of crm shell. pcs is a (remote) shell and a GUI, that sounds pretty different to crmsh to me. Yes, the remote capability of pcs is different. Though crmsh has some ability in that regard too. And probably will have to grow them. But really, does everything in life have to be a there can be only one! battle? Highlander wasn't a documentary. No, of course not. Choice isn't bad. But neither is choice inherently and always good. Especially from the point of view of administrators, particularly the consistent management story of a cluster stack is tantamount. In my not humble at all opinion, this is an area where evolutionary improvements are much better received than radical changes. I realize that crmsh doesn't do everything; like all software, it has bugs, design deficiencies, features that are missing. Yet, administrators quite like it. So it must have gotten some things right. Before starting with a completely new tool at the most important part of the stack (the interface to our users), I'd liked to have seen a discussion if crmsh can't fill the gaps you identified, if we can't grow the management story consistently. And yes, a possible result could have been No, we cannot, crmsh cannot grow to accommodate them. And perhaps we'd have had a meta-tool, a side-by-side tool, which takes care of the bits that crmsh can't. Or whatever. But there might have been the chance that we could have avoided this. This discussion never happened. (At least not in public, nor with the maintainers involved.) And, strangely, pcs appeared at just about the time where Dejan decided that crmsh might be better off as its own project. Who is going to believe the this had technical reasons spiel? Honestly. We have known each other for how long? You want me to believe *what*? ;-) I enjoy choice and options when they bring a true benefit; or at least, they outweigh the cost. But that isn't always the case - consider: when we argue about how something should be configured in the CIB XML, do we implement several ways, just because we can? No? Would you merge them? Why not? Where the cases where we did the good or bad ideas, from an admin's point of view? ... Ah. You don't see my complaining about corosync 2.x changes relative to 1.4. Or cman. Fencing improvements. Plugin, MCP, etc. Dropping openais. Or many other changes. I didn't even mind heartbeat versus corosync as much; happens largely on the backend. Sure, these all bring technical challenges on update, but such is life. How often do you see me go up the walls about something? I have spent a year of said life digging into the various costs and benefits of concurrent open source developments. I realize forking/splitting code is one of the freedoms we are granted. And that there are scenarios where this is really a good idea, yes. Is this one of them? I have a very specific issue with crmsh versus pcs, because I think this hits the community (forget about RHEL and SLE HA for a moment, but they are of course also affected) exactly at the spot where it hurts most, and where choice has a high potential to be detrimental. (The only other place I could imagine that would get me as riled up would be the RA interface.) I'm being this painful and annoying because I am convinced this will hurt the project(s)/community, badly. And I'm not even talking duplicated effort, though that sucks too. And worse, the users; those who write howtos; the admins; those who try to read them and apply them to their systems ... Significantly hurt. I would go as far as the, possibly slightly stretched, position that it hurts the entire HA on Linux in the Data Center story in comparison to other proprietary HA solutions. Imagine, just for a moment, Veritas having an entirely different management front-end on RHEL than on SLES. Or changing it completely from version 5 to 6. Is it alarmist? Go talk to a few users, administrators, and consultants. Decide for yourself. Regards, Lars -- Architect Storage/HA SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg) Experience is the name everyone gives to their mistakes. -- Oscar Wilde ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Antw: Re: pcs or crmsh?
On 2012-11-14T15:11:05, Digimer li...@alteeve.ca wrote: any reason at all, to try new things. Sometimes it is superior, often it is not. In either case, users are free to go where they feel is best. Ah, but can they? How likely is it that the large distributions will offer both? Only those that don't supply their own documentation, that is for sure ;-) Regards, Lars -- Architect Storage/HA SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg) Experience is the name everyone gives to their mistakes. -- Oscar Wilde ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Question about linux cluster choice for Nagios Mysql
On 11/15/2012 10:11 AM, Digimer wrote: On 11/15/2012 02:52 AM, julien.marie...@soget.fr wrote: Hello, I have to secure a homemade monitoring solution mainly based on Nagios 2.x and MySQL 5.1. I must deploy an active / passive cluster with automated switch of services. 2 servers will be located on two different datacentres and connected by an optical fiber (which will be channeled through the lifeline + cluster replication data). What you are trying to do is called a stretch cluster. If you want automatic failover, you will have some significant challenges. Mainly, when a node stops responding, it needs to be put into a known state to ensure that the same service isn't offered twice or that shared storage is not happening without coordination. This is done using fencing, and fencing only really useful when it uses an independent network path. So dual links are needed. Now that probability of failing both links at the same time is real (someone digs without looking, for example) would break the cluster's fencing, leaving the nodes hung until there is human intervention. Stretch clustering requires very careful planning and rarely is worth it. So where do nagios and mysql come into the picture? Tests were carried out with products DRBD (8.3.7) Heartbeat (3.0.3) using the official Debian mirrors. DRBD 8.3.7 is *very* old. Heartbeat is deprecated and has no future development planned. Which doesn't mean you shouldn't use heartbeat for simple stupid 2-node active/passive 'haresources' cluster. You shouldn't use *if* you need more than simple stupid. The good news is it's not changing to something not entirely dissimilar every 18 months, unlike everything that's been developed since. DRBD is old but our public servers have been running 8.3 for quote some time now without problems. (Our centos 5 servers have been running heartbeat 2.1.4 and drbd 8.3.8 for years now.) I wanted to get your opinion on the various security products such cluster (HA / Pacemaker / Corosync / keepalived / OpenSVC ...) to point me towards the most efficient and adapted according to my needs. Where'd security products come from? Do you mean you nagios+mysql setup is doing some sort of security monitoring? The good thing about heartbeat is it's not being developed anymore. So what you've learned about it remains relevant. The future of open source clustering is on corosync + pacemaker. I would start by learning more about them. I would wait a year. They'll come up with something else and you'll have to unlearn the old busted coronary+zapper and learn about the new shiny+hotness instead. But for the most part: what is you're trying to actually do? Using drbd for database replication is suboptimal, especially over non-local links. You really want transactional replication and if mysql doesn't do it, switch to the one that does. As for nagios, why not set up two independent ones monitoring everything and each other? I suspect you can go a lot with a few lines of perl to make sure you don't get double the e-mail. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Question about linux cluster choice for Nagios Mysql
On 11/15/2012 02:03 PM, Dimitri Maziuk wrote: Apologies for bad cun-n-paste: Where'd security products come from? Do you mean you nagios+mysql setup is doing some sort of security monitoring? The good thing about heartbeat is it's not being developed anymore. So what you've learned about it remains relevant. -- the second sentence wasn't supposed to be there. -- Dimitri Maziuk Programmer/sysadmin BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu signature.asc Description: OpenPGP digital signature ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Question about linux cluster choice for Nagios Mysql
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 On 11/15/2012 03:03 PM, Dimitri Maziuk wrote: On 11/15/2012 10:11 AM, Digimer wrote: On 11/15/2012 02:52 AM, julien.marie...@soget.fr wrote: Hello, I have to secure a homemade monitoring solution mainly based on Nagios 2.x and MySQL 5.1. I must deploy an active / passive cluster with automated switch of services. 2 servers will be located on two different datacentres and connected by an optical fiber (which will be channeled through the lifeline + cluster replication data). What you are trying to do is called a stretch cluster. If you want automatic failover, you will have some significant challenges. Mainly, when a node stops responding, it needs to be put into a known state to ensure that the same service isn't offered twice or that shared storage is not happening without coordination. This is done using fencing, and fencing only really useful when it uses an independent network path. So dual links are needed. Now that probability of failing both links at the same time is real (someone digs without looking, for example) would break the cluster's fencing, leaving the nodes hung until there is human intervention. Stretch clustering requires very careful planning and rarely is worth it. So where do nagios and mysql come into the picture? Tests were carried out with products DRBD (8.3.7) Heartbeat (3.0.3) using the official Debian mirrors. DRBD 8.3.7 is *very* old. Heartbeat is deprecated and has no future development planned. Which doesn't mean you shouldn't use heartbeat for simple stupid 2-node active/passive 'haresources' cluster. You shouldn't use *if* you need more than simple stupid. The good news is it's not changing to something not entirely dissimilar every 18 months, unlike everything that's been developed since. DRBD is old but our public servers have been running 8.3 for quote some time now without problems. (Our centos 5 servers have been running heartbeat 2.1.4 and drbd 8.3.8 for years now.) I wanted to get your opinion on the various security products such cluster (HA / Pacemaker / Corosync / keepalived / OpenSVC ...) to point me towards the most efficient and adapted according to my needs. Where'd security products come from? Do you mean you nagios+mysql setup is doing some sort of security monitoring? The good thing about heartbeat is it's not being developed anymore. So what you've learned about it remains relevant. The future of open source clustering is on corosync + pacemaker. I would start by learning more about them. I would wait a year. They'll come up with something else and you'll have to unlearn the old busted coronary+zapper and learn about the new shiny+hotness instead. But for the most part: what is you're trying to actually do? Using drbd for database replication is suboptimal, especially over non-local links. You really want transactional replication and if mysql doesn't do it, switch to the one that does. As for nagios, why not set up two independent ones monitoring everything and each other? I suspect you can go a lot with a few lines of perl to make sure you don't get double the e-mail. This is verging on a philosophical debate, which I am not to interested in. For what it's worth, I use corosync + cman + rgmanager because it is so stable and relatively unchanging, with a planned supported life to 2020. My point was that Julien, starting a new project, should not start on day 1 with very old, deprecated software. By your argument, I could say what will a user do when X simply is not supported anymore and a critical issue is found?. I offer my advice for free, and people can take from it what they paid for it. - -- Digimer Papers and Projects: https://alteeve.ca/w/ What if the cure for cancer is trapped in the mind of a person without access to education? -BEGIN PGP SIGNATURE- Version: GnuPG v1.4.12 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://www.enigmail.net/ iQIcBAEBAgAGBQJQpU9QAAoJEJL1R7RwoP6I++4P+wal3C76+IC5eDwNbRnSQIIU hGI+ozr0b/oz7HBCWvZijNDiABRd0lh2du16qlHfI/ZjFw8ArZGlZNcGe9ZWpGoH yT5HG78nHvR8yaQu/bpjp/QE0lrf4fqgKjrp1hvduT6IAWQd3FZL/QlWt7VMWofE S6A9BVq+bI6PRUddAsJPiZrmpaQ6xRb4LKmaUECQafrl3j4PqBoFNf6v0D7ywGBR Fa3N1JkU6xd146rZPwnAoQ80hsVMbvmNe6ekRQEoeHytd/LAJilVRU4GsYyz8soo 2Z5RdhVq+AbrrWH8OT5Ch1HhlcbJNmL5kXOTbMNHaUYDi7r/5qlcrv7VtGOAJyIN QpBqprOH4woLDeRJgiseQCevKnBncueaLqOa+vgLzEvzP8YImURx6DvvHXLfYR8N ViROgGihgHGSdDPO9HlnvYRaAkoNIqjD8WYK5mSXWJh+DkHDswYxp7oheLVAL1M3 HB7LrBtx4OzcX79V439zq1nCK1qBmW8wxj90//MTfSid/gj7cYDTj+cHgzE41iJu 1utWtzq1yrj2oEx1ZqI6YtP1cacl4J/h2XHFWTcjf1LtOqFRaAcf+dAY3ceX2X21 uFKmeqlU0+SdX4hRWrqkvdXD2nrsICaNcZ02sj+0ADT13R/Sx+k7qj3ZsNQ42Zbt sReTnT3DaUeSNhRtytYS =IYGa -END PGP SIGNATURE- ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also:
Re: [Linux-HA] Antw: Re: pcs or crmsh?
On Fri, Nov 16, 2012 at 1:53 AM, Robinson, Eric eric.robin...@psmnv.com wrote: clusterlabs.org/doc is as good as i can do for docs. i try to keep it up-to-date and version specific (so that documenting corosync 2.x doesn't obliterate the cman/plugin stuff). packages are mostly in the hands of the distros though. building the entire stack (and keeping it up-to-date) on all the major distros is a massive job - i'd never get any actual work done. and typically not easy unless you work for the enterprise distro you're building for. In theory you should at most need scientific linux (even clusterlabs.org/rpm-next is somewhat optional) + your choice of shell/gui. It's a good site, and it is easy to see that a lot of work has gone into it. For me, though, it seems there are a few gaps where historical knowledge is assumed that a newbie does not necessarily have. (Case in point: I have 5 clusters in production and I have no idea what you mean by cman/plugin stuff.) Fair point. For the sake of clarification though: Unlike Heartbeat (or Corosync 2.x) Corosync 1.x does not provide the notion of quorum that Pacemaker needs. There are two possible sources cman[1] or the pacemaker plugin. Conceptually they do the same thing so when I'm being lazy I tend to lump them together as cman/plugin :-) [1] Which also happens to be a corosync plugin but thats beside the point. There has been a lot of change and development over the years, and there is a whole lot of Google noise, some of which can seem fairly authoritative, but which is nevertheless dated. Also, for some reason, people who write articles often don't date them, so when you're reading something you Googled it is often not clear how current and applicable it is. This leads newbies down long and fruitless paths. There's also a lot of repos out there and it is not at all clear which is the best to use. (Case in point: I originally used the clusterlabs repo, but then I was told to use the clusterlabs-next repo--which seems to have worked--but now I guess I'm being told that I should no rmally want to use the scientific-linux repo? I may have misunderstood.) I would be thankful for a web page that said something along these lines: 1. No matter what else you may have read elsewhere, this is THE AUTHORITATIVE source for the latest up-to-date information and downloads. 2. Here's the right repo. Already you're hosed. Unless you try and force everyone onto the same distro. If you say pick one and document it, then I'll point you to Clusters from Scratch :-) 3. Here's links to the sources. 4. Here's a glossary of terms, with identification of ones that are outdated/deprecated and what they were replaced with. 5, Here's the most common pitfalls that newbies experience. 6. Here's where you can get community and/or paid support. It's getting close to Christmas, so there's my wish list. :-) Well I happen to be working on a new landing page which partially addresses these points. I'll see if I can can get most of them covered. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Antw: Re: pcs or crmsh?
On Thu, Nov 15, 2012 at 10:52 PM, Fabian Herschel fabian.hersc...@arcor.de wrote: On 11/14/2012 11:20 PM, Andrew Beekhof wrote: I sincerely hope SUSE does continue with crmsh but I _like_ that there are people trying something new. Yes I also like things which are going better. But what is the benefit on dropping CRM and introducing PCS to that procject? I'm growing tired of people moaning without listening. The _project_ has not replaced crm with pcs. What is the benefit for all distributions which in the past did not say its only tech prev? If you paid attention at all, you'd know that they will continue to offer crmsh. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] pcs or crmsh?
On Thu, Nov 15, 2012 at 10:55 PM, Fabian Herschel fabian.hersc...@arcor.de wrote: On 11/14/2012 05:10 PM, alain.mou...@bull.net wrote: Hi Just for information, I'm using cleanup and crm_mon very very very often with lots of ressources configured in Pacemaker and never had any problem like the problems you describe ... (on RHEL) Alain crm shell and tools like crm_mon are stable on SLES since years! So why not keep using it there? Why do you feel the need to keep complaining about something that isn't going to affect you? I really like this story to go on and no silly changes which have 0-benefit. Changes should have a real benefit otherwise they just hurt the story of a cluster project. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Antw: Re: pcs or crmsh?
On Fri, Nov 16, 2012 at 6:42 AM, Lars Marowsky-Bree l...@suse.com wrote: On 2012-11-14T15:11:05, Digimer li...@alteeve.ca wrote: any reason at all, to try new things. Sometimes it is superior, often it is not. In either case, users are free to go where they feel is best. Ah, but can they? How likely is it that the large distributions will offer both? They don't have to. RHEL has EPEL and SLES has OBS, two mechanisms specifically designed to allow people to easily use software not officially supported by their distribution. Debian has ways too. Only those that don't supply their own documentation, that is for sure ;-) Regards, Lars -- Architect Storage/HA SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg) Experience is the name everyone gives to their mistakes. -- Oscar Wilde ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
Re: [Linux-HA] Antw: Re: pcs or crmsh?
On Fri, Nov 16, 2012 at 6:42 AM, Lars Marowsky-Bree l...@suse.com wrote: On 2012-11-15T09:20:44, Andrew Beekhof and...@beekhof.net wrote: LCMC and crmsh/hawk are at least conceptionally very very different; Conceptually LCMC and hawk are both web based GUIs, its the implementation that makes them so different. Not quite. LCMC is pretty heavily different from a deployment perspective - ssh from where the GUI runs into the server, at the time we looked. (I admit I'm not upto speed if that is still required.) And yes, technically Java is a web technology. But. ;-) hawk is a server-side running web interface on top of crm shell. pcs is a (remote) shell and a GUI, that sounds pretty different to crmsh to me. Yes, the remote capability of pcs is different. Though crmsh has some ability in that regard too. And probably will have to grow them. And a REST interface and a GUI that talks to it as well? The design Chris has come up with is very nice and you'd have to squint really really hard to make crmsh fit. I could also turn that around and say that pcs is really just ccs growing pacemaker support. With enough work, anything can grow anything. But that doesn't imply its a good idea. But really, does everything in life have to be a there can be only one! battle? Highlander wasn't a documentary. No, of course not. Choice isn't bad. But neither is choice inherently and always good. Especially from the point of view of administrators, particularly the consistent management story of a cluster stack is tantamount. In my not humble at all opinion, this is an area where evolutionary improvements are much better received than radical changes. I realize that crmsh doesn't do everything; like all software, it has bugs, design deficiencies, features that are missing. Yet, administrators quite like it. So it must have gotten some things right. Before starting with a completely new tool at the most important part of the stack (the interface to our users), I'd liked to have seen a discussion if crmsh can't fill the gaps you identified, if we can't grow the management story consistently. And yes, a possible result could have been No, we cannot, crmsh cannot grow to accommodate them. And perhaps we'd have had a meta-tool, a side-by-side tool, which takes care of the bits that crmsh can't. Or whatever. But there might have been the chance that we could have avoided this. This discussion never happened. (At least not in public, nor with the maintainers involved.) To be blunt, because it is no more your business than the creation of LCMC or Hawk was mine. As a company, SUSE decided pygui wasn't going to get them to where they wanted to go, went back to the drawing board and decided to create Hawk and chose technologies that complimented SUSE's in-house management framework. I and the community were informed after the decisions was made, there was no request for permission. Nor would that be expected. The only surprise is when some management/marketing type person doesn't try to keep stuff like this a secret so the finished product can be unveiled as a competitive advantage. I understand you think its a really bad idea and that you would have liked the opportunity to talk Chris out of it before it was too late. But I seriously doubt it would have changed anything. No-one here is thinking man, we screwed up but its too late to pull out now. You've not managed to convince me and I've also got a fair bit to loose wouldn't you say? And, strangely, pcs appeared at just about the time where Dejan decided that crmsh might be better off as its own project. Who is going to believe the this had technical reasons spiel? Honestly. We have known each other for how long? You want me to believe *what*? ;-) Whatever you think I was thinking about then, I didn't make the decision (or lobby for a specific outcome). But what you believe is up to you. I enjoy choice and options when they bring a true benefit; or at least, they outweigh the cost. But that isn't always the case - consider: when we argue about how something should be configured in the CIB XML, do we implement several ways, just because we can? No? Would you merge them? Why not? Where the cases where we did the good or bad ideas, from an admin's point of view? ... Ah. You don't see my complaining about corosync 2.x changes relative to 1.4. Or cman. Fencing improvements. Plugin, MCP, etc. Dropping openais. Or many other changes. I didn't even mind heartbeat versus corosync as much; happens largely on the backend. Sure, these all bring technical challenges on update, but such is life. How often do you see me go up the walls about something? I have spent a year of said life digging into the various costs and benefits of concurrent open source developments. I realize forking/splitting code is one of the freedoms we are granted. And that there are scenarios where this is really a good idea, yes. Is this one
[Linux-HA] Antw: Re: pcs or crmsh?
Dejan Muhamedagic deja...@fastmail.fm schrieb am 15.11.2012 um 17:05 in Nachricht 20121115160527.GB3763@squib: Hi, On Wed, Nov 14, 2012 at 04:00:43PM +, Robinson, Eric wrote: [...] crm_mon 'crashes' sometimes. If I have crm_mon running one one computer and I change the config from another computer, the one where crm_mon is running will sometimes flash a very quick message and then bail out to a shell prompt. It happens so fast that I have not been able to see the whole message, but it seems to say something about the configuration not [...] Ever tried script to capture the output? (script is available on UNIX since the 80ies). Regards, Ulrich ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems