Re: [Pacemaker] hawk session timeout?

2014-12-01 Thread Tim Serong
HA), or the github issue tracker (https://github.com/ClusterLabs/hawk) if not. Regards, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailma

Re: [Pacemaker] [ha-wg] [RFC] Organizing HA Summit 2015

2014-12-01 Thread Tim Serong
sume is what's going to happen this time), or all virtual. Mixing the two is exceedingly difficult to do well, IMO. Regards, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pacemaker mailing list: Pacemaker@oss.clusterlab

Re: [Pacemaker] Hawk session ends after start or stop action

2014-03-05 Thread Tim Serong
k (service hawk restart) and possibly logging out and back in in your web browser should have been enough to resolve it. Regards, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs

[Pacemaker] Announce: Hawk (HA Web Konsole) 0.6.2

2013-12-06 Thread Tim Serong
&package=cluster-glue -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.

[Pacemaker] Announce: opensuse-ha mailing list

2013-09-17 Thread Tim Serong
storage (drbd) - Basically, anything in network:ha-clustering:* on OBS is on topic :) If you'd like to subscribe, just send an email to: opensuse-ha+subscr...@opensuse.org Please also see the wiki page at: https://en.opensuse.org/openSUSE:High_Availability Happy clustering! Tim -- Tim S

Re: [Pacemaker] reorg of network:ha-clustering repo on build.opensuse.org

2013-08-09 Thread Tim Serong
On 07/26/2013 09:58 PM, Tim Serong wrote: > On 07/25/2013 03:59 PM, Tim Serong wrote: >> Hi All, >> >> This is just a quick heads-up. We're in the process of reorganising the >> network:ha-clustering repository on build.opensuse.org. If you don't >>

Re: [Pacemaker] reorg of network:ha-clustering repo on build.opensuse.org

2013-07-26 Thread Tim Serong
On 07/25/2013 03:59 PM, Tim Serong wrote: > Hi All, > > This is just a quick heads-up. We're in the process of reorganising the > network:ha-clustering repository on build.opensuse.org. If you don't > use any of the software from this repo feel free to stop reading now

[Pacemaker] reorg of network:ha-clustering repo on build.opensuse.org

2013-07-24 Thread Tim Serong
roject for openSUSE:Factory) This means that if you're currently using packages from network:ha-clustering, you'll need to point to network:ha-clustering:Stable instead (once we've finished shuffling everything around). I'll send another email out when this is done. Regards, Tim --

Re: [Pacemaker] Is crm_gui available under RHEL6?

2013-02-17 Thread Tim Serong
s (FC 18 ships rails 3.2). I do have a reasonable rails 3.2 port which I'll make available "soon", but I still have some work in progress, bugs to fix, things to clean up, etc. etc. before announcing a release. Regards, Tim -- Tim Serong Senior Clustering Engineer S

Re: [Pacemaker] killproc not found? o2cb shutdown via resource agent

2012-11-08 Thread Tim Serong
On 11/08/2012 07:56 PM, Andrew Beekhof wrote: > On Thu, Nov 8, 2012 at 5:16 PM, Tim Serong wrote: >> On 11/08/2012 12:11 PM, Andrew Beekhof wrote: >>> On Thu, Nov 8, 2012 at 9:59 AM, Matthew O'Connor wrote: >>>> Follow-up and additional info: >>>&g

Re: [Pacemaker] killproc not found? o2cb shutdown via resource agent

2012-11-07 Thread Tim Serong
s is not the most desirable solution. > > I think thats as good a solution as any. > I wonder where other distros are getting it from. SLES 11 SP2: # rpm -qf /sbin/killproc sysvinit-2.86-210.1 openSUSE 12.2: # rpm -qf /sbin/killproc sysvinit-tools-2.88+-77.3.1.x86_64 Can't speak for any

[Pacemaker] Fwd: Re: How can I make the secondary machine elect itself owner of the floating IP address?

2012-09-23 Thread Tim Serong
quot;bak" are kind of meaningless assuming identical nodes (and the nomenclature gets confusing when you start talking about masters and slaves on top of that). Anyway... Original Message Subject: Re: How can I make the secondary machine elect itself owner of the float

Re: [Pacemaker] [corosync] Ideas on merging #linux-ha and #linux-cluster on freenode

2012-05-28 Thread Tim Serong
n't just opened. :) >> > > I think the only thing you missed was proposing a meta-project to rule > them all :-) ...One Totem Ring to rule them all, one Totem Ring to find them... If only Sauron had implemented RRP during the Second Age, thing

Re: [Pacemaker] ocfs2_controld.pcmk process issue

2012-05-15 Thread Tim Serong
for configuration/setup - these are pretty much equally applicable for both SLES and openSUSE. Regards, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/

Re: [Pacemaker] [Openais] Help on mysql-proxy resource

2012-03-30 Thread Tim Serong
roxy-lua-scripts > /usr/share/doc/packages/mysql-proxy/examples/tutorial-basic.lua" \ This is --proxy-lua-scripts (plural). I'm guessing maybe that's the problem. HTH, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___

Re: [Pacemaker] pacemaker - corosync with not automatic failover

2012-02-06 Thread Tim Serong
ith preference on node_0 (score 100), or node_1 (score 50), or some other node if neither node_0 nor node_1 are available (and assuming you have more than two nodes). HTH, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com __

Re: [Pacemaker] OCFS2 problems when connectivity lost

2011-12-21 Thread Tim Serong
e quorate node, because of loss of DLM comms. If STONITH is configured, the non-quorate node should be killed after a failed (or timed out) stop, and the quorate node should resume behaving normally. HTH, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com

Re: [Pacemaker] Doc: Resource templates

2011-12-13 Thread Tim Serong
that Florian mentions "prototype", hmm...) Anyway, IMO, overloading the word "template" isn't /too/ bad. It could be qualified if necessary as "resource template" (the new feature we're talking about here) and "configuration template" (existing she

Re: [Pacemaker] ACL setup

2011-12-11 Thread Tim Serong
ought of that. Adding the user to the haclient group removes any restrictions as I was able to write to the config without error. Did you set "crm configure property enable-acl=true"? Without this, all users in the haclient group have full access. Regards, Tim -- Tim Serong Se

Re: [Pacemaker] Service failed to load 'pacemaker'

2011-12-07 Thread Tim Serong
tanza, so there's no need for a separate /etc/corosync/service.d/pcmk file (although you can use that if you want, just don't have both!) HTH, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pacemaker mailing list: Pace

Re: [Pacemaker] How to stop a failed resource?

2011-11-07 Thread Tim Serong
h document(s) have I missed please? http://clusterlabs.org/doc/crm_cli.html Also, just run "crm", it has tab completion, online help, etc. Regards, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pac

Re: [Pacemaker] [Linux-HA] pcmk + corosync + cman for dlm support?

2011-11-02 Thread Tim Serong
encing for any prototype system that's going to need fencing when put into production :) Regards, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterl

Re: [Pacemaker] [Ocfs2-users] Error building ocfs2-tools

2011-11-02 Thread Tim Serong
bs.org/pacemaker/1.1/file/9971ebba4494/lib/common/ais.c#l327 but note ais.c moved to corosync.c in newer source tree on github -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss

Re: [Pacemaker] Language bindings, again (was Re: Newcomer's question - API?)

2011-11-02 Thread Tim Serong
On 11/02/2011 06:35 PM, Florian Haas wrote: On 2011-11-02 04:33, Tim Serong wrote: I vaguely recall reading the FSF considered headers generally exempt from GPL provisions, provided they're "boring", i.e. just structs, function definitions etc. If they're a whole lotta inl

Re: [Pacemaker] Newcomer's question - API?

2011-11-01 Thread Tim Serong
npack_rsc_op() functions from Pacemaker's lib/pengine/unpack.c in $other_language_of_your_choice. Regards, Tim [1] http://clusterlabs.org/wiki/Hawk [2] http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Pacemaker_Explained/ch-status.html -- Tim Serong Senior Clusterin

Re: [Pacemaker] Trouble with KVM Resource

2011-10-31 Thread Tim Serong
be possible for that state file to be empty. Unless, somehow (wild guess), permissions on the state file or some parent directory prohibit writing? Regards, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pacemaker mai

Re: [Pacemaker] cloning primatives with differing params

2011-10-25 Thread Tim Serong
ay to specify that a resource can run on any node without having to add a location constraint for each node as they are added? You could try one constraint per resource, covering all nodes, something like: location some-res-on-all-nodes some-resource \ rule 0: #uname eq

Re: [Pacemaker] 4 servers; different resources on different servers?

2011-10-03 Thread Tim Serong
ds what location constraints you configure. HTH, Tim [1] Depending on your definition, it might also mean "the exact same resource is running on at least two nodes, e.g.: a clustered filesystem. -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com __

Re: [Pacemaker] Call cib_modify failed (-22): The object/attribute does not exist

2011-09-26 Thread Tim Serong
ause the element is bogus (apparently the cibadmin man page needs tweaking). Try: Better yet, use the crm shell instead of cibadmin, and you can forget about the XML :) Regards, Tim -- Tim Serong Senior Clustering Engineer S

Re: [Pacemaker] crm_mon -n -1 : Command output format

2011-09-08 Thread Tim Serong
at does with orphans. Regards, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org G

Re: [Pacemaker] crm resource status and HAWK display differ after manually mounting filesystem resource

2011-08-30 Thread Tim Serong
On 29/08/11 13:24, Tim Serong wrote: On 28/08/11 21:43, Sebastian Kaps wrote: Hi, on our two-node cluster (SLES11-SP1+HAE; corosync 1.3.1, pacemaker 1.1.5) we have defined the following FS resource and its corresponding clone: primitive p_fs_wwwdata ocf:heartbeat:Filesystem \ params device

Re: [Pacemaker] crm resource status and HAWK display differ after manually mounting filesystem resource

2011-08-28 Thread Tim Serong
upported-by-SUSE-but-best-effort-support-by-me) build, you can try hawk-0.4.1 from: http://software.opensuse.org/search?q=Hawk&baseproject=SUSE%3ASLE-11%3ASP1&lang=en Alternately, if you can reproduce the issue then send me the output of "cibadmin -Q" (offlist is fine)

Re: [Pacemaker] Announce: Hawk 4.1 (Pacemaker GUI) packages for Debian Squeeze

2011-08-22 Thread Tim Serong
<http://www.dizopsin.net/debian-and-ubuntu-packages-for-clusterlabs-ha> Best regards, Joerg Many thanks for your work! Regards, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pacemaker mailing list: Pacemaker@oss.clus

Re: [Pacemaker] DLM and Control instances for OCFS2

2011-08-21 Thread Tim Serong
separate instance of the above for each OCFS2 volume being managed by Corosync/Pacemaker cluster? Nope, just the one. Regards, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http

Re: [Pacemaker] Extracting resource state information from the XML

2011-08-11 Thread Tim Serong
C code that does the same thing). If you only care about state, you probably only care about the *last* op. I should also take the opportunity to plug Hawk, if you need a web based thing for managing Pacemaker clusters: http://www.clusterlabs.org/wiki/Hawk HTH, T

Re: [Pacemaker] Dependency Loop Errors in Log

2011-08-09 Thread Tim Serong
ce Constraints chapter of Pacemaker explained (http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Pacemaker_Explained/) or the mailing list archives (this has come up a few times in recent memory). HTH, Tim -- Tim Serong Senior Clustering Engineer SUSE tser...@suse.com

Re: [Pacemaker] wiping out cluster config

2011-07-07 Thread Tim Serong
list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker -- Tim

Re: [Pacemaker] Pacemaker+Corosync from OBS

2011-06-22 Thread Tim Serong
On 22/06/11 22:14, Ciro Iriarte wrote: 2011/6/21 Tim Serong: On 22/06/11 08:57, Ciro Iriarte wrote: Hi, I'm trying pacemaker from OBS and I don't see any init script for corosync or pacemaker, am I overlooking something obvious? Name: pacemakerRelocat

Re: [Pacemaker] Pacemaker+Corosync from OBS

2011-06-21 Thread Tim Serong
: 1.1 Build Date: Thu Apr 14 04:08:04 2011 Regards, Install openais as well - it includes /etc/init.d/openais which starts corosync. Regards, Tim -- Tim Serong Senior Clustering Engineer, OPS Engineering, Novell Inc. ___ Pacemaker ma

Re: [Pacemaker] Permission denied using HAWK

2011-06-19 Thread Tim Serong
o the latest version (hawk-0.4.1-2.1.$ARCH.rpm). Regards, Tim -- Tim Serong Senior Clustering Engineer, OPS Engineering, Novell Inc. ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Re: [Pacemaker] [Openais] question for the technical support

2011-06-09 Thread Tim Serong
o connect to Samba on the remaining host if you use the host's physical IP address, rather than a virtual IP? Are there any errors in /var/log/samba/log.smbd and/or /var/log/ctdb/log.ctdb? Regards, Tim -- Tim Serong Senior Clustering Enginee

Re: [Pacemaker] Announce: Hawk (HA Web Konsole) 0.4.1

2011-05-20 Thread Tim Serong
is fine with both "adaugherity" and "ADaugherity" but hawk/crm_gui require the mixed-case version. They go via the PAM backends too, so this is surprising ... Thanks for pointing this out. Noted. I'm not sure what's going on there yet

Re: [Pacemaker] Announce: Hawk (HA Web Konsole) 0.4.1

2011-05-20 Thread Tim Serong
On 19/05/11 00:43, Tim Serong wrote: Hi Everybody, This is to announce version 0.4.1 of Hawk, a web-based GUI for managing and monitoring Pacemaker High-Availability clusters. [...] Building an RPM for Fedora/Red Hat is still just as easy as last time: # hg clone http://hg.clusterlabs.org

[Pacemaker] Announce: Hawk (HA Web Konsole) 0.4.1

2011-05-18 Thread Tim Serong
rmation is available at: http://www.clusterlabs.org/wiki/Hawk Please direct comments, feedback, questions, etc. to myself and/or (preferably) the Pacemaker mailing list. Happy clustering, Tim [1] http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Pacemaker_Explained/ch-constraints.html

Re: [Pacemaker] Failover when storage fails

2011-05-14 Thread Tim Serong
port for that entire time period. Regards, Tim -Original Message- From: Tim Serong [mailto:tser...@novell.com] Sent: 13 May 2011 04:22 To: The Pacemaker cluster resource manager (pacemaker@oss.clusterlabs.org) Subject: Re: [Pacemaker] Failover when storage fails On 5/12/2011 at 02:2

Re: [Pacemaker] Failover when storage fails

2011-05-12 Thread Tim Serong
[2561]: info: perform_op:2884: operation > stop[202] on ocf::Filesystem::MyApp_fs_graph for client 31850, its > parameters: fstype=[ext4] crm_feature_set=[3.0.2] > device=[/dev/VolGroupB00/abb_graph] CRM_meta_timeout=[2] > directory=[/naab1] for rsc is already running. > May 11 12:34:59 host002 lrmd: [2561]: info: perform_op:2894: postponing all > ops on resour

Re: [Pacemaker] addendum: "problems with node membership"

2011-05-11 Thread Tim Serong
need the "freshly installed" > condition of pacemaker without reinstalling the complete package, because a > "fresh" node joins without problems...how can this be done? I'd suggest double-checking the corosync config and network settings (IP addresses and preferably disable an

Re: [Pacemaker] [PATCH]Bug 2567 - crm resource migrate should support an optional "role" parameter

2011-05-10 Thread Tim Serong
usterlabs.org:8010/builders/opensuse-11.3-i386-devel/builds/ > 48/steps/cli_test/logs/stdio > > and > > > http://build.clusterlabs.org:8010/builders/fedora-13-x86_64-devel/builds/48 > /steps/cli_test/logs/stdio > > > > What distro are you on? > &g

Re: [Pacemaker] [pacemaker][patch 3/4] Simple changes for "Pacemaker Explained", Chapter 6 CH_Constraints.xml

2011-05-04 Thread Tim Serong
. > > > >> > >> > >>> >> > >>> >> > >> > >> While we're messing with sets anyway, I'd like to re-hash the idea I > >> brought up on pcmk-devel. To make configuration more com

Re: [Pacemaker] Multi-site support in pacemaker (tokens, deadman, CTR)

2011-04-28 Thread Tim Serong
e or form, preferably configurable > through an RA parameter. What was discussed in Boston is that in an > initial step, Subscriber could simply take an XSLT script, apply it to > the CIB stream with xsltproc, and then update its local CIB with the re

Re: [Pacemaker] Announce: Hawk (HA Web Konsole) 0.4.0

2011-04-23 Thread Tim Serong
On 4/22/2011 at 10:14 PM, Nikita Michalko wrote: > Am Dienstag 19 April 2011 12:59:35 schrieb Tim Serong: > > Greetings All, > > > > This is to announce version 0.4.0 of Hawk, a web-based GUI for > > managing and monitoring Pacemaker High-Availability clusters.

[Pacemaker] Announce: Hawk (HA Web Konsole) 0.4.0

2011-04-19 Thread Tim Serong
ther information is available at: http://www.clusterlabs.org/wiki/Hawk Please direct comments, feedback, questions, etc. to myself and/or (preferably) the Pacemaker mailing list. Happy clustering, Tim -- Tim Serong Senior Clustering Engineer, OPS Engi

Re: [Pacemaker] [pacemaker][patch 3/4] Simple changes for "Pacemaker Explained", Chapter 6 CH_Constraints.xml

2011-04-13 Thread Tim Serong
>>> On 4/13/2011 at 04:37 PM, Andrew Beekhof wrote: > On Wed, Apr 13, 2011 at 8:28 AM, Tim Serong wrote: > > On 4/12/2011 at 05:48 PM, Andrew Beekhof wrote: > >> Here's an example of the before and after. Thoughts? > > > > Looks pretty good t

Re: [Pacemaker] [pacemaker][patch 3/4] Simple changes for "Pacemaker Explained", Chapter 6 CH_Constraints.xml

2011-04-12 Thread Tim Serong
> > > > > > > > After: > > > > > > > > > > > > > > > > > > > > > > > > > > On Mon, Apr 11, 2011 at 6:02 PM, Andrew Beekhof wrote: > > On

Re: [Pacemaker] operative tasks for a pacemaker cluster

2011-04-12 Thread Tim Serong
x27;em. They're invaluable for debugging failures, BTW. Were those 7000 pe-inputs all created over that 7 day period? Because that's a transition every 1.44 minutes. Is it just me, or does that sound like a rather busy cluster? Regards, Tim -- Tim Serong Senior Cluster

Re: [Pacemaker] [pacemaker][patch 3/4] Simple changes for "Pacemaker Explained", Chapter 6 CH_Constraints.xml

2011-04-11 Thread Tim Serong
>>> On 4/11/2011 at 10:23 PM, Andrew Beekhof wrote: > On Mon, Apr 11, 2011 at 2:18 PM, Tim Serong wrote: > > On 4/11/2011 at 09:37 PM, Andrew Beekhof wrote: > >> On Mon, Apr 11, 2011 at 12:57 PM, Tim Serong wrote: > >> > On 3/21/

Re: [Pacemaker] [pacemaker][patch 3/4] Simple changes for "Pacemaker Explained", Chapter 6 CH_Constraints.xml

2011-04-11 Thread Tim Serong
On 4/11/2011 at 09:37 PM, Andrew Beekhof wrote: > On Mon, Apr 11, 2011 at 12:57 PM, Tim Serong wrote: > > On 3/21/2011 at 08:20 PM, Andrew Beekhof wrote: > >> > >> Small improvement to: > >> + The only thing that matters is that in order for any mem

Re: [Pacemaker] [pacemaker][patch 3/4] Simple changes for "Pacemaker Explained", Chapter 6 CH_Constraints.xml

2011-04-11 Thread Tim Serong
ight now should thus be changed as follows in order to match the diagram: - - + + - - + + Regards, Tim -- Tim Serong Senior Clustering Engineer, OPS Engineering, Novell Inc. _

Re: [Pacemaker] Should monitor operations be stopped after a resource is unmanaged?

2011-04-03 Thread Tim Serong
On 4/4/2011 at 04:29 AM, Ron Kerry wrote: > On 7/22/64 2:59 PM, Tim Serong wrote: > > On 4/2/2011 at 09:42 PM, Ron Kerry wrote: > > > On 7/22/64 2:59 PM, Serge Dubrouski wrote: > > > > On Fri, Apr 1, 2011 at 2:09 PM, Ron Kerry wrote: > > > >

Re: [Pacemaker] Should monitor operations be stopped after a resource is unmanaged?

2011-04-02 Thread Tim Serong
tor > failure when the resource > is stopped. Pacemaker then takes the 'onfail' action defined for the monitor > operation. In other > words, the resource is still being managed to some degree. If the monitor > operation was still > running but no action

Re: [Pacemaker] lrmd: WARN: G_SIG_dispatch: Dispatch function for S 1000 ms (> 100 ms) before being called

2011-03-30 Thread Tim Serong
report if I wanted more info. So now I > have an hb_report ready to go. Excuse the naive question, but where/how > do I submit it? http://developerbugs.linux-foundation.org/enter_bug.cgi HTH, Tim -- Tim Serong Senior Clustering Engineer, OPS

Re: [Pacemaker] state of resource when start returns success

2011-03-24 Thread Tim Serong
-slow-resource (ocf::heartbeat:Delay) Started : my-slow-resource_start_0 (node=node-0, call=86, rc=0): complete Regards, Tim -- Tim Serong Senior Clustering Engineer, OPS Engineering, Novell Inc. ___ Pacemaker mailing list: Pacemaker@oss.

Re: [Pacemaker] emulate crm_mon output by xsltproc'essing cibadmin -Ql

2011-03-09 Thread Tim Serong
he most recent op and rc on each node (highest call ID) tells you the state of the resource on that node. Regards, Tim -- Tim Serong Senior Clustering Engineer, OPS Engineering, Novell Inc. ___ Pacemaker mailing list: Pacemaker@oss.clusterlab

Re: [Pacemaker] resource agent starting out-of-order

2011-03-03 Thread Tim Serong
-role="Master" > clone clone-libvirtd p-libvirtd \ > meta interleave="true" > clone clone-lvm_gh p-lvm_gh \ > meta interleave="true" > location cli-standby-p-vd_vg.test1 p-vd_vg.test1 \ > rule $id="cli-standby-rule-p-vd

Re: [Pacemaker] version confusion

2011-03-02 Thread Tim Serong
y updated the version number to 1.0.10. So, yes, you do have version 1.0.10. Try to think of it as an unfortunate typo :) Regards, Tim -- Tim Serong Senior Clustering Engineer, OPS Engineering, Novell Inc. ___ Pacemaker mailing list: Pacemaker@os

Re: [Pacemaker] [Linux-HA] Solved: SLES 11 HAE SP1 Signon to CIB Failed

2011-02-14 Thread Tim Serong
shipped won't start > > pacemaker, I'm not sure if that's on purpose or not, but I found it a > > bit confusing after being used to it 'just working' previously. > > Ah. Understandably confusing. That got fixed post-SP1, in a > maintenance upda

Re: [Pacemaker] Solved: [Linux-HA] SLES 11 HAE SP1 Signon to CIB Failed

2011-02-03 Thread Tim Serong
#x27;t start > pacemaker, I'm not sure if that's on purpose or not, but I found it a > bit confusing after being used to it 'just working' previously. Ah. Understandably confusing. That got fixed post-SP1, in a maintenance update that went out in September or there

Re: [Pacemaker] STONITH external/ssh missing on RHEL 5.5 EPEL 5.4 + ClusterLabs Repo RPM Build?

2010-12-20 Thread Tim Serong
#x27;s intentional, see: http://hg.linux-ha.org/glue/rev/5ef3f9370458 You really don't want to rely on SSH STONITH in a production environment. Regards, Tim -- Tim Serong Senior Clustering Engineer, OPS Engineering, Novell Inc. __

Re: [Pacemaker] Resources not migrating on node failure?

2010-12-01 Thread Tim Serong
hat done automatically? No, you need to define them. > If I need to do it specifically, how do I do that now that I have it all up > and running without defining monitor actions? Run "crm configure edit" and add whichever monitor ops you need. Have a look at Clusters from

Re: [Pacemaker] Extending CTS with other tests

2010-11-30 Thread Tim Serong
" is running. Right? > > Probably To my intense amazement, you can do this: mkfs.ocfs2 --cluster-stack=pcmk --cluster-name=pacemaker /dev/foo This works when the cluster is not running. These parameters are not mentioned anywhere at all in the mkfs.ocfs2 manpage. *sigh* Tim --

Re: [Pacemaker] colocation that doesn't

2010-11-29 Thread Tim Serong
On 11/30/2010 at 10:11 AM, Alan Jones wrote: > On Thu, Nov 25, 2010 at 6:32 AM, Tim Serong wrote: > > Can you elaborate on why you want this particular behaviour? Maybe > > there's some other way to approach the problem? > > I have explained the issue as c

Re: [Pacemaker] colocation that doesn't

2010-11-25 Thread Tim Serong
ou elaborate on why you want this particular behaviour? Maybe there's some other way to approach the problem? (Or maybe someone else can think of a way to express this...) Regards, Tim -- Tim Serong Senior Clustering Engineer, OPS Engineering, Novell Inc. _

Re: [Pacemaker] colocation that doesn't

2010-11-23 Thread Tim Serong
nodeB. Then it tries to place resX, wants to place it where resY is not (nodeA), but can't, due to the -inf score for resX on nodeA. So in this case, resX lands on nodeB as well. If it decides where to put resX first, it puts resX on nodeB because of the -inf score for nodeA. Then

Re: [Pacemaker] "probe" operations always use cluster default operation timeout

2010-11-17 Thread Tim Serong
answering these sorts of questions? There's the Linux Foundation bugzilla: http://developerbugs.linux-foundation.org/ There's also a few mercurial repos. Commit messages tend to be fairly informative: http://hg.clusterlabs.org/pacemaker/1.1/ http://hg.linux-ha.org/ag

Re: [Pacemaker] Help understanding why a failover occurred.

2010-10-17 Thread Tim Serong
ot; or "Promote/Demote/Stop FOO (...)", it means something has changed. Scroll up a bit, to above where pengine is saying "unpack_config", "determine_node_status" etc. and you should see a message suggesting the cause for the change (failed op, timeout, ping attribu

Re: [Pacemaker] Cluster failure with mod_security using rotatelogs

2010-10-10 Thread Tim Serong
x that problem? I've not seen that before, but, just to rule out one possibility... What happens if you just run: /usr/sbin/httpd -DSTATUS -f /etc/httpd/conf/httpd.conf Does that ever return? If no, I'd suggest apache is broken. If yes, I'd start pointing my finger towards o

Re: [Pacemaker] /etc/hosts

2010-09-28 Thread Tim Serong
ell of a lot better than scp and having to remember where to copy what to, and when :) There's a little section on csync2 in the SLE HAE Guide under "Transferring the Configuration to All Nodes" at: http://www.novell.com/documentation/sle_ha/book_sleha/?page=/documentation/sle_ha/b

Re: [Pacemaker] crm_gui login failure

2010-09-28 Thread Tim Serong
any reason. > > So that's strange it was zeroed out. You might need to check the > modification time to recall what was happening. Wild guess - was your system STONITH'd or otherwise forcibly reset, immediately after installing pacemaker

Re: [Pacemaker] Resource stop during migration

2010-08-27 Thread Tim Serong
On 8/27/2010 at 03:22 PM, Michael Smith wrote: > Hi, > > I have a pacemaker setup using the Xen resource agent and I've found > something weird during migration: if a VM is in the middle of > live-migrating from node 1 to node 2, and I stop the resource in crm, > pacemaker forgets about

Re: [Pacemaker] clmvd hangs on node1 if node2 is fenced

2010-08-27 Thread Tim Serong
On 8/27/2010 at 03:37 PM, Michael Smith wrote: > On Thu, 26 Aug 2010, Tim Serong wrote: > > > > for now I have stonith-enabled="false" in > > > my CIB. Is there a way to make clvmd/dlm respect it? > > > > No. At least, I don't think s

Re: [Pacemaker] clmvd hangs on node1 if node2 is fenced

2010-08-26 Thread Tim Serong
On 8/27/2010 at 01:49 PM, Michael Smith wrote: > On Thu, 26 Aug 2010, Tim Serong wrote: > > > > Aug 26 18:31:51 xen-test1 cluster-dlm[8870]: fence_node_time: Node > > > 236655788/xen-test2 has not been shot yet > > > Do you have STONITH configured? Not

Re: [Pacemaker] clmvd hangs on node1 if node2 is fenced

2010-08-26 Thread Tim Serong
1:51 xen-test1 crmd: [8489]: info: ais_dispatch: Membership > 1260: quorum still lost > Aug 26 18:31:51 xen-test1 cluster-dlm: [8870]: info: ais_dispatch: > Membership 1260: quorum still lost Do you have STONITH configured? Note that it says "xen-test2 has not been shot yet"

[Pacemaker] Updated openSUSE packages in network:ha-clustering repo

2010-08-23 Thread Tim Serong
ource-agents 1.0.3 Happy clustering, Tim -- Tim Serong Senior Clustering Engineer, OPS Engineering, Novell Inc. ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home:

Re: [Pacemaker] CFP: Linux Plumbers Mini-Conf on High-Availability/Clustering

2010-08-14 Thread Tim Serong
y the only chance we get to collaborate in one place > this whole year. I actually can't see the original CFP email in the linux-cluster archives. On the bold assumption that *this* email somehow magically makes it to that list, here's the URL to submit proposals: http://www.li

Re: [Pacemaker] Opensuse 11.3

2010-07-26 Thread Tim Serong
d resource-agents (1.0.3). Heartbeat is a bit out of date (2.99.3). There's one problem I'm aware of (can't start openais/corosync on x86_64) but this can be worked around by creating a few symlinks, see the bug report for details: https://bugzilla.novell.co

[Pacemaker] Revenge of the cluster-glue clplumbing ABI change (a public service announcement)

2010-07-21 Thread Tim Serong
has been a public service announcement. Thank you for reading. Tim -- Tim Serong Senior Clustering Engineer, OPS Engineering, Novell Inc. ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pace

Re: [Pacemaker] RFC: cluster-wide attributes

2010-07-05 Thread Tim Serong
On 7/5/2010 at 04:54 PM, Andrew Beekhof wrote: > On Mon, Jul 5, 2010 at 6:21 AM, Tim Serong wrote: > > On 6/30/2010 at 09:42 PM, Andrew Beekhof wrote: > >> On Thu, Jun 24, 2010 at 5:41 PM, Lars Marowsky-Bree > >> wrote: > >> > Hi, > >>

Re: [Pacemaker] RFC: cluster-wide attributes

2010-07-04 Thread Tim Serong
ook twice right? Just for the record, a use case of this came up on IRC last week: you could specify cluster-wide standby="on", so new nodes joining the cluster would automatically join in standby mode, with the admin activating them later (per-node standby="off" thus overridi

Re: [Pacemaker] Pacemaker cant start CTDB

2010-07-02 Thread Tim Serong
/ctdb and replace it with its own like in SLES11.. at least > Ubuntu isn't. Curious. It's *meant* to replace that file. Anything interesting that you can specify in that file should be specified using RA instance parameters. For some notes on this, see: http://linux-ha.org/w

Re: [Pacemaker] Issues with constraints - working for start/stop, being ignored on "failures"

2010-06-06 Thread Tim Serong
On 6/2/2010 at 11:10 AM, Cnut Jansen wrote: > Am 31.05.2010 05:47, schrieb Tim Serong: > > On 5/31/2010 at 12:57 PM, Cnut Jansen wrote: > > > >> Current constraints: > >> colocation TEST_colocO2cb inf: cloneO2cb cloneDlm > >> colocation c

Re: [Pacemaker] Set order on two clone set, but apply on each node

2010-06-06 Thread Tim Serong
cl-mysqld cl-apache > > If i want to apply this rule to each node, what setting should i configure? Try cloning a group, something like: group mysqld-with-apache mysqld apache clone cl-mysqld-with-apache mysqld-with-apache Regards, Tim -- Tim Serong

Re: [Pacemaker] Openais OCF Script Question

2010-05-31 Thread Tim Serong
> > No LSB Primitive named ppsd-6. It was LSB but I had changed it to ocf > recently and somehow still tried to execute the former LSB script. That sounds like bad behaviour. Can you please open a bug and include an hb_report for a time period which shows the errant run of th

Re: [Pacemaker] Issues with constraints - working for start/stop, being ignored on "failures"

2010-05-30 Thread Tim Serong
ave out the ":start" specifiers as this is implicit. > Constraints added to "work around" at least the DRBD-resources left in > state "started (unmanaged) failed": > order GNAH_orderDrbdMysql_stop 0: cloneMountMysql:stop

Re: [Pacemaker] Info on customization of RA

2010-05-30 Thread Tim Serong
on to modify the apache > RA ;-) > > Could this be a runnable approach? > Also to put for example totally new personal RAs in new dirs? Nothing wrong with that. That's actually exactly what you *should* do if you're writing your own RAs that aren't going to

Re: [Pacemaker] Openais OCF Script Question

2010-05-30 Thread Tim Serong
> params externalip="192.168.0.50" \ > op monitor interval="10s" timeout="90s" \ > op start interval="0" timeout="1800s" \ > op stop interval="0" timeout="180s" \ > me

Re: [Pacemaker] Making a resource slightly sticky?

2010-05-13 Thread Tim Serong
e 2 until node 2 fails, at which point they'd > migrate to node 1. Yes, you want the "resource-stickiness" property. Using "crm configure", per resource: # primitive foo \ meta resource-stickiness="1" Or, to make everything a bit sticky: #

Re: [Pacemaker] How SuSEfirewall2 affects on openais startup?

2010-05-13 Thread Tim Serong
] > > > > Don't clone the SBD stonith resource, you only need a single primitive > > here (not that this should be causing your startup trouble). > > sbd fence must be on each node. The sbd daemon needs to be ru

Re: [Pacemaker] How SuSEfirewall2 affects on openais startup?

2010-05-13 Thread Tim Serong
ed everything to come online if you just wait a few minutes. You can watch status changes (if any) as they occur, with "crm_mon -r". It's worth checking /var/log/messages etc. on each node too, to see if anything is obviously screaming in pain. > Full list of resources: &

Re: [Pacemaker] How SuSEfirewall2 affects on openais startup?

2010-05-12 Thread Tim Serong
urces do not run. What does the output of "crm_mon -r1" show in this case? Regards, Tim -- Tim Serong Senior Clustering Engineer, OPS Engineering, Novell Inc. ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clust

Re: [Pacemaker] Announce: HA Web Konsole (Hawk 0.3.3)

2010-04-13 Thread Tim Serong
On 4/14/2010 at 01:59 AM, Dejan Muhamedagic wrote: > On Tue, Apr 13, 2010 at 05:45:02AM -0600, Tim Serong wrote: > > On 4/13/2010 at 08:13 PM, Dejan Muhamedagic wrote: > > > Hi, > > > > > > On Mon, Apr 12, 2010 at 10:56:30PM +0200, Roberto Giordani wr

  1   2   >