On Mon, 14 Apr 2014 14:40:43 +1000
Andrew Beekhof and...@beekhof.net wrote:
On 11 Apr 2014, at 10:54 pm, Marco Felettigh ma...@nucleus.it wrote:
On Fri, 11 Apr 2014 17:17:57 +1000
Andrew Beekhof and...@beekhof.net wrote:
On 8 Apr 2014, at 8:37 pm, ma...@nucleus.it wrote:
On
On 11 Apr 2014, at 10:54 pm, Marco Felettigh ma...@nucleus.it wrote:
On Fri, 11 Apr 2014 17:17:57 +1000
Andrew Beekhof and...@beekhof.net wrote:
On 8 Apr 2014, at 8:37 pm, ma...@nucleus.it wrote:
On Tue, 8 Apr 2014 10:49:16 +1000
Andrew Beekhof and...@beekhof.net wrote:
On 7 Apr
On Fri, 11 Apr 2014 17:17:57 +1000
Andrew Beekhof and...@beekhof.net wrote:
On 8 Apr 2014, at 8:37 pm, ma...@nucleus.it wrote:
On Tue, 8 Apr 2014 10:49:16 +1000
Andrew Beekhof and...@beekhof.net wrote:
On 7 Apr 2014, at 8:46 pm, ma...@nucleus.it wrote:
Hi,
in a production
Hi,
in a production environment with 2 nodes ( nodeA , nodeB ) we had a
hardware failure, so we restarted nodeB.
After the restarted nodeB came up we restarted corosync/pacemaker on it,
but for 2 days now the corosync/pacemaker stuff has been looping.
crm_mon NodeA:
Stack: openais
Current DC:
On 7 Apr 2014, at 8:46 pm, ma...@nucleus.it wrote:
Hi,
in a production environment with 2 nodes ( nodeA , nodeB ) we had a
hardware failure, so we restarted nodeB.
After the restarted nodeB came up we restarted corosync/pacemaker on it,
but for 2 days now the corosync/pacemaker stuff is
Hi all,
I use pacemaker 1.1.9 with corosync 2.3 both built from source.
My OS is CentOS 6.4 x86_64
I have about 30 resources of one type managed by my own resource agent.
It is necessary for the resource agent to know the utilization parameter of
the configured resource. I query for this
On 24/09/2013, at 2:09 AM, Халезов Иван i.khale...@rts.ru wrote:
Hi all,
I use pacemaker 1.1.9 with corosync 2.3 both built from source.
My OS is CentOS 6.4 x86_64
I have about 30 resources of one type managed by my own resource agent. It is
necessary for the resource agent to know
Hi,
On Wed, 23 Jan 2013 18:52:20 +0100
Dejan Muhamedagic deja...@fastmail.fm wrote:
nodes
node id=35956928 uname=sipc2n2
Not sure if id can start with a digit.
Corosync node id's are always digits-only.
This should really work with versions = v1.2.4
Yeah… I have looked into
On Thu, 24 Jan 2013 09:04:14 +0100
Jacek Konieczny jaj...@jajcus.net wrote:
I should probably upgrade my CIB somehow
Indeed. 'cibadmin --upgrade --force' solved my problem.
Thanks for all the hints.
Greets,
Jacek
___
Pacemaker mailing list:
On Thu, Jan 24, 2013 at 09:04:14AM +0100, Jacek Konieczny wrote:
Hi,
On Wed, 23 Jan 2013 18:52:20 +0100
Dejan Muhamedagic deja...@fastmail.fm wrote:
nodes
node id=35956928 uname=sipc2n2
Not sure if id can start with a digit.
Corosync node id's are always digits-only.
On Thu, Jan 24, 2013 at 09:10:33AM +0100, Jacek Konieczny wrote:
On Thu, 24 Jan 2013 09:04:14 +0100
Jacek Konieczny jaj...@jajcus.net wrote:
I should probably upgrade my CIB somehow
Indeed. 'cibadmin --upgrade --force' solved my problem.
Thanks for all the hints.
crm(live)configure# help
Hi,
I have recently upgraded Pacemaker on one of my clusters from
1.0.something to 1.1.8 and installed crmsh to manage it as I used to.
crmsh mostly works for me, until I try to change the configuration with
'crm configure'. Any change, even a trivial one, shows verification errors and
fails to commit:
On 2013-01-23T16:31:20, Jacek Konieczny jaj...@jajcus.net wrote:
I have recently upgraded Pacemaker on one of my clusters from
1.0.something to 1.1.8 and installed crmsh to manage it as I used to.
It'd be helpful if you mentioned which crmsh version you installed. The
errors you get suggest
On Wed, 23 Jan 2013 16:44:45 +0100
Lars Marowsky-Bree l...@suse.com wrote:
On 2013-01-23T16:31:20, Jacek Konieczny jaj...@jajcus.net wrote:
I have recently upgraded Pacemaker on one of my clusters from
1.0.something to 1.1.8 and installed crmsh to manage it as I used
to.
It'd be
Hi,
On Wed, Jan 23, 2013 at 04:31:20PM +0100, Jacek Konieczny wrote:
Hi,
I have recently upgraded Pacemaker on one of my clusters from
1.0.something to 1.1.8 and installed crmsh to manage it as I used to.
crmsh mostly works for me, until I try to change the configuration with
'crm
Normally we log an error at startup if we can't write there... did
this not happen?
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Yes, it happened. I saw a warning while writing
Hi there,
a strange thing happened to my two node cluster: I rebooted both machines
at the same time; when the OS came up again, no resources were configured
anymore, as if it were a fresh installation. Why?
It was explained to me that the configuration of resources managed by
pacemaker should be in
On Thu, Mar 29, 2012 at 9:54 AM, Fiorenza Meini fme...@esseweb.eu wrote:
Hi there,
a strange thing happened to my two node cluster: I rebooted both machines at
the same time; when the OS came up again, no resources were configured
anymore, as if it were a fresh installation. Why?
It was explained
On 29/03/2012 10:12, Rasto Levrinc wrote:
On Thu, Mar 29, 2012 at 9:54 AM, Fiorenza Meini fme...@esseweb.eu wrote:
Hi there,
a strange thing happened to my two node cluster: I rebooted both machines at
the same time; when the OS came up again, no resources were configured
anymore, as if it were a
On Thu, Mar 29, 2012 at 8:45 PM, Fiorenza Meini fme...@esseweb.eu wrote:
On 29/03/2012 10:12, Rasto Levrinc wrote:
On Thu, Mar 29, 2012 at 9:54 AM, Fiorenza Meini fme...@esseweb.eu wrote:
Hi there,
a strange thing happened to my two node cluster: I rebooted both machines
at
the same
On Tue, Oct 25, 2011 at 4:08 AM, Proskurin Kirill
k.prosku...@corp.mail.ru wrote:
Hello.
corosync-1.4.1
pacemaker-1.1.5
pacemaker runs with ver: 1
I ran into a strange problem. Hope someone can help me.
I have a 9-node cluster. All was fine until I needed to reboot a node.
After reboot it doesn't
On Fri, Oct 1, 2010 at 3:45 PM, Shravan Mishra shravan.mis...@gmail.com wrote:
Hi,
Just a quick question, who generates the very first cib.xml when
pacemaker processes are initialized?
The cib
Thanks
Shravan
On Thu, Sep 30, 2010 at 4:22 AM, Andrew Beekhof and...@beekhof.net wrote:
On
Hi,
Just a quick question, who generates the very first cib.xml when
pacemaker processes are initialized?
Thanks
Shravan
On Thu, Sep 30, 2010 at 4:22 AM, Andrew Beekhof and...@beekhof.net wrote:
On Tue, Sep 28, 2010 at 11:47 AM, Andrew Beekhof and...@beekhof.net wrote:
On Mon, Sep 27, 2010 at
On Tue, Sep 28, 2010 at 11:47 AM, Andrew Beekhof and...@beekhof.net wrote:
On Mon, Sep 27, 2010 at 6:26 AM, Shravan Mishra
shravan.mis...@gmail.com wrote:
Thanks Raoul for the response.
Changing the permission to hacluster:haclient did stop that error.
Now I'm hitting another problem
Hi,
I did a bt on the core, this is what I found:
==
Core was generated by `/usr/lib64/heartbeat/cib'.
Program terminated with signal 11, Segmentation fault.
[New process 12340]
#0 0x7f23acc553fa in strncmp () from /lib64/libc.so.6
(gdb) bt
#0 0x7f23acc553fa in strncmp ()
Some more info:
root 14170 14166 0 12:23 ? 00:00:00 /usr/lib64/heartbeat/stonithd
nobody 14172 14166 0 12:23 ? 00:00:00 /usr/lib64/heartbeat/lrmd
82 14173 14166 0 12:23 ? 00:00:00 /usr/lib64/heartbeat/attrd
82 14174 14166 0 12:23 ? 00:00:00
On Mon, Sep 27, 2010 at 6:26 AM, Shravan Mishra
shravan.mis...@gmail.com wrote:
Thanks Raoul for the response.
Changing the permission to hacluster:haclient did stop that error.
Now I'm hitting another problem whereby cib is failing to start
Very strange logs.
Which distribution is this?
Sorry forgot to attach my corosync.conf.
=
totem {
version: 2
# token: 3000
# token_retransmits_before_loss_const: 10
# join: 60
# consensus: 1500
# vsftype: none
# max_messages: 20
# clear_node_high_bit: yes
secauth: off
On 24.09.2010 21:41, Shravan Mishra wrote:
crmd[20612]: 2010/09/24_15:29:57 ERROR: crm_log_init_worker: Cannot
change active directory to /var/lib/heartbeat/cores/hacluster:
Permission denied (13)
ls -ald /var/lib/heartbeat/cores/hacluster /var/lib/heartbeat/cores/
/var/lib/heartbeat/
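A "Permission denied (13)" on a directory can come from the directory itself or from any parent that the user cannot traverse, which is why the `ls -ald` request above lists several levels. A minimal Python sketch of the same check follows; `describe_path_chain` is a hypothetical helper name, not part of Pacemaker or Heartbeat.

```python
import os
import stat


def describe_path_chain(path):
    """Return (path, mode_string, uid, gid) for `path` and each parent up to the root."""
    rows = []
    current = os.path.abspath(path)
    while True:
        info = os.stat(current)
        # stat.filemode renders the mode the way `ls -l` does, e.g. 'drwxr-x---'
        rows.append((current, stat.filemode(info.st_mode), info.st_uid, info.st_gid))
        parent = os.path.dirname(current)
        if parent == current:  # reached the filesystem root
            break
        current = parent
    return rows
```

Calling it with the path from the log line, for example `describe_path_chain("/var/lib/heartbeat/cores/hacluster")`, should show which level is not owned by or accessible to the cluster user.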
Hi All,
We recently upgraded to
/usr/sbin/corosync -v
Corosync Cluster Engine, version '1.2.1' SVN revision '2723:2724'
Copyright (c) 2006-2009 Red Hat, Inc.
In my logs I see the following lines:
crmd[20612]: 2010/09/24_15:29:57 ERROR: crm_log_init_worker: Cannot
change active directory to
I spoke to Steve, and the only thing he could come up with was that
the group might not be correct.
When the cluster is in this state, please run:
ps x -o pid,euser,ruser,egroup,rgroup,command
And compare it to the normal output.
Also, confirm that there is only one group named haclient, and
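The check for a duplicated group name can be sketched in Python. This assumes groups are defined in a local `/etc/group`-style file, so it does not see LDAP or NIS entries; `duplicate_group_names` is a hypothetical helper, not an existing tool.

```python
from collections import Counter


def duplicate_group_names(group_file_text):
    """Return sorted group names that appear more than once in /etc/group-style text."""
    names = []
    for line in group_file_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        # The group name is the first colon-separated field.
        names.append(line.split(":", 1)[0])
    counts = Counter(names)
    return sorted(name for name, n in counts.items() if n > 1)
```

Feeding it the contents of `/etc/group` and getting `haclient` back in the result would suggest the group is defined twice.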
Andrew Beekhof wrote:
I spoke to Steve, and the only thing he could come up with was that
the group might not be correct.
When the cluster is in this state, please run:
ps x -o pid,euser,ruser,egroup,rgroup,command
And compare it to the normal output.
Also, confirm that there is only one
On Thu, Sep 2, 2010 at 2:18 PM, Michael Smith msm...@cbnco.com wrote:
On Thu, 2 Sep 2010, Andrew Beekhof wrote:
On Mon, Aug 30, 2010 at 10:04 PM, Michael Smith msm...@cbnco.com wrote:
Hi,
I have a pacemaker/corosync setup on a bunch of fully patched SLES11 SP1
systems. On one of the
On Mon, 6 Sep 2010, Andrew Beekhof wrote:
Is /dev/shm full (or not mounted) by any chance?
No - I tried clearing that out, too.
And corosync is actually running?
Yes, it's logging [IPC ] Invalid IPC credentials. when cib tries to
connect.
Mike
On Mon, Aug 30, 2010 at 10:04 PM, Michael Smith msm...@cbnco.com wrote:
Hi,
I have a pacemaker/corosync setup on a bunch of fully patched SLES11 SP1
systems. On one of the systems, if I /etc/init.d/openais stop, then
/etc/init.d/openais start, pacemaker fails to come up:
Is /dev/shm full (or
On Thu, 2 Sep 2010, Andrew Beekhof wrote:
On Mon, Aug 30, 2010 at 10:04 PM, Michael Smith msm...@cbnco.com wrote:
Hi,
I have a pacemaker/corosync setup on a bunch of fully patched SLES11 SP1
systems. On one of the systems, if I /etc/init.d/openais stop, then
/etc/init.d/openais start,
Hi,
I have a pacemaker/corosync setup on a bunch of fully patched SLES11 SP1
systems. On one of the systems, if I /etc/init.d/openais stop, then
/etc/init.d/openais start, pacemaker fails to come up:
Aug 30 15:48:09 xen-test1 cib: [5858]: info: crm_cluster_connect:
Connecting to OpenAIS
Aug
Lars Ellenberg wrote:
On Thu, Apr 01, 2010 at 08:27:02AM -0600, Alan Robertson wrote:
Lars Ellenberg wrote:
On Thu, Apr 01, 2010 at 12:12:47AM -0600, Alan Robertson wrote:
OK
Since there was no ssh-as-root between the cluster nodes, I didn't
send all the logs along from every node in the
On Fri, Apr 02, 2010 at 08:16:32AM -0600, Alan Robertson wrote:
Do it again, with higher log level. Sorry, no time right now to rebuild
your exact thing with your exact gcc and stuff to look at your core file.
You can just download the RPM and extract the objects. That's what I used.
core
OK
Since there was no ssh-as-root between the cluster nodes, I didn't send
all the logs along from every node in the cluster - and it didn't occur
to me to look at all of them.
However, the problem has gotten curiouser and curiouser - because ALL the
nodes in the cluster reported the same
On Thu, Apr 01, 2010 at 12:12:47AM -0600, Alan Robertson wrote:
OK
Since there was no ssh-as-root between the cluster nodes, I didn't
send all the logs along from every node in the cluster - and it
didn't occur to me to look at all of them.
However, the problem has gotten curiouser and
Lars Ellenberg wrote:
On Thu, Apr 01, 2010 at 12:12:47AM -0600, Alan Robertson wrote:
OK
Since there was no ssh-as-root between the cluster nodes, I didn't
send all the logs along from every node in the cluster - and it
didn't occur to me to look at all of them.
However, the problem has
On 2010-04-01 16:27, Alan Robertson wrote:
None of them verified. All the nodes in the cluster failed the test at
the same time - and now I have no official CIBs on disk - on any cluster
nodes... I sent Andrew all the CIBs, and all the core files, and
basically everything under
Florian Haas wrote:
On 2010-04-01 16:27, Alan Robertson wrote:
None of them verified. All the nodes in the cluster failed the test at
the same time - and now I have no official CIBs on disk - on any cluster
nodes... I sent Andrew all the CIBs, and all the core files, and
basically everything
On Thu, Apr 01, 2010 at 08:27:02AM -0600, Alan Robertson wrote:
Lars Ellenberg wrote:
On Thu, Apr 01, 2010 at 12:12:47AM -0600, Alan Robertson wrote:
OK
Since there was no ssh-as-root between the cluster nodes, I didn't
send all the logs along from every node in the cluster - and it
Hi,
I've run into what looks at first blush to be a CIB bug in writing to disk.
The key messages from this incident are these:
Mar 31 19:02:52 vhost0384 cib: [13294]: ERROR: validate_cib_digest:
Digest comparision failed: expected 316049fa7ee8d2e107573ce7cded07cf
Please
- enable coredumps (set ulimit -c unlimited at the top of the
corosync init file)
- use hb_report to create a support tarball covering the problem
- attach the tarball to a new bug:
http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
That's the minimum we'd
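To confirm that a `ulimit -c unlimited` in the init script actually reached a running daemon on Linux, one option is to read `/proc/<pid>/limits` for that process. A rough Python sketch, assuming the usual Linux layout of that file; `core_file_limits` is a hypothetical helper name.

```python
def core_file_limits(limits_text):
    """Return (soft, hard) for 'Max core file size' from /proc/<pid>/limits text.

    Returns None if the line is not present.
    """
    prefix = "Max core file size"
    for line in limits_text.splitlines():
        if line.startswith(prefix):
            # Remaining columns are: soft limit, hard limit, units
            fields = line[len(prefix):].split()
            return fields[0], fields[1]
    return None
```

A soft limit of `0` for the daemon would mean no core file is written, whatever the limit is in an interactive shell.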
And you'll also want this patch for the crmd
diff -r 4619c842d58c crmd/callbacks.c
--- a/crmd/callbacks.c Fri May 22 16:52:14 2009 +0200
+++ b/crmd/callbacks.c Fri May 22 21:34:12 2009 +0200
@@ -179,7 +179,6 @@ crmd_ha_msg_callback(HA_Message *hamsg,
} else {
Ah, well that was pretty obvious.
/me humbly apologizes for such a stupid error.
(It wasn't caught by my own valgrind testing because this function is
specific to heartbeat based clusters)
Try this:
diff -r ea5d0b58c0be cib/callbacks.c
--- a/cib/callbacks.c Wed May 20 11:56:39 2009 +0200
+++
On Wed, May 20, 2009 at 02:02:52PM +0200, Andrew Beekhof wrote:
Ah, well that was pretty obvious.
/me humbly apologizes for such a stupid error.
Hi and thanks! no problem
(It wasn't caught by my own valgrind testing because this function is
specific to heartbeat based clusters)
don't worry,
On Sat, May 16, 2009 at 10:33 PM, Nikola Ciprich
extmaill...@linuxbox.cz wrote:
Hi guys,
I was able to enable valgrind on our production cluster today,
but unfortunately only on the secondary node, I'll be allowed to enable
it on primary node hopefully during next weekend.
Unfortunately it
Hi,
Dejan, thanks a lot. I compiled your version, but crmd with the shipped pacemaker
keeps segfaulting with it, and I am unable to rebuild pacemaker with this
heartbeat to get the -debug package.
compilation fails with:
plugin.c: In function 'check_message_sanity':
plugin.c:1190: warning: format '%d'
On Thu, May 14, 2009 at 3:58 PM, Nikola Ciprich extmaill...@linuxbox.cz wrote:
Hi,
Dejan, thanks a lot. I compiled your version, but crmd with the shipped pacemaker
keeps segfaulting with it, and I am unable to rebuild pacemaker with this
heartbeat to get the -debug package.
compilation fails with:
Hi guys,
sooo I've got valgrind grinding:)
I had some trouble getting the latest stuff working, so I used heartbeat-2.99.2
with Dejan's (fixed) patch and --enable-valgrind
--with-valgrind-log=--log-file=/tmp/crm-%p.valgrind
and recompiled pacemaker-1.0.3 (without openais as Andrew suggested).
54 matches