Re: [Pacemaker] cluster heartbeat is not used

2013-12-09 Thread emmanuel segura
why you are editing corosync.conf, but as cluster stack, you are using cman? 2013/12/9 emmanuel segura > because in your corosync-cfgtool -s you are using bonding address > > > > > 2013/12/9 Dvorak Andreas > >> Hi >> >> >> >> Here it is >&

Re: [Pacemaker] cluster heartbeat is not used

2013-12-09 Thread emmanuel segura
show cat /proc/net/bonding/bond0 2013/12/9 Dvorak Andreas > Dear all, > > > > during failover tests I found out that I can put down the heartbeat > interfaces and the cluster ignores that. But if I put down bond0 the > fencing is running. > > Can please somebody help me? > > > > bond0 Lin

Re: [Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread emmanuel segura
p_conntrackd \ > > meta target-role="Started" > > clone pingclone p_ping \ > > meta interleave="true" > > location groupwithping cluster1 \ > > rule $id="groupwithping-rule" pingd: defined pingd > > colocation clu

Re: [Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread emmanuel segura
where is your config? 2013/12/9 Bauer, Stefan (IZLBW Extern) > Hi List, > > > > even though following well known documentations about a ping clone > resource my resources are not moving to the node with the better > connectivity: > > > > 2 Nodes configured, 2 expected votes > > 6 Resources con

Re: [Pacemaker] configuration of stonith

2013-12-08 Thread emmanuel segura
shouldn't the delay's be different to avoid a stonith-battle? yes, in redhat cluster it was like that 2013/12/8 Masopust, Christian > > > > -Ursprüngliche Nachricht- > > Von: Digimer [mailto:li...@alteeve.ca] > > Gesendet: Freitag, 06. Dezember 2013 17:20 > > An: m...@sys4.de; The Pacem

Re: [Pacemaker] configuration of stonith

2013-12-06 Thread emmanuel segura
make two resources pcs -f stonith_cfg stonith create impi-fencing fence_ipmilan pcmk_host_list="sv2836" ipaddr=10.0.0.1 login=testuser passwd=acd123 op monitor interval=60s pcs -f stonith_cfg stonith create impi-fencing fence_ipmilan pcmk_host_list="sv2837" ipaddr=10.0.0.2 login=testuser passwd=a

Re: [Pacemaker] Pacemaker 1.1.10 and pacemaker-remote

2013-12-05 Thread emmanuel segura
what happen if you try ping db0 from your physical host? 2013/12/5 James Oakley > On Thursday, December 5, 2013 9:21:18 AM "Lars Marowsky-Bree" < > l...@suse.com> wrote: > > > primitive lxc_db0 @lxc \ > > > params container="db0" config="/var/lib/lxc/db0/config" \ > > > meta rem

Re: [Pacemaker] Pacemaker 1.1.10 and pacemaker-remote

2013-12-05 Thread emmanuel segura
Follow the Lars comment "I think this is because crm doesn't know about the remote-node attribute" 2013/12/5 James Oakley > On Thursday, December 5, 2013 9:08:46 AM "emmanuel segura" < > emi2f...@gmail.com> wrote: > > did you try in th

Re: [Pacemaker] Pacemaker 1.1.10 and pacemaker-remote

2013-12-05 Thread emmanuel segura
did you try in this following way? primitive lxc_db0 ocf:heartbeat:lxc \ params container="db0" config="/var/lib/lxc/db0/config" \ meta remote-node="db0" 2013/12/5 James Oakley > I have Pacemaker 1.1.10 cluster running on openSUSE 13.1 and I am trying > to get pacemaker-remote

Re: [Pacemaker] crm_mon segment fault con fedora 20

2013-11-11 Thread emmanuel segura
you can find the two files in the attachment Thanks 2013/11/11 Andrew Beekhof > > On 9 Nov 2013, at 8:56 am, emmanuel segura wrote: > > > Hello Andrew, > > > > You can the file in the attachment. > > It would be very useful to know what is NULL at:

Re: [Pacemaker] crm_mon segment fault con fedora 20

2013-11-08 Thread emmanuel segura
Hello Andrew, You can the file in the attachment. 2013/11/8 Andrew Beekhof > > On 6 Nov 2013, at 9:36 am, emmanuel segura wrote: > > > Hello everybody, > > > > On Fedora 20 i got a crm_mon segment fault with the following > configuration http://ur1.ca/fzndq m

Re: [Pacemaker] Stonith question

2013-11-08 Thread emmanuel segura
with location constrain. if you need info about constrains, you can look the clusterlab docs 2013/11/8 s.oreilly > That's what I thought. How do I specify which node to run them on? > > Many thanks > > Sean O'Reilly > > On Fri 08/11/13 1:08 PM , "emma

Re: [Pacemaker] Stonith question

2013-11-08 Thread emmanuel segura
fence of host1 needs to be running on host2 and fence of host2 needs to be running on host1 2013/11/8 s.oreilly > I am trying to configure stonith on a 2 node cluster. > > Using fence_vmware_soap and it works manually > > I configure stonith as below > > pcs stonith create test-stonith1 params

[Pacemaker] crm_mon segment fault con fedora 20

2013-11-05 Thread emmanuel segura
Hello everybody, On Fedora 20 i got a crm_mon segment fault with the following configuration http://ur1.ca/fzndq maybe my configuration is wrong, but in any case the is what i saw with gdb http://ur1.ca/fznf2 [root@pcs1 ~]# rpm -qa | grep pacemaker pacemaker-remote-1.1.9-3.fc20.2.x86_64 pacemaker

Re: [Pacemaker] problem with config

2013-11-04 Thread emmanuel segura
http://clusterlabs.org/doc/en-US/Pacemaker/1.0/html/Pacemaker_Explained/s-failure-migration.html 2013/11/4 Alex Samad - Yieldbroker > Hi > > > I have attached my config at the bottom. But very basically when 1 > resource is failing it is not restarting on the other node ? strangely this > used

Re: [Pacemaker] Question regarding collocation

2013-11-02 Thread emmanuel segura
your order constrain it should be order PSQL_ORDER inf: PSQL_DISK_MS:promote PSQL:start 2013/11/2 Neocox > Hi! > > > > I have Corosync + Pacemaker installed on my Debian wheezy: > > root@rasp02:~# dpkg -s pacemaker | grep ^Version > > Version: 1.1.7-1 > > root@rasp02:~# dpkg -s corosync | grep

Re: [Pacemaker] Master resource never being promoted

2013-11-01 Thread emmanuel segura
i don't know if is this the problem, but this colacation are wrong colocation c_g_premount_on_drbd_meta inf: ms_drbd_meta:Master g_premount g_postmount it should be colocation c_g_premount_on_drbd_meta inf: g_premount g_postmount ms_drbd_meta:Master 2013/11/1 David Dunsmore > Hello, > > I am

Re: [Pacemaker] Integrating Xen with Pacemaker and DRBD

2013-10-27 Thread emmanuel segura
>From drbd site DOCS 13.4. Using DRBD VBDs In order to use a DRBD resource as the virtual block device, you must add a line like the following to your Xen domU configuration: disk = [ 'drbd:,xvda,w' ] This example configuration makes the DRBD resource named *resource*available to the domU as /d

Re: [Pacemaker] Resource only failsover in one direction

2013-10-22 Thread emmanuel segura
OCF_ROOT=/usr/lib/ocf/ OCF_RESKEY_configfile="/etc/nginx/nginx.conf" /usr/lib/ocf/resource.d/heartbeat/nginx start 2013/10/22 Lucas Brown > > Date: Tue, 22 Oct 2013 07:27:00 +0200 > > From: emmanuel segura > > To: The Pacemaker cluster resource manager > >

Re: [Pacemaker] Resource only failsover in one direction

2013-10-21 Thread emmanuel segura
try crm ra test nginx lb02 start 2013/10/22 Lucas Brown > Hey guys, > > I'm encountering a really strange problem testing failover of my > ocf:heartbeat:nginx resource in my 2 node cluster. I am able to manually > migrate the resource around the nodes and that works fine, but I can't get > the

Re: [Pacemaker] [pacemaker] DRBD + corosync + pacemaker + postgresql

2013-10-15 Thread emmanuel segura
ource# list >> Master/Slave Set: ms_drbd_postgresql [drbd_postgresql] >> Stopped: [ drbd_postgresql:0 drbd_postgresql:1 ] >> Resource Group: postgresql >> fs_postgresql (ocf::heartbeat:Filesystem) Stopped >> vip_cluster(ocf

Re: [Pacemaker] Offline Cluster edit

2013-10-15 Thread emmanuel segura
Backup your current cib.xml and modify by hand and after that, try to start the cluster 2013/10/15 Robert Lindgren > Yeah it's a nice idea, but servers are at datacenter, some hours away. > > > On Tue, Oct 15, 2013 at 10:42 AM, Florian Crouzat < > gen...@floriancrouzat.net> wrote: > >> Le 15/1

Re: [Pacemaker] Offline Cluster edit

2013-10-15 Thread emmanuel segura
+1 2013/10/15 Florian Crouzat > Le 15/10/2013 09:39, Robert Lindgren a écrit : > > I have a cluster that is offline, and I can't start it to do edits >> (since IPs and so will conflict with old cluster). What is the preferred >> way of doing the edits (change IPs) so that I can start the clust

Re: [Pacemaker] Offline Cluster edit

2013-10-15 Thread emmanuel segura
Why modify the cib.xml? in cib.xml there is no reference to ip, i think you have hostname there, i think you need to edit /etc/hosts and try to start the cluster again 2013/10/15 Robert Lindgren > Hi, > > I have a cluster that is offline, and I can't start it to do edits (since > IPs and so wil

[Pacemaker] Fedora 20 Alpath with pcs and crm

2013-10-12 Thread emmanuel segura
Hello list I'm testing Fedora 20 Alpath with the new tool pcs, but i saw in pacemaker this two parameters "rsc_defaults resource-stickiness" and "property default-resource-stickiness" what is the defirente between then? pacemaker-1.1.9-3.fc20.2.x86_64 Thanks _

Re: [Pacemaker] [pacemaker] DRBD + corosync + pacemaker + postgresql

2013-10-11 Thread emmanuel segura
try with this constrains colocation col_postgresql inf: postgresql_cluster ms_drbd_postgresql:Master order or_postgresql inf: ms_drbd_postgresql:promote postgresql_cluster:start 2013/10/11 Thomaz Luiz Santos > Dear all! > > I'm trying to make a sample cluster, in virtual machine, and after mi

Re: [Pacemaker] Pacemaker/DRBD troubles

2013-09-23 Thread emmanuel segura
i'm not sure if this is the problem, but i think you only need one order constrain like this 2013/9/23 David Parker > Hello, > > I'm attempting to set up a simple NFS failover test using Pacemaker and > DRBD on 2 nodes. The goal is to have one host be the DRBD master, and have > the volume m

Re: [Pacemaker] Error when managing network with ping/pingd.

2013-08-29 Thread emmanuel segura
> I retrieved a "schoreshow" script but I do not understand the result. > > Best regards. > > Francis > > > On 08/29/2013 10:32 AM, emmanuel segura wrote: > >> I think your score is wrong in your rule expression >> >> >> 2013/8/29 Francis

Re: [Pacemaker] Error when managing network with ping/pingd.

2013-08-29 Thread emmanuel segura
I think your score is wrong in your rule expression 2013/8/29 Francis SOUYRI > Hello, > > I have a corosync/pacemaker with 2 nodes and 2 nets by nodes, > 192.168.1.0/24 for cluster access, 10.1.1.0/24 for drbd in bond, both > used by corosync. > I try to used ocf:pacemaker:ping to monitor the 1

Re: [Pacemaker] crm_mon --as-xml not reporting failcounts and failed actions

2013-07-30 Thread emmanuel segura
crm_mon -1Arof 2013/7/30 Nikola Ciprich > Hi, > > we'd like to use crm_mon for gathering status data. For simple parsing, > we're using XML format, but the problem is, I haven't found way to get > failed actions and failure counters... Is it possible to get those somehow? > > thanks a lot in ad

Re: [Pacemaker] Pacemaker not starting in 2nd node of a 2 node cluster

2013-07-29 Thread emmanuel segura
Hello That would say you have problem with your multicast :) 2013/7/29 Enric Muñoz > Using unicast it is working well. Thank you very much. > > ** ** > > *De:* Michael Schwartzkopff [mailto:mi...@clusterbau.com] > *Enviado el:* lunes, 29 de julio de 2013 13:17 > *Para:* The Pacemaker cl

Re: [Pacemaker] Pacemaker not starting in 2nd node of a 2 node cluster

2013-07-29 Thread emmanuel segura
Hello If you are using multicast check your igmp switch support is enabled. Thanks 2013/7/29 Enric Muñoz > Iptables is disabled and selinux set to permissive in both nodes. > > ** ** > > *De:* Michael Schwartzkopff [mailto:mi...@clusterbau.com] > *Enviado el:* lunes, 29 de julio de 2013

Re: [Pacemaker] Pacemaker not starting in 2nd node of a 2 node cluster

2013-07-29 Thread emmanuel segura
Hello You need configure stonith for your cluster. Thanks 2013/7/29 Enric Muñoz > Hi all, > > ** ** > > I am trying to build a 2 node iSCSI HA storage cluster with Pacemaker, > Corosync and DRBD on CentOS 6.4. I have problems while Building the > cluster. The problem is that pacemaker is

Re: [Pacemaker] Question about the behavior when a pacemaker's process crashed

2013-07-12 Thread emmanuel segura
Hi try to disable selinux +++ Jul 12 16:46:10 dev1 setroubleshoot: SELinux is preventing /usr/bin/virsh from getattr access on the file /usr/bin/ssh. For complete SELinux messages. run sealert -l 3d5afba4-40a5-41ff-9530-3839da8a8c

Re: [Pacemaker] again trouble with quorum (now with cman)

2013-07-11 Thread emmanuel segura
The stonith should be enabled, if you wanna your node joint the cluster in clean state after you disconnect the cable. Thanks 2013/7/11 Andrey Groshev > Hi again! > I've played enough with corosync 2.3.x. nothing good yet. > Now I try build cluster with corosync/cman/pacemaker. > I started

Re: [Pacemaker] crmsh dosn't respect the acl read permissions

2013-07-09 Thread emmanuel segura
Thanks Andrew I know is easy, but for enable this option the pacemaker needs to be compiled with --with-acl, if i understand well Thanks 2013/7/9 Andrew Beekhof > > On 09/07/2013, at 4:58 PM, emmanuel segura wrote: > > > Hello Andrew > > > > please, can you tell m

Re: [Pacemaker] crmsh dosn't respect the acl read permissions

2013-07-09 Thread emmanuel segura
Hello Andrew please, can you tell me why? Thanks 2013/7/9 Andrew Beekhof > > On 09/07/2013, at 3:29 PM, emmanuel segura wrote: > > > Hi > > > > I compiled pacemaker using the following commands > > > > git clone git://github.com/ClusterLabs/pacemake

Re: [Pacemaker] crmsh dosn't respect the acl read permissions

2013-07-08 Thread emmanuel segura
t; listed in the output of > "cibadmin -!"? > > Regards, > Gao,Yan > > On 07/08/13 17:57, emmanuel segura wrote: > > Hi > > > > I did > > > > Thanks > > > > > > 2013/7/8 Dejan Muhamedagic > <mailto:deja...@fastmail.fm>

Re: [Pacemaker] crmsh dosn't respect the acl read permissions

2013-07-08 Thread emmanuel segura
Hi I did Thanks 2013/7/8 Dejan Muhamedagic > Hi, > > On Mon, Jul 08, 2013 at 12:52:07AM +0200, emmanuel segura wrote: > > Hello List > > > > Maybe this is wrong the wrong list, but now i'm playing with pacemaker > > 1.10 and a i see the crmsh dosn&

[Pacemaker] crmsh dosn't respect the acl read permissions

2013-07-07 Thread emmanuel segura
Hello List Maybe this is wrong the wrong list, but now i'm playing with pacemaker 1.10 and a i see the crmsh dosn't respeact the read permissions like i show below ^^^ [root@nod

Re: [Pacemaker] Node name problems after upgrading to 1.1.9

2013-06-27 Thread emmanuel segura
Hello Bernardo I don't know if this is the problem, but try this option clear_node_high_bit This configuration option is optional and is only relevant when no nodeid is specified. Some openais clients require a signed 32 bit nodeid that is greater than zer

Re: [Pacemaker] ERROR: Wrong stack o2cb

2013-06-25 Thread emmanuel segura
Hello Denis If you use ocfs with pacemaker, you don't need to configure ocfs in legacy mode using /etc/ocfs2/cluster.conf Thanks Emmanuel 2013/6/25 Denis Witt > Hi List, > > I'm having trouble getting OCFS2 running. If I run everything by hand > the OCFS-Drive works quite well, but cluster in

Re: [Pacemaker] The main road of the cluster stack evolution

2013-06-10 Thread emmanuel segura
Hello About tools, i like crmsh so much :) Thanks 2013/6/10 Халезов Иван > Hello everyone! > > I would like to ask a few questions about the main road of the cluster > stack evolution. > > 1) The RedHat company is planning to drop corosync support and wants to > switch to CMAN. ( http://www.g

Re: [Pacemaker] corosync does not start

2013-06-10 Thread emmanuel segura
Hello Andreas What do you have in /etc/security/limits.conf ? Thanks 2013/6/10 andreas graeper > hi, > Jun 10 15:09:06 n1 corosync[2785]: [MAIN ] Could not set SCHED_RR at > priority 99: Operation not permitted (1) > Jun 10 15:09:06 n1 corosync[2785]: [MAIN ] Could not lock memory of >

Re: [Pacemaker] corosync does not start

2013-06-10 Thread emmanuel segura
Hello Andreas Ho do you start the cluster ? Thanks 2013/6/10 andreas graeper > hi, > Jun 10 15:09:06 n1 corosync[2785]: [MAIN ] Could not set SCHED_RR at > priority 99: Operation not permitted (1) > Jun 10 15:09:06 n1 corosync[2785]: [MAIN ] Could not lock memory of > service to avoid p

Re: [Pacemaker] commandline option to load cib-file like crm(live)configure: load /tmp/cib

2013-06-10 Thread emmanuel segura
Hello Bauer crm configure < filename 2013/6/10 Bauer, Stefan (IZLBW Extern) > I was to stupid to read the manpage. Its done like: > > ** ** > > crm configure load replace /tmp/cib > > ** ** > > sorry for the trouble! > > ** ** > > Stefan > > ** ** > > *Von:* Bauer, Stefan (IZL

Re: [Pacemaker] Troube mounting filesystem (DRBD)

2013-06-04 Thread emmanuel segura
Hello Denis I'm glad you solved 2013/6/4 Denis Witt > On Tue, 4 Jun 2013 15:38:57 +0200 > Denis Witt wrote: > > > I'm trying to setup a Apache/DRBD cluster, but the Filesystem isn't > > mounted. crm status always tells me "not installed" as status for the > > filesystem primitive. Mounting th

Re: [Pacemaker] Troube mounting filesystem (DRBD)

2013-06-04 Thread emmanuel segura
Hello Denis Did you tried to mount the filesystem manualy, without the cluster? Thanks 2013/6/4 Denis Witt > Hi List, > > I'm trying to setup a Apache/DRBD cluster, but the Filesystem isn't > mounted. crm status always tells me "not installed" as status for the > filesystem primitive. Mountin

Re: [Pacemaker] Disabling failover for a resource

2013-05-27 Thread emmanuel segura
Hello Angel I think you can use colocation constrain, for more info read doc Thanks 2013/5/27 Angel L. Mateo > Hello, > > I have configured a active/passive cluster for my dovecot server. > Now I want to add to it a resource for running the backup service. I want > this resource to be

Re: [Pacemaker] Pacemaker 1.1.8 and corosync's cpg service?

2013-05-21 Thread emmanuel segura
https://bugzilla.redhat.com/show_bug.cgi?id=657041 2013/5/21 Mike Edwards > On Tue, May 21, 2013 at 11:15:56AM +1000, Andrew Beekhof babbled thus: > > cpg_join() is returning CS_ERR_TRY_AGAIN here. > > > > Jan: Any idea why this might happen? Thats a fair time to be blocked > for. > > Looks li

Re: [Pacemaker] Loss of ocf:pacemaker:ping target forces resources to restart?

2013-05-15 Thread emmanuel segura
Hello How do you configure your cluster network? are you using a private network for the cluster and one public for the services? 2013/5/15 Andrew Widdersheim > Sorry to bring up old issues but I am having the exact same problem as the > original poster. A simultaneous disconnect on my two nod

Re: [Pacemaker] clvmd start times out

2013-05-15 Thread emmanuel segura
w, this is same on suse 11 and redhat Thanks 2013/5/15 Michael Schwartzkopff > ** > > Am Mittwoch, 15. Mai 2013, 14:48:14 schrieb emmanuel segura: > > > Hello Michael > > > > > > I always used clvm on redhat cluster suite and suse hae with locking type > &g

Re: [Pacemaker] clvmd start times out

2013-05-15 Thread emmanuel segura
> ** > > Am Mittwoch, 15. Mai 2013, 14:34:55 schrieb emmanuel segura: > > > clvm locking type is 3 > > > > if you want to use internal locking. If you want to use separate libraries > you can choose "2". At lease that is what I understood from the docs. > &

Re: [Pacemaker] clvmd start times out

2013-05-15 Thread emmanuel segura
clvm locking type is 3 2013/5/15 Michael Schwartzkopff > ** > > Am Mittwoch, 15. Mai 2013, 13:35:21 schrieb Lars Marowsky-Bree: > > > On 2013-05-15T12:34:54, Michael Schwartzkopff > wrote: > > > > Hi, > > > > > > > > perhaps a little bit off topic, but I have a clvmd problem in my > cluster. >

Re: [Pacemaker] mysql resource can't start

2013-05-09 Thread emmanuel segura
Hello Li Maybe you have all resource in unmanaged state, because you set maintenance-mode="true" 2013/5/9 Li, Chen > For addition, > I have already tried to start resource in crm shell. > Nothing happened, and there is no log in mysql. > > Thanks. > -Chen > > > > 在 2013-5-9,17:10,"Li, Chen" ma

Re: [Pacemaker] R: Frequent SBD triggered server reboots

2013-05-07 Thread emmanuel segura
Hello Andrea i think you need to think about that Lars told you = (Upgrade to SP2) or maybe you can try to use a diferent lun for the sbd and use ionice for setting the realtime class for sbd process 2013/5/7 andrea cuozzo > Hi, > > Here are three logs from the last server watchdog-driven re

Re: [Pacemaker] [Troubleshooting] ERROR: Setup problem: couldn't find command: drbdsetup

2013-05-03 Thread emmanuel segura
r-1.1.6-3.el6.x86_64 > > ms ms-monkey-drbd monkey-drbd \ > meta master-max="1" clone-max="2" clone-node-max="1" > master-node-max="1" notify="true" interleave="true" > ? > > anyway, doesn't seem to addr

Re: [Pacemaker] [Troubleshooting] ERROR: Setup problem: couldn't find command: drbdsetup

2013-05-03 Thread emmanuel segura
> > it is strange, since permissions are 755 of drbd in resourced.d/linbit/drbd > > hmmm > > - Original Message - > > *From:* emmanuel segura > *To:* The Pacemaker cluster resource manager > *Sent:* Thursday, May 02, 2013 5:11 PM > *Subject:* Re: [Pacemaker] [Troubleshoo

Re: [Pacemaker] Frequent SBD triggered server reboots

2013-05-03 Thread emmanuel segura
lative multipathd > messages on the syslog about path lost and reinstated, What makes me think > my problem might not be multipath related is that there's no sign of port > down or path lost messages in the syslog when the problem happens, there's > just the sbd delay countd

Re: [Pacemaker] Frequent SBD triggered server reboots

2013-05-02 Thread emmanuel segura
://en.it-usenet.org/thread/18723/12998/ > > And this my multipath -ll output: > > server1:~ # multipath -ll > > san (360060e8006d2e100d2e100e6) dm-0 HP,OPEN-V > size=50G features='0' hwhandler='0' wp=rw > `-+- policy='round-robin 0' prio=1 s

Re: [Pacemaker] Frequent SBD triggered server reboots

2013-05-02 Thread emmanuel segura
Hello Andrea Can you show me your multipath.conf? Thanks 2013/5/2 andrea cuozzo > Hi, > > ** ** > > It's my first try at asking for help on a mailing list, I hope I'll not > make netiquette mistakes. I really could use some help on SBD, here's my > scenario: > > ** ** > > I have three

Re: [Pacemaker] [Troubleshooting] ERROR: Setup problem: couldn't find command: drbdsetup

2013-05-02 Thread emmanuel segura
did you try to test resource agent by hand? if not try ocf-test Thanks 2013/5/2 Arvydas > ** > yes, i am using ocf: > > > - Original Message - > *From:* emmanuel segura > *To:* The Pacemaker cluster resource manager > *Sent:* Thursday, May 02, 2013

Re: [Pacemaker] [Troubleshooting] ERROR: Setup problem: couldn't find command: drbdsetup

2013-05-02 Thread emmanuel segura
which resource agent are you using? can you show your config? Thanks 2013/5/2 Arvydas > ** > # which drbdsetup > /sbin/drbdsetup > it works > > - Original Message - > *From:* emmanuel segura > *To:* The Pacemaker cluster resource manager > *Sent:* Thur

Re: [Pacemaker] [Troubleshooting] ERROR: Setup problem: couldn't find command: drbdsetup

2013-05-02 Thread emmanuel segura
Hello try the command which drbdsetup 2013/4/30 Arvydas > Hello, > > has anyone encountered such problem: > > lrmd: [2524]: info: RA output: (monkey-drbd:1:monitor:stderr) > 2013/04/30_12:42:34 ERROR: Setup problem: couldn't find command: drbdsetup > > > ? > > # whereis drbdadm > drbdadm: /s

Re: [Pacemaker] pcs equivalent of crm configure erase

2013-04-19 Thread emmanuel segura
I don't know why redhat doesn't give to the users the alternatives to use what they want sorry for my ugly english :-) Thanks 2013/4/19 T. > Hi Chris, > > > No, you're definitely not missing anything. The 'pcs cluster cib' > > output isn't pretty. > why there is an approach to build a new co

Re: [Pacemaker] Disable startup fencing with cman

2013-04-15 Thread emmanuel segura
Hello If i remember well, for disable startup-fencing on cman clean_start="1" in fence_daemon tag, for pacemaker i think is startup-fencing="false" Thanks 2013/4/15 Andreas Mock > Hi Andrew, > > thank you for your answers (to all of my questions). > > My problem is, I have both nodes down. N

Re: [Pacemaker] change sbd watchdog timeout in a running cluster

2013-03-27 Thread emmanuel segura
Hello Lars We have no iowait on disks in very load time, i'll try 30s timeout thanks for everything 2013/3/27 Lars Marowsky-Bree > On 2013-03-26T18:22:07, emmanuel segura wrote: > > > Hello Lars > > > > what timeout you recommend me > > I don't know

Re: [Pacemaker] change sbd watchdog timeout in a running cluster

2013-03-26 Thread emmanuel segura
Hello Lars what timeout you recommend me Thanks a lot 2013/3/26 Lars Marowsky-Bree > On 2013-03-26T17:13:34, emmanuel segura wrote: > > > Hello Lars > > > > Because we have a vm(suse 11) cluster on a esx cluster, as datastore we > are > > using a netapp in

Re: [Pacemaker] change sbd watchdog timeout in a running cluster

2013-03-26 Thread emmanuel segura
Hello Lars Why do you think the long timeout is wrong? Do i need to change the stonith-timeout on pacemaker? Thanks 2013/3/26 Lars Marowsky-Bree > On 2013-03-26T16:48:30, emmanuel segura wrote: > > > Hello Lars > > > > So the procedura should be: > > >

Re: [Pacemaker] change sbd watchdog timeout in a running cluster

2013-03-26 Thread emmanuel segura
Thanks 2013/3/26 Lars Marowsky-Bree > On 2013-03-26T16:48:30, emmanuel segura wrote: > > > Hello Lars > > > > So the procedura should be: > > > > crm resource stop stonith_sbd > > sbd -d /dev/sda1 message exit = (on every node) > > sbd -d /dev

Re: [Pacemaker] change sbd watchdog timeout in a running cluster

2013-03-26 Thread emmanuel segura
Hello Lars So the procedura should be: crm resource stop stonith_sbd sbd -d /dev/sda1 message exit = (on every node) sbd -d /dev/sda1 -1 90 -4 180 create crm resource start stonith_sbd Thanks 2013/3/26 Lars Marowsky-Bree > On 2013-03-26T15:56:48, emmanuel segura wrote: > > >

[Pacemaker] change sbd watchdog timeout in a running cluster

2013-03-26 Thread emmanuel segura
Hello List How can i change the sbd watchdog timeout without stopping the cluster? Thanks -- esta es mi vida e me la vivo hasta que dios quiera ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacem

Re: [Pacemaker] DRBD+LVM+NFS problems

2013-03-26 Thread emmanuel segura
Hello Dennis This constrain is wrong colocation c_web1_on_drbd inf: ms_drbd_web1:Master p_fs_web1 it should be colocation c_web1_on_drbd inf: p_fs_web1 ms_drbd_web1:Master Thanks 2013/3/26 Dennis Jacobfeuerborn > I have now reduced the configuration further and removed LVM from the > pictur

Re: [Pacemaker] stonith and avoiding split brain in two nodes cluster

2013-03-25 Thread emmanuel segura
I have a production cluster, using two vm on esx cluster, for stonith i'm using sbd, everything work find 2013/3/25 Angel L. Mateo > Hello, > > I am newbie with pacemaker (and, generally, with ha clusters). I > have configured a two nodes cluster. Both nodes are virtual machines > (vmwar

Re: [Pacemaker] stonith and avoiding split brain in two nodes cluster

2013-03-25 Thread emmanuel segura
I have a production cluster, using two vm on esx cluster, for stonith i'm using sbd, everything work fine 2013/3/25 emmanuel segura > I have a production cluster, using two vm on esx cluster, for stonith i'm > using sbd, everything work find > > 2013/3/25 Ange

Re: [Pacemaker] pacemaker + corosync + clvm in ubuntu

2013-03-22 Thread emmanuel segura
Hello Angel I'm using debian, i don't know if the result on ubuntu is the same, try apt-file search dlm_controld.pcmk Result should be: dlm-pcmk: /usr/sbin/dlm_controld.pcmk 2013/3/22 Angel L. Mateo > Hello, > > I'm trying to configure a cluster based in pacemaker and corosync > in t

Re: [Pacemaker] Migrate vm on drbd in correct order?

2013-03-16 Thread emmanuel segura
try to change the colocation like this, example: colocation vma_on_drbd \ inf: kvm_vma ms_vma_R:Master ms_vma_S:Master 2013/3/16 Matthias Teege > Hi, > > I'm try to setup a two node cluster for virtuell machines (KVM/libvirt) > with drbd as storage backend. For each vm I use two drbd devices.

Re: [Pacemaker] mysql/drbd on wheezy active/passive setup issues

2013-03-13 Thread emmanuel segura
Hello Use *ocf-tester *to debug your resource 2013/3/13 christopher barry > ** > Greetings all, > > I'm almost there, and figure I just have something small out of place. > Wondering if you can view my setup here: > > > https://zerobin.permutation.net/?d8664af27a7de3be#Bh3fBAupeEw3RhBWOlvDomyPk

Re: [Pacemaker] Problem on Pacemaker when a new node is joining the cluster

2013-03-12 Thread emmanuel segura
Mar 12 11:39:27 HA_NODE2 pdns[32333]: Calling daemonize, going to background Mar 12 11:39:27 HA_NODE2 lrmd[31077]: notice: operation_finished: PDNS_D_start_0:32323 [ Mar 12 11:39:27 Unable to parse configuration file '/etc/powerdns/recursor.conf' ] 2013/3/12 Paul Sun > Hi > > ** ** > > I

Re: [Pacemaker] Pacemaker is initializing the service before mounting the partition

2013-03-08 Thread emmanuel segura
You need a order constrain order fs_after_ms inf: drbd_sistema:promote sistema_fs:start order pgsql_afterLfs inf: sistema_fs postgresql Or maybe you can put fs and pgsql in a group, like that you can use a contrais like this order foo inf: drbd_sistema:promote myservicegroup:start 2013/3/8 Cr

Re: [Pacemaker] Fw: Fw: Cluster resources failing to move

2013-03-04 Thread emmanuel segura
rt >error: text2role: Unknown role: Start >error: get_target_role: voip: Unknown value for target-role: Start > error: text2role: Unknown role: Start >error: get_target_role: p_asterisk: Unknown value for target-role: > Start > Errors found during check: conf

Re: [Pacemaker] Fw: Cluster resources failing to move

2013-03-04 Thread emmanuel segura
itor interval="10s" timeout="30s" \ > op stop interval="0" timeout="30s" migration-threshold="1" > > I tried stopping the asterisk service using service asterisk stop. I > repeated that for at least 4 times but the service keep

Re: [Pacemaker] Fw: Cluster resources failing to move

2013-03-04 Thread emmanuel segura
meout="30s" \ > op monitor interval="10s" timeout="30s" \ > op stop interval="0" timeout="30s" migration-threshold="1" > > I tried stopping the asterisk service using service asterisk stop. I > repeated that for at least 4 tim

Re: [Pacemaker] Cluster resources failing to move

2013-03-04 Thread emmanuel segura
>From Suse Docs 7.4.2. Cleaning Up Resources¶ A resource will be automatically restarted if it fails, but each failure raises the resource's failcount. If a migration-thresh

Re: [Pacemaker] Pacemaker is not automatically mounting the DRBD partitions

2013-02-15 Thread emmanuel segura
:-) Nice 2013/2/15 Cristiane França > Hi Emmanuel, > > Thank you very much! > I changed my pacemaker config as you suggested and the problem was solved. > > Thanks. > Cristiane > > > On Thu, Feb 14, 2013 at 4:38 PM, emmanuel segura wrote: > >> Hello

Re: [Pacemaker] Pacemaker is not automatically mounting the DRBD partitions

2013-02-14 Thread emmanuel segura
d: [ drbd_home:0 ] > Master/Slave Set: ms_drbd_sistema [drbd_sistema] > drbd_sistema:0 (ocf::linbit:drbd): Slave primario (unmanaged) FAILED > Stopped: [ drbd_sistema:1 ] > Master/Slave Set: ms_drbd_database [drbd_database] > drbd_database:0 (ocf::linbit:drbd): Slave p

Re: [Pacemaker] Pacemaker is not automatically mounting the DRBD partitions

2013-02-14 Thread emmanuel segura
Hello Cristiane I think your pacemaker config doesn't call the resource defined in your drbd config 2013/2/14 Cristiane França > hello, > I installed Pacemaker (1.1.7-6) and DRBD (8.4.2-2) on my server CentOS 6.3 > (kernel 2.6.32-279.19.1 - 64 bits). > I'm having the following problem: > The Pa

Re: [Pacemaker] Pacemaker is not automatically mounting the DRBD partitions

2013-02-14 Thread emmanuel segura
Hello Cristiane can you post your cluster logs and your drbd config Thanks 2013/2/14 Cristiane França > hello, > I installed Pacemaker (1.1.7-6) and DRBD (8.4.2-2) on my server CentOS 6.3 > (kernel 2.6.32-279.19.1 - 64 bits). > I'm having the following problem: > The Pacemaker is not automatica

[Pacemaker] crmsh on fedora 18

2013-02-03 Thread emmanuel segura
Hello List Sorry for this stupid question, but i would like to know if i can install crmsh on fedora 18, i know fedora 18 use pcs, but i don't like pcs Thanks -- esta es mi vida e me la vivo hasta que dios quiera ___ Pacemaker mailing list: Pacemaker@

Re: [Pacemaker] [PACEMAKER] Why cant't migrate group resource with collation in drbd resource

2013-01-23 Thread emmanuel segura
Ummm Fist the IP-AND-FS? but what happen if the FS is on drbd? Thanks 2013/1/23 Kashif Jawed Siddiqui > You must change the order > > > #order DRBD_BEF_FS inf: ms_drbd:promote IP-AND-FS:start > > order DRBD_BEF_FS inf: IP-AND-FS:start ms_drbd:promote > > //First start IP-AND-FS, only then pr

Re: [Pacemaker] Oracle instance problem

2013-01-09 Thread emmanuel segura
Hejjo Dejan use ocf-test to see why you have this tow errors 1: cannot start already-running ORACLE - shut it down first) 2: /u01/app/oracle/product/10.2.0/db_1/dbs/lk*: No such file or directory Thanks 2013/1/9 Dejan Muhamedagic > cannot start already-running ORACLE - shut it down first)

Re: [Pacemaker] Oracle instance problem

2013-01-09 Thread emmanuel segura
Hello Maybe that isn't the problem, but before you try to start oracle from pacemaker, try to see if there is a share memory segment allocated from oracle and no deallocated ipcs command can help you 2013/1/9 Dejan Muhamedagic > Hi, > > On Wed, Jan 09, 2013 at 03:38:08PM +0530, Sucheta wrote:

Re: [Pacemaker] node status does not change even if pacemakerd dies

2013-01-09 Thread emmanuel segura
Hello Maybe my question is stupid, but are you root when you try to killing the procs? Thanks 2013/1/9 Kazunori INOUE > Hi Andrew, > > I have another question about this subject. > Even if pengine, stonithd, and attrd crash after pacemakerd is killed > (for example, killed by OOM_Killer), node

Re: [Pacemaker] crmd used all its file descriptors

2012-12-07 Thread emmanuel segura
If i remember well, this is old bug, has been fixed 2012/12/7 Piotr Jewiec > Hi, > > I have a corosync/pacemaker cluster running on Ubuntu 10.04.2. The > following error is getting appended to the syslog: > > Dec 6 20:44:46 filer-1 crmd: [2970]: ERROR: socket_client_channel_new: > socket: Too m

Re: [Pacemaker] resource doesn't migrate after failcount is reached

2012-10-22 Thread emmanuel segura
Hello Andreas Thanks :-) That solved my problem 2012/10/21 Andreas Kurz > On 10/20/2012 12:53 PM, emmanuel segura wrote: > > Hello List > > > > I have a stand alone resource and one group, i would like that when the > > stand alone resource reaches the failcount

[Pacemaker] resource doesn't migrate after failcount is reached

2012-10-20 Thread emmanuel segura
Hello List I have a stand alone resource and one group, i would like that when the stand alone resource reaches the failcount, the group doesn't migrate and the stand alone stays on the node where the group is situated This is my test conf ~

Re: [Pacemaker] Clustered LVM in failover cluster

2012-09-10 Thread emmanuel segura
I worked in Redhat Cluster without clvmd, i did a storage migration where the service was running. Until this point everythig was fine, but when the primary node where the service was running crashed it, the secondary node used the old luns, this happened because nobody synchronized the lvm metadat

Re: [Pacemaker] filesystem script block when umount a failed storage

2012-08-08 Thread emmanuel segura
Hello Dejan do you see any errors in your systemlog? what kind of controler you are using? So please give more information any time you need help 2012/8/8 Dejan Muhamedagic > Hi, > > On Thu, Aug 02, 2012 at 01:36:05AM +0800, Mia Lueng wrote: > > Hi all: > > When I disconnect the connection

Re: [Pacemaker] Filesystem will not mount (mounts fine when mannualy mounted)

2012-08-08 Thread emmanuel segura
are you using /dev/mapper/mpath0 directly, i don't think this is the problem, but i always used lvm with multipath 2012/8/8 Patrick de Ruiter > I currently have a problem with mounting a filesystem, it fails and > somehow I cannot seem to figure out why. > Could someone have a look to point me

Re: [Pacemaker] about iTCO_wdt watchdog

2012-08-02 Thread emmanuel segura
echo "b" >/proc/sysrq-trigge 2012/8/2 Mia Lueng > Hi All: >I use IBM 3650 to build a HA cluster. And set iTCO_wdt as the > watchdog. The following test is performed > 1. modprobe iTCO_wdt heartbeat=60 nowayout=1 > 2. echo "1" >/dev/watchdog >system will reboot after 60s > >But when I

<    1   2   3   >