hello
when I use yum install -y pacemaker corosync heartbeat to install I meet a
problem
libopenais-devel-0.80.5-15.1.x86_64 from installed has depsolving problems
-- Missing Dependency: openais = 0.80.5-15.1 is needed by package
libopenais-devel-0.80.5-15.1.x86_64 (installed)
Error:
follow is my ping command:
1.create ping resource
crm configure primitive pingd_data ocf:pacemaker:pingd meta target-role=stopped
params name=pingd_num op start timeout=90s op stop timeout=100s op monitor
interval=90s timeout=100s
2.set param
crm_resource -p host_list -r pingd_data -v host_list
Clone Set: pingd_data_net
Started: [ oss3 oss2 oss1 ]
I use the command :
crm_resource -g host_list -r pingd_data_net
to check the param host_list
the result is
pingd_data_net is active on more than one node, returning the default value
for null
1.create ping resource
crm configure primitive pingd_data ocf:pacemaker:pingd meta
target-role=stopped params name=pingd_num op start timeout=90s op stop
timeout=100s op monitor interval=90s timeout=100s
2.set param
crm_resource -p host_list -r pingd_data -v host_list
The
Hi everybody:
I start three node oss1,oss2,oss3 at the same time, however, crm_mon show oss2
node: pending. and oss1 oss3 online.Why? If one node in pending condition. what
it can do?
thanks a lot:-)
___
Pacemaker mailing list:
--- 10年9月21日,周二, jiaju liu liujiaj...@yahoo.com.cn 写道:
发件人: jiaju liu liujiaj...@yahoo.com.cn
主题: re:re:node pending problem
收件人: and...@beekhof.net
日期: 2010年9月21日,周二,下午2:19
Message: 1
Date: Mon, 20 Sep 2010 11:55:05 +0200
From: Andrew Beekhof and...@beekhof.net
To: The Pacemaker
hi guys
I use command service openais force-stop to stop openais, It ofen waste a long
time to stop or maybe run this command and no end. sometimes I use command
service openais force-stop twice it will be ok, or I have to kill pocess. who
has a better way to stop service.
Thanks a lot:-)
/plain; charset=iso-8859-1
On Tue, Sep 28, 2010 at 10:00 AM, jiaju liu liujiaj...@yahoo.com.cn wrote:
hi guys
I use command service openais force-stop to stop openais, It ofen waste a
long time to stop or maybe run this command and no end. sometimes I
use command service openais force-stop
/plain; charset=iso-8859-1
On Tue, Sep 28, 2010 at 10:00 AM, jiaju liu liujiaj...@yahoo.com.cn wrote:
hi guys
I use command service openais force-stop to stop openais, It ofen waste a
long time to stop or maybe run this command and no end. sometimes I
use command service openais force-stop
Message: 2
Date: Thu, 7 Oct 2010 21:58:29 +0200
From: Pavlos Parissis pavlos.paris...@gmail.com
To: The Pacemaker cluster resource manager
pacemaker@oss.clusterlabs.org
Subject: Re: [Pacemaker] crm resource move doesn't move the resource
Message-ID:
Hi everybody
I use command
crm configure primitive apache ocf:heartbeat:apache
params /opt/lampp/etc/httpd.conf meta resource-stickiness=1
target-role=stopped op start timeout=120s op stop timeout=120s op monitor
timeout=20s interval=10s op status timeout=30s;
and
crm resource start apache
hi everybody
I use command crm openais stop first to stop openais service and then use rm
-rf /var/lib/heartbear/crm/* clear all information. then change multicast and
start it in another cluster.
the problem is sometimes it works I can use crm_mon command. and sometimes it
doesn't work. I use
the screen says
node1 pingd: [1927]: info: stand_alone_ping: Node 192.168.10.110 is unr
eachable (read)
and the node could not start
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
updated the
packages to the latest versions of corosync, pacemaker, etc. The tricky
part was doing the migration procedure for upgrading production servers
without service downtime, but that's another story.
Regards,
Dan
jiaju liu wrote
Hi
I reboot my node, and it appears
node2 pingd: [3932]: info: stand_alone_ping: Node 192.168.10.100 is unreachable
(read)
and the node could not start
192.168.10.100 is ib network I will start ib after the node start, so do you
have any idea let the node start first?Thanks very much.:-)
Hi
I reboot my node, and it appears
node2 pingd: [3932]: info: stand_alone_ping: Node 192.168.10.100 is unreachable
(read)
and the node could not start
192.168.10.100 is ib network I will start ib after the node start, so do you
have any idea let the node start first?Thanks very much.:-)
at 3:47 AM, jiaju liu liujiaju86 [at] yahoo wrote:
Hi
I reboot my node, and it appears
node2 pingd: [3932]: info: stand_alone_ping: Node 192.168.10.100 is
unreachable (read)
and the node could not start
192.168.10.100 is ib network I will start ib after the node start, so do
you
, removed the RPMs, removed
/var/lib/heartbeat/crm/* configured corosync, started it
Et voila, it works. BTW, awesome job done to maintain compatibility
between versions, the upgrade was truly seamless.
Regards,
Dan
jiaju liu wrote:
Hi
Thank you for your help. I want to upgrade my
? Thanks a lot
Hi,
It's not mandatory to install ldirectord, I know it's not a dependency
anymore. As for libesmtp see http://tinyurl.com/2uhdpzw
jiaju liu wrote:
Hi I have already installed rpm as follow:
cluster-glue-1.0.5-1.el5.x86_64.rpm
cluster-glue-libs-1.0.5-1.el5.x86_64.rpm
cluster
--- 10年10月18日,周一, pacemaker-requ...@oss.clusterlabs.org
pacemaker-requ...@oss.clusterlabs.org 写道:
Hi I also use Pacemake manage Lustre system. Now I meet a question. If I mount
and umount the same ost or mdt several times. It will last more then 3 minutes
to mount this ost or mdt again.
Hi everyone
I use command crm resource stop Rsc to stop a resource and use crm resource
cleanup at last use crm configure delete Rsc to delete resource.
However I still found the resource's imformation in XML file.
my question is how to remove one resource imformation completely from cluster.
when I use crm configure refresh, it shows WARNING: status: operation not
recognized
What's wrong with it
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Project Home:
Hi,
On Thu, Oct 21, 2010 at 01:51:25PM +0800, jiaju liu wrote:
Hi everyone
I use command crm resource stop Rsc to stop a resource and use crm resource
cleanup at last use crm configure delete Rsc to delete resource.
However I still found the resource's imformation in XML file.
crm resource
Hi guys
I use command crm configure delete to delete a resource. However it shows
datavol_mpath0 (ocf::heartbeat:Filesystem): ORPHANED Started oss2
By the way, I stop this resource first. and could not delete this orphaned
resource
Orphan resource are those for which there are entry in the
datavol_mpath0
or something to that effect
On Tue, Oct 26, 2010 at 3:30 AM, jiaju liu liujiaj...@yahoo.com.cn wrote:
Hi guys
I use command crm configure delete to delete a resource. However it shows
datavol_mpath0 (ocf::heartbeat:Filesystem): ORPHANED Started oss2
By the way, I stop
, jiaju liu liujiaj...@yahoo.com.cn wrote:
Hi guys
I use command crm configure delete to delete a resource. However it shows
datavol_mpath0 (ocf::heartbeat:Filesystem): ORPHANED Started oss2
By the way, I stop this resource first. and could not delete this orphaned
resource
Orphan resource
Thanks for your reply, I have used cleanup order, They still there. When
I use crm resource show datavol_mpath0 it shows not exist. however when I
use crm_mon it shows ORPHANED Started oss2. If?restart corosync servcie
the service could not stop. fortunately,?this is doesn't happen
I want to start a filesystem it often failed I check the log it shows: I want
to know there is bug in pingd
my pacemaker package are
pacemaker-1.0.8-6.1.el5
pacemaker-libs-devel-1.0.8-6.1.el5
pacemaker-libs-1.0.8-6.1.el5
openais packages are
openaislib-devel-1.1.0-1.el5
openais-1.1.0-1.el5
I load the same filesystem resource into cluster for serval times, at the
beginning is ok however it start failed.I check the log;it shows:
Nov 09 11:13:36 oss3 crmd: [2968]: WARN: log_data_element: do_lrm_invoke: bad
input crm_xml
Nov 09 11:13:36 oss3 crmd: [2968]: WARN:
the syslog as follows is these means there are some comunication problem in
cluster
oss1 crmd: [11282]: info: abort_transition_graph: do_te_invoke:191 - Triggered
transition abort (complete=0) : Peer Cancelled
oss1 crmd: [11282]: info: abort_transition_graph: do_te_invoke:191 - Triggered
Fixed upstream a while back, please update
ok, thank you and where I
I load the same filesystem resource into cluster for serval times, at the
beginning is ok however it start failed.I check the log;it shows:
Nov 09 11:13:36 oss3 crmd: [2968]: WARN: log_data_element: do_lrm_invoke:
Fixed upstream a while back, please update
ok, thank you by the way where I can get fixed and which one is I need. thanks
a lot
I load the same filesystem resource into cluster for serval times, at the
beginning is ok however it start failed.I check the log;it shows:
Nov
Hi
I reboot my node, and it appears
node2 pingd: [3932]: info: stand_alone_ping: Node 192.168.10.100 is
unreachable (read)
and the node could not start
192.168.10.100 is ib network I will start ib after the node start, so do
you have any idea let the node start first?Thanks very
start resource steps
step(1)
crm configure primitive vol_mpath0 ocf:heartbeat:Filesystem meta
target-role=stopped params device=/dev/mapper/mpath0
directory=/mnt/mapper/mpath0 fstype='lustre' op start timeout=300s op stop
timeout=120s op monitor timeout=120s interval=60s op notify timeout=60s
start resource steps
step(1)
crm configure primitive?vol_mpath0 ocf:heartbeat:Filesystem meta
target-role=stopped params device=/dev/mapper/mpath0
directory=/mnt/mapper/mpath0 fstype='lustre' op start timeout=300s? op stop
timeout=120s op monitor timeout=120s interval=60s op notify
Message: 6
Date: Sun, 14 Nov 2010 13:17:23 +0800 (CST)
From: jiaju liu liujiaj...@yahoo.com.cn
To: pacemaker@oss.clusterlabs.org
Subject: Re: [Pacemaker] start filesystem like this is right?
Message-ID: 476924.89659...@web15703.mail.cnb.yahoo.com
Content-Type: text/plain; charset=iso-8859
Hi all
If I use command like this
service corosync start
it shows
Starting Corosync Cluster Engine (corosync): [FAILED]
and I do nothing just reboot my computer it will be OK what is the reason?
Thanks a lot
my pacemaker packages are
pacemaker-1.0.8-6.1.el5
On 11/22/2010 01:27 AM, jiaju liu wrote:
Hi all
If I use command like this
service corosync start
it shows
Starting Corosync Cluster Engine (corosync): [FAILED]
and I do nothing just reboot my computer it will be OK?what is the
reason
On 11/22/2010 01:27 AM, jiaju liu wrote:
Hi all
If I use command like this
service corosync start
it shows
Starting Corosync Cluster Engine (corosync): [FAILED]
and I do nothing just reboot my computer it will be OK?what is the
reason
On 11/22/2010 01:27 AM, jiaju liu wrote:
? ???Hi all
? ???If I use command like this
? ? ? service corosync start
? ???it shows
? ???Starting Corosync Cluster Engine (corosync):? ? ? ? ? ? ???[FAILED]
? ? ?
? ???and I do nothing just reboot my computer it will be OK?what
This problem has already resolved. The pacemaker version is not match with
corosync. I change pacemaker to 1.0.9.1-1.e15 is OK.
Thanks a lot
\
? ???Hi all
? ???If I use command like this
? ? ? service corosync start
? ???it shows
? ???Starting Corosync Cluster Engine (corosync):? ? ? ? ? ?
Hi all
I have 4 nodes node1,node2,node3,node4 in cluster. and start clone resource
pingd. this ping a router. when node1 and node2 offline. The clone resource
stopped. however the resource ipmi is ok.
Is the feature of clone resource? If half node of cluster offline clone
resource stop?
strange, the disk could not mount umount for more than 3 times
Please read our documentation, we're not here to read it to you.
Quorum is one of the first things covered in clusters from scratch
On Wed, Nov 24, 2010 at 11:39 AM, jiaju liu liujiaj...@yahoo.com.cn wrote:
Hi all
I have 4 nodes
I have 4 nodes node1,node2,node3,node4 in cluster. and?start clone
resource pingd. this ping a router.?when node1 and node2 offline. The
clone resource stopped. however the resource ipmi is ok. Is the feature
of?clone resource? If half node of cluster offline clone resource stop??
thanks
Hi all
I use crm_node -i order to check node id. the result is as follow
[r...@oss1 ~]# crm_node -i
1678456074
Unknown option 'i'
crm_node - Tool for displaying low-level node information
Usage: crm_node command [options]
Options:
-?, --help This text
-$, --version Version information
Hi all
I use crm_node -i order to check node id. the result is as follow
[r...@oss1 ~]# crm_node -i
1678456074
Unknown option 'i'
crm_node - Tool for displaying low-level node information
Usage: crm_node command [options]
Options:
?-?, --help ??This text
?-$, --version ??Version
Hi all
I use crm_node -i order to check node id. the result is as follow
[r...@oss1 ~]# crm_node -i
1678456074
Unknown option 'i'
crm_node - Tool for displaying low-level node information
Usage: crm_node command [options]
Options:
?-?, --help ??This text
?-$, --version ??Version
Hi all
I want to add lampp into my HA cluster, unfortunately, I have not find lampp
RA, is this mean I should write RA by myself or my resource-agent version is
too old?
my packages version are
cluster-glue-libs-devel-1.0.5-1.el5
cluster-glue-1.0.5-1.el5
cluster-glue-libs-1.0.5-1.el5
Hi all
One node corosync start is ok ,however, could not add into cluster, I found
corosync (pid 20303 20296) is running. is this result in the problem?
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
Hi all
lampp is short for linux apache mysql php. So does our pacemaker supports it?
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Project Home: http://www.clusterlabs.org
Yes offcause.
Php runs inside apache, so you make rules for mysql and apache.
Best regards
Allan Jacobsen
Thank you for your reply, I haven't found lampp resource agent, so this means
I must write a resource agent for lampp? would you please tell me more about
what I should do ? Thanks a
apache default listen port is 80, which is used by IIC so I have to change
apache listen port to 800. when I use service httpd start it is ok however when
I use pacemaker to start it it failed
it shows :apache_start_0 (node=mds1, call=57, rc=1, status=complete): unknown
error
the apache
There are a lot of warning information .Is there way erase them
My pacemaker version is pacemaker-1.0.9.1-1.el5
10.53.11.159: WARNING: status: operation not recognized
10.53.11.159: WARNING: status: operation not recognized
10.53.11.159: WARNING: force-reload: operation not recognized
when I use command
crm configure property start-failure-is-fatal=FALSE
it shows
WARNING: status: operation not recognized
WARNING: status: operation not recognized
WARNING: status: operation not recognized
WARNING: status: operation not recognized
WARNING: status: operation not recognized
when I use command
crm configure property start-failure-is-fatal=FALSE
it shows
WARNING: status: operation not recognized
WARNING: status: operation not recognized
WARNING: status: operation not recognized
WARNING: status: operation not recognized
WARNING: status: operation not
I use command
crm configure primitive cfs_monitor lsb:gmetad meta resource-stickiness=1 op
monitor timeout=15 interval=15 op force-reload timeout=15
to start a resource gmetad,crm resource show shows
cfs_monitor (lsb:gmetad) Started
and
service gmetad status shows
gmetad is stopped
my
I use lustre filesystem in cluster,By default, the start, stop, and monitor
operations in a Filesystem resource
time out after 20 sec. Since some mounts in Lustre require up to 5 minutes or
more,so, the default timeouts for these operations must be modified.I want to
change it to 10min, is it
I use lustre in my HA-cluster,sometimes when I umount ost, the node will
reboot.an 20 09:23:10 oss1 kernel: Lustre: server umount testTwo-OST0003
completeJan 20 10:28:56 oss1 kernel: Lustre: server umount testTwo-OST
completeJan 20 10:28:58 oss1 kernel: exit dynlocks cacheJan 20 10:29:11
Hi,
I have two network cards and configured corosync-1.2.7 with
rrp_mode: active
at first corosync-cfg -s tells me
Printing ring status.
Local node ID 1210452490
RING ID 0
id = 10.10.38.72
status = ring 0 active with no faults
RING ID 1
id =
Hi Is there any paper write about pacemaker, I want to learn more and deeply
about pacemaker and HA,Thanks a lot.
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Project Home:
Hi all when I umount / mount my filesystem with HA, stonith often reboot the
node,I have already ser dc-deadtime to 2min.it doesn't work.The log only says
one node lost.my pacemaker version is
:pacemaker-libs-devel-1.0.9.1-1.el5pacemaker-libs-1.0.9.1-1.el5pacemaker-1.0.9.1-1.el5
I think this
I check log find information as follows, by the way my stonithd resource is
ipmi,is it ok?
Jan 30 10:51:43 mds1 external/ipmi[7686]: debug: ipmitool output: Chassis Power
is on
Jan 30 10:52:14 mds1 external/ipmi[7843]: debug: ipmitool output: Chassis Power
is on
Liang Ma
Contractuel | Consultant | SED Systems Inc.
Ground Systems Analyst
Agence spatiale canadienne | Canadian Space Agency
6767, Route de l'A?roport, Longueuil (St-Hubert), QC, Canada, J3Y 8Y9
T?l/Tel : (450) 926-5099 | T?l?c/Fax: (450) 926-5083
Hi listI find we all use drbd with mysql and pacemaker to realize HA. If I use
unison to synchronize database, and then add mysql in HA cluster.Is it OK?
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
I have two node ha cluster node1 and node2 I start virtual IP resource in
cluster on node1, I set stick-ness=0,when node1 reboot virtual ip migrate to
node2.however when node1 come back virtual IP still start on node2.Is there any
way to migrate resource back to node1?Thanks a lot
Hi allI think there is something wrong with cluster communication,result in
node reboot. so I want to use disk heartbeating, I use corosync-1.2.2-1.1.el5
and pacemaker-1.0.9.1-1.el5.is there any guide tell me how to realize disk
heartbeating with corosync and pacemaker?Thanks a lot
the HA software I use meet a lot of strange problem, I am not sure the version
I use is
OK?cluster-glue-libs-devel-1.0.5-1.el5cluster-glue-libs-1.0.5-1.el5cluster-glue-debuginfo-1.0.5-1.el5cluster-glue-1.0.5-1.el5
resource-agents-1.0.3-2.el5
when I use order crm configure ...it often shows
WARNING: status: operation not recognizedWARNING: force-reload: operation not
recognizedWARNING: force-reload: operation not recognizedWARNING: status:
operation not recognizedWARNING: status: operation not recognizedWARNING:
force-reload:
Hi I use corosync+pacemaker. I set debug:on in corosync.conf.However there is
nothing more output in log. If I want log to output more debug info what I
should do?Thanks a lot
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
Hi all yesterday my lab balckout, this morning when I start pacemaker I find
the filesystem could not be mounted, There is no use of crm resource cleanup,
after delete the files /var/lib/heartbeat/crm/*, and then restart corosync it
works,I do not understand
HI all I use ipmi as stonith resource. the interface I use lanplus first,
however, it doesn't work. I check log,it shows nodea external/ipmi[3433]:
ERROR: error executing
ipmitool: Error: Unable to establish IPMI v2 / RMCP+ session^M Unable to
get Chassis Power Status
so I change interface
Hi,
On Tue, May 24, 2011 at 03:31:01PM +0800, jiaju liu wrote:
HI all I use ipmi as stonith resource. the interface I use lanplus first,
however, it doesn't work. I check log,it shows nodea external/ipmi[3433]:
ERROR: error executing
ipmitool: Error: Unable to establish IPMI
Hi,
25.05.2011 03:49, jiaju liu wrote:
Hi,
On Tue, May 24, 2011 at 03:31:01PM +0800, jiaju liu wrote:
HI all I use ipmi as stonith resource. the interface I use
lanplus first, however, it doesn't work. I check log,it shows nodea
external/ipmi[3433
HI all
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs:
Hi all I intend to use wwid as resource name which is 32bit, so are there any
limitations of resource name? 32bit is ok?___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Project Home:
75 matches
Mail list logo