[Linux-cluster] IP Resource behavior with Red Hat Cluster

2010-12-23 Thread Parvez Shaikh
Hi all, I am using Red Hat cluster 6.2.0 (version shown with cman_tool version) on Red Hat 5.5 I am on host that has multiple network interfaces and all(or some) of which may be active while I tried to bring up my IP resource up. My cluster is of simple configuration - It has only 2 nodes, and

Re: [Linux-cluster] IP Resource behavior with Red Hat Cluster

2010-12-24 Thread Parvez Shaikh
do I have to add virtual interface manually (as above or any other method?) before I could start service with IP resource under it? Thanks Parvez On Fri, Dec 24, 2010 at 11:30 AM, Rajagopal Swaminathan raju.rajs...@gmail.com wrote: Greetings, On Fri, Dec 24, 2010 at 5:33 AM, Parvez Shaikh

Re: [Linux-cluster] IP Resource behavior with Red Hat Cluster

2010-12-24 Thread Parvez Shaikh
Thanks a ton Jakov. It has clarified my doubts. Yours gratefully, Parvez On Sat, Dec 25, 2010 at 6:34 AM, Jakov Sosic jakov.so...@srce.hr wrote: On 12/24/2010 05:46 PM, Parvez Shaikh wrote: Hi Jakov Thank you for your response. My two hosts have multiple network interfaces or ethernet cards

Re: [Linux-cluster] IP Resource behavior with Red Hat Cluster

2010-12-26 Thread Parvez Shaikh
, Parvez Shaikh wrote: Hi Jakov Thank you for your response. My two hosts have multiple network interfaces or ethernet cards. I understood from your email, that the IP corresponding to cluster node name for both hosts, should be in the same subnet before a cluster could bring virtual IP up

Re: [Linux-cluster] IP Resource behavior with Red Hat Cluster

2010-12-26 Thread Parvez Shaikh
Gratefully yours On Mon, Dec 27, 2010 at 12:18 PM, Rajagopal Swaminathan raju.rajs...@gmail.com wrote: Greetinds, On Mon, Dec 27, 2010 at 9:51 AM, Parvez Shaikh parvez.h.sha...@gmail.com wrote: Hi Dec 27 17:35:32 datablade1 clurgmgrd[31853]: err Error storing ip: Duplicate Dec 27 17:36:55

Re: [Linux-cluster] Determining red hat cluster version

2011-01-05 Thread Parvez Shaikh
Hi Fabio This produces output - cman-2.0.115-29.el5 So does it indicate 2.0.115-29 is version? On Thu, Jan 6, 2011 at 12:34 PM, Fabio M. Di Nitto fdini...@redhat.com wrote: On 1/6/2011 6:24 AM, Parvez Shaikh wrote: Hi all, Is there any command which states Red Hat cluster version? I

Re: [Linux-cluster] Determining red hat cluster version

2011-01-06 Thread Parvez Shaikh
versions may be) Gratefully yours On Thu, Jan 6, 2011 at 1:14 PM, Fabio M. Di Nitto fdini...@redhat.com wrote: On 1/6/2011 8:28 AM, Parvez Shaikh wrote: Hi Fabio This produces output - cman-2.0.115-29.el5 So does it indicate 2.0.115-29 is version? yes Fabio -- Linux-cluster mailing

[Linux-cluster] configuring bladecenter fence device

2011-01-06 Thread Parvez Shaikh
Hi all, From RHCS documentation, I could see that bladecenter is one of the fence devices - http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/5/html/Cluster_Administration/ap-fence-device-param-CA.html Table B.9. IBM Blade Center Field Description NameA name for the IBM

Re: [Linux-cluster] configuring bladecenter fence device

2011-01-06 Thread Parvez Shaikh
Hi Ben Thanks a ton for below information. But I have doubt on cluster.conf file snippet below - clusternode name=node1 votes=1 fence method name=1 device blade=2 name=chassis_fence/

Re: [Linux-cluster] configuring bladecenter fence device

2011-01-06 Thread Parvez Shaikh
Thanks Hugo Your gratefully On Fri, Jan 7, 2011 at 11:09 AM, Hugo Lombard h...@elizium.za.net wrote: On Fri, Jan 07, 2011 at 10:12:16AM +0530, Parvez Shaikh wrote:                clusternode name=node1 votes=1                        fence                                method name=1

[Linux-cluster] Error while manual fencing and output of clustat

2011-01-10 Thread Parvez Shaikh
Dear experts, I have two node cluster(node1 and node2), and manual fencing is configured. Service S2 is running on node2. To ensure failover happen, I shutdown node2.. I see following messages in /var/log/messages - agent fence_manual reports: failed: fence_manual no node

Re: [Linux-cluster] Error while manual fencing and output of clustat

2011-01-10 Thread Parvez Shaikh
Parvez Shaikh wrote: Dear experts, I have two node cluster(node1 and node2), and manual fencing is configured. Service S2 is running on node2. To ensure failover happen, I shutdown node2.. I see following messages in /var/log/messages -                     agent fence_manual reports: failed

[Linux-cluster] Determining failed node on another node of cluster during failover

2011-01-11 Thread Parvez Shaikh
Hi all, Taking this question from another thread, here is a challenge that I am facing - Following is simple cluster configuration - Node 1, node 2, node 3, and node4 are part of cluster, its unrestricted unordered fail-over domain with active - active nxn configuration So a node 2 can get

Re: [Linux-cluster] Determining failed node on another node of clusterduring failover

2011-01-12 Thread Parvez Shaikh
also setting up monitoring. The monitoring package can then notify you if any cluster member fails. Regards, Kit -Original Message- From: linux-cluster-boun...@redhat.com [mailto:linux-cluster-boun...@redhat.com] On Behalf Of Parvez Shaikh Sent: woensdag 12 januari 2011 7:04

Re: [Linux-cluster] Determining failed node on another node of clusterduring failover

2011-01-12 Thread Parvez Shaikh
] On Behalf Of Parvez Shaikh Sent: woensdag 12 januari 2011 11:01 To: linux clustering Subject: Re: [Linux-cluster] Determining failed node on another node of clusterduring failover Hi Is monitoring package part of RHCS? What is name of this component? Is there any other mechanism which doesn't

Re: [Linux-cluster] Determining failed node on another node of clusterduring failover

2011-01-13 Thread Parvez Shaikh
Gratefully yours On 1/13/11, Parvez Shaikh parvez.h.sha...@gmail.com wrote: Hi, I have been using clustat command. clustat -x -s servicename to get following XML file - ?xml version=1.0? clustat version=4.1.1 groups group name=service:service_on_node1 state=112 state_str=started flags

[Linux-cluster] Questions related to cluster quorum and fencing

2011-01-18 Thread Parvez Shaikh
Hi all, *Quorum - * The questions are bit theoretical, I have gone through documentation and man pages and have understood that, a cluster is quorate if a cluster or its partition has nodes, with votes equal to or more than expected_votes in cman section of cluster.conf file (with no requirement

[Linux-cluster] Running cluster tools using non-root user

2011-01-25 Thread Parvez Shaikh
Hi all Is it possible to run cluster tools like clustat or clusvcadm etc. using non-root user? If yes, to which groups this user should belong to? Otherwise can this be done using sudo(and sudoers) file. As of now I get following error on clustat - Could not connect to CMAN: Permission denied

Re: [Linux-cluster] Running cluster tools using non-root user

2011-01-27 Thread Parvez Shaikh
about a week ago. [/Shameless plug] On Tue, Jan 25, 2011 at 10:39 AM, Parvez Shaikh parvez.h.sha...@gmail.com wrote: Hi all Is it possible to run cluster tools like clustat or clusvcadm etc. using non-root user? If yes, to which groups this user should belong to? Otherwise can

[Linux-cluster] Tuning red hat cluster

2011-02-10 Thread Parvez Shaikh
Hi, As per my understanding rgmanager invokes 'status' on resource groups periodically to determine if these resources are up or down. I observed that this period is of around 30 seconds. Is it possible to tune or adjust this period for individual services or resource groups? Thanks --

[Linux-cluster] SNMP support with IBM Blade Center Fence Agent

2011-02-27 Thread Parvez Shaikh
Hi all, I have a question related to fence agents and SNMP alarms. Fence Agent can fail to fence the failed node for various reason; e.g. with my bladecenter fencing agent, I sometimes get message saying bladecenter fencing failed because of timeout or fence device IP address/user credentials

Re: [Linux-cluster] SNMP support with IBM Blade Center Fence Agent

2011-03-01 Thread Parvez Shaikh
)? Thanks On Mon, Feb 28, 2011 at 9:44 PM, Ryan O'Hara roh...@redhat.com wrote: On Mon, Feb 28, 2011 at 12:43:10PM +0530, Parvez Shaikh wrote: Hi all, I have a question related to fence agents and SNMP alarms. Fence Agent can fail to fence the failed node for various reason; e.g

Re: [Linux-cluster] SNMP support with IBM Blade Center Fence Agent

2011-03-04 Thread Parvez Shaikh
. Thanks again and have great weekend ahead Yours truly, Parvez On Fri, Mar 4, 2011 at 10:45 PM, Lon Hohberger l...@redhat.com wrote: On Tue, Mar 01, 2011 at 06:50:18PM +0530, Parvez Shaikh wrote: Hi Ryan, Thank you for response. Does it mean there is no way to intimate administrator about

[Linux-cluster] Two node cluster - a potential problem of node fencing each other?

2011-03-12 Thread Parvez Shaikh
Hi all, I have a question pertaining to two node cluster, I have RHEL 5.5 and cluster along with it which at least should have two nodes. In a situation where both nodes of the cluster are up, and have reliable connection to fencing device (e.g. power switch OR any other power fencing device)

Re: [Linux-cluster] Two node cluster - a potential problem of node fencing each other?

2011-03-13 Thread Parvez Shaikh
redundant network link - i trust you were referring to ethernet bonding. On Sun, Mar 13, 2011 at 1:19 PM, Ian Hayes cthulhucall...@gmail.com wrote: On Sat, Mar 12, 2011 at 11:19 PM, Parvez Shaikh parvez.h.sha...@gmail.com wrote: Hi all, I have a question pertaining to two node cluster, I

[Linux-cluster] Clustat exit code for service status

2011-03-15 Thread Parvez Shaikh
Hi all, Command clustat -s service name gives status of service. If service is started (i.e. running on some node), exit code of this command is 0, if however service is not running, its exit code is non-zero (found it to be 119). Is this right and going to be continued in subsequent cluster

[Linux-cluster] Node without fencing method, is it possible to failover from such a node?

2011-03-16 Thread Parvez Shaikh
Hi all, I have a red hat cluster on IBM blade center with blades being my clusternodes and fence_bladecenter fencing agent. I have couple of resources - IP which activate or deactivate floating IP and script which start my server listening on this floating IP. This is a stateless server with no

Re: [Linux-cluster] Node without fencing method, is it possible to failover from such a node?

2011-03-24 Thread Parvez Shaikh
. Gratefully, Parvez On Thu, Mar 17, 2011 at 10:19 PM, Rajagopal Swaminathan raju.rajs...@gmail.com wrote: Greetings, On 3/17/11, Digimer li...@alteeve.com wrote: On 03/17/2011 01:25 AM, Parvez Shaikh wrote: Hi all, Life was good until I am now required to support cluster of nodes

[Linux-cluster] Plugged out blade from bladecenter chassis - fence_bladecenter failed

2011-04-27 Thread Parvez Shaikh
Hi all, I am using RHCS on IBM bladecenter with blade center fencing. I plugged out a blade from blade center chassis slot and was hoping that failover to occur. However when I did so, I get following message - fenced[10240]: agent fence_bladecenter reports: Failed: Unable to obtain correct plug

Re: [Linux-cluster] oracle DB is not failing over on killin PMON deamon

2011-05-14 Thread Parvez Shaikh
Hi Sufyan Does your status function return 0 or 1 if database is up or down respectively (i.e. have you tested it works outside script_db.sh) when run as root? On Thu, May 12, 2011 at 12:52 PM, Sufyan Khan sufyan.k...@its.ws wrote: First of all thanks for you quick response. Secondly please

Re: [Linux-cluster] Plugged out blade from bladecenter chassis - fence_bladecenter failed

2011-06-14 Thread Parvez Shaikh
Hi, Has anyone used missing_as_off in cluster.conf file? Any help where to put this option in cluster.conf would be greatly appreciated Thanks, Parvez On Mon, May 2, 2011 at 6:49 PM, Parvez Shaikh parvez.h.sha...@gmail.comwrote: Hi Marek, I tried the option missing_as_off=1 and now I get

Re: [Linux-cluster] Plugged out blade from bladecenter chassis - fence_bladecenter failed

2011-06-19 Thread Parvez Shaikh
: There is a bug related to missing_as_off - https://bugzilla.redhat.com/show_bug.cgi?id=689851 - expects the fix in rhel5u7 . regards, On Wed, Apr 27, 2011 at 1:59 PM, Parvez Shaikh parvez.h.sha...@gmail.comwrote: Hi all, I am using RHCS on IBM bladecenter with blade center fencing. I

[Linux-cluster] fence_ipmilan fails to reboot

2011-06-30 Thread Parvez Shaikh
Hi all, I am on RHEL 5.5; and I have two rack mounted servers with IPMI configured. When I run command from the prompt to reboot the server through fence_ipmilan, it shutsdown the server fine but it fails to power it on # fence_ipmilan -a IPMI IP Address -l admin -p password -o reboot

Re: [Linux-cluster] fence_ipmilan fails to reboot - SOLVED

2011-07-01 Thread Parvez Shaikh
think you need to add the power_wait10 lanplus=1 Try this line: fencedevice agent=fence_ipmilan power_wait=10 ipaddr=xx.xx.xx.xx lanplus=1 login=xxxt name=node1_ilo passwd=yyy Regards Shalom. On Thu, Jun 30, 2011 at 1:03 PM, Parvez Shaikh parvez.h.sha...@gmail.comwrote: Hi all, I am

[Linux-cluster] Configuring failover time with Red Hat Cluster

2011-07-05 Thread Parvez Shaikh
Hi all, I was trying to find out how much time does it take for RHCS to detect failure and recover from it. I found the link - http://www.redhat.com/whitepapers/rha/RHA_ClusterSuiteWPPDF.pdf It says that network polling interval is 2 seconds and 6 retries are attempted before declaring a node as

Re: [Linux-cluster] Configuring failover time with Red Hat Cluster

2011-07-05 Thread Parvez Shaikh
more I recommend you read the documentation for the actual version of clustering you are going to install https://access.redhat.com/**knowledge/docs/Red_Hat_**Enterprise_Linux/https://access.redhat.com/knowledge/docs/Red_Hat_Enterprise_Linux/ Chrissie On 05/07/11 12:32, Parvez Shaikh wrote

[Linux-cluster] $OCF_ERR_CONFIGURED - recovers service on another cluster node

2012-01-27 Thread Parvez Shaikh
Hi guys, I am using Red Hat Cluster Suite which comes with RHEL 5.5 - cman_tool version 6.2.0 config xxx Now I have a script resource in which I return $OCF_ERR_CONFIGURED; in case of a Fatal irrecoverable error, hoping that my service would not start on another cluster node. But I see that

Re: [Linux-cluster] $OCF_ERR_CONFIGURED - recovers service on another cluster node

2012-01-27 Thread Parvez Shaikh
it's wrong 2012/1/27 Parvez Shaikh parvez.h.sha...@gmail.com Hi guys, I am using Red Hat Cluster Suite which comes with RHEL 5.5 - cman_tool version 6.2.0 config xxx Now I have a script resource in which I return $OCF_ERR_CONFIGURED; in case of a Fatal irrecoverable error, hoping that my

[Linux-cluster] [TOTEM] The consensus timeout expired.

2012-03-26 Thread Parvez Shaikh
Hi all, I have a cluster with two blades in IBM BladeCenter. Following error is appearing when I start cman service and it keep repeating the message /var/log/messages - openais[10770]: [TOTEM] The consensus timeout expired. openais[10770]: [TOTEM] entering GATHER state from 3. Heart

[Linux-cluster] Multicast address by CMAN

2012-04-03 Thread Parvez Shaikh
Hi all, As per my understanding, CMAN uses cluster name to internally generate multi-cast address. In my cluster.conf Having a cluster with same name in a given network leads to issue and is undesirable. I want to know is there anyway to find if multicast address is already in use by some other

[Linux-cluster] clurgmgrd : notice relocating a service to better node

2012-04-11 Thread Parvez Shaikh
Hi, When I start or enable a service (that was previously disabled) on a a cluster node, I see message saying clurmgrd relocating service to better node. I am not understanding why. I can relocate service back to a node where I see above message and it runs fine there. What does better node

Re: [Linux-cluster] clurgmgrd : notice relocating a service to better node

2012-04-11 Thread Parvez Shaikh
fence_daemon clean_start=1 post_fail_delay=0 post_join_delay=0/ /cluster On Wed, Apr 11, 2012 at 11:51 AM, Digimer li...@alteeve.ca wrote: On 04/11/2012 02:14 AM, Parvez Shaikh wrote: Hi, When I start or enable a service (that was previously disabled) on a a cluster node, I see message

Re: [Linux-cluster] How to add shell script to cluster.conf

2012-09-16 Thread Parvez Shaikh
From this link - https://access.redhat.com/knowledge/docs/en-US/Red_Hat_Enterprise_Linux/4/html/Cluster_Administration/s1-config-service-dev-CA.html Script *Name* — Enter a name for the custom user script. *File (with path)* — Enter the path where this custom script is located (for example,

Re: [Linux-cluster] 2 node cluster showing strange behaviour

2012-09-17 Thread Parvez Shaikh
Had similar issues however I was using RHEL 5.5 Please refer - https://access.redhat.com/knowledge/solutions/18542 On Mon, Sep 17, 2012 at 9:22 PM, Ben .T.George bentech4...@gmail.comwrote: HI i am just started building 2 node cluster.i installed all packages of red hat cluster suite by

Re: [Linux-cluster] linux-cluster

2012-10-02 Thread Parvez Shaikh
What kind of cluster is this - an academic project or production quality solution? If its former - go for manual fencing. You wont need fence device but failover wont be automatic If its later - yes you'll need fence device On Mon, Oct 1, 2012 at 10:15 PM, Rajagopal Swaminathan

Re: [Linux-cluster] linux-cluster

2012-10-02 Thread Parvez Shaikh
Hi Digimer, Could you please give me reference/case studies of problem about why manual fencing was dropped and how automated fencing is fixing those? Thanks, Parvez On Tue, Oct 2, 2012 at 7:08 PM, Digimer li...@alteeve.ca wrote: On 10/02/2012 04:00 AM, Parvez Shaikh wrote: What kind

Re: [Linux-cluster] Hi

2012-10-02 Thread Parvez Shaikh
A curious observation, there is a sudden surge of sending emails on private addresses rather than sending over a mailing list. Please send your doubts / questions on mailing list linux-cluster@redhat.com instead of addressing personally. Regarding configuration for manual fencing - I don't have

[Linux-cluster] Not restarting max_restart times before relocating failed service

2012-10-30 Thread Parvez Shaikh
Hi experts, I have defined a service as follows in cluster.conf - service autostart=0 domain=mydomain exclusive=0 max_restarts=5 name=mgmt recovery=restart script ref=myHaAgent/ ip ref=192.168.51.51/ /service I

[Linux-cluster] Monitoring Frequency - can it be changed?

2012-10-30 Thread Parvez Shaikh
Hi experts, Can we change frequency at which resources are monitored by Cluster? I observed 30 seconds as monitoring frequency. Thanks, Parvez -- Linux-cluster mailing list Linux-cluster@redhat.com https://www.redhat.com/mailman/listinfo/linux-cluster

Re: [Linux-cluster] Not restarting max_restart times before relocating failed service

2012-10-30 Thread Parvez Shaikh
/ ip ref=192.168.51.51/ /service /rm fence_daemon clean_start=1 post_fail_delay=0 post_join_delay=0/ /cluster Thanks, Parvez On Tue, Oct 30, 2012 at 9:25 PM, Digimer li...@alteeve.ca wrote: On 10/30/2012 01:54 AM, Parvez Shaikh wrote: Hi

Re: [Linux-cluster] Not restarting max_restart times before relocating failed service

2012-10-31 Thread Parvez Shaikh
Hi, I am using recovery=restart as evident from earlier attached cluster.conf Thanks, Parvez On Wed, Oct 31, 2012 at 2:53 PM, emmanuel segura emi2f...@gmail.com wrote: Hello Maybe you missing recovery=restart in your services 2012/10/31 Parvez Shaikh parvez.h.sha...@gmail.com Hi Digimer

[Linux-cluster] Normal startup vs startup due to failover on cluster node - can they be distinguished?

2012-11-22 Thread Parvez Shaikh
Hi experts, I am using Red Hat Cluster available on RHEL 5.5. And it doesn't have any inbuilt mechanism to generate SNMP traps in failures of resources or failover of services from one node to another. I have a script agent, which starts, stops and checks status of my application. Is it possible

Re: [Linux-cluster] Normal startup vs startup due to failover on cluster node - can they be distinguished?

2012-11-27 Thread Parvez Shaikh
you have On 23 November 2012 10:55, Parvez Shaikh parvez.h.sha...@gmail.com wrote: Hi experts, I am using Red Hat Cluster available on RHEL 5.5. And it doesn't have any inbuilt mechanism to generate SNMP traps in failures of resources or failover of services from one node to another