Re: [ClusterLabs] question on www.gossamer-threads.com/lists/linuxha/users/

2015-08-13 Thread Stefan Bauer
DOH! Please ignore my mail - I live in the past ;)

The last mail in the archive is from July.



Stefan

___
Users mailing list: Users@clusterlabs.org
http://clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org


Re: [ClusterLabs] Antw: CentOS 7 - Pacemaker - Problem with nfs-server and systemd

2015-08-12 Thread Stefan Bauer
Hi,



thank you for your reply. It seems to be a problem with the systemd unit files 
for nfs-server - specifically a timing issue.



[root@centos7-n1 ~]# systemctl list-unit-files --type=service | grep rpcbind
rpcbind.service                               static



rpcbind is set to static - it is supposed to be started on demand by other units.



Invoking systemctl start nfs-server pulls in rpcbind and nfs-lock.



rpcbind is started, but nfs-lock may be trying to start too early:



Manually invoking systemctl start rpcbind and then systemctl start nfs-lock 
works within a second.

Manually invoking systemctl start rpcbind and then systemctl start nfs-server 
works within a few seconds as well.



Manually invoking systemctl start nfs-server only works sporadically, apparently 
due to a timing issue.



My current workaround is to have the cluster also start rpcbind, just before 
nfsserver.
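
In crm shell syntax that boils down to something like the following (a simplified, 
untested sketch - resource names and operation values are only illustrative, not 
taken from my actual configuration):

# simplified sketch - resource names and timings are illustrative
primitive r_rpcbind systemd:rpcbind \
    op monitor interval=30s timeout=60s
primitive r_nfsserver systemd:nfs-server \
    op monitor interval=30s timeout=60s
# start rpcbind strictly before nfs-server and keep them on the same node
order ord_r_rpcbind_r_nfsserver inf: r_rpcbind r_nfsserver
colocation col_r_nfsserver_r_rpcbind inf: r_nfsserver r_rpcbind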



I also tried /usr/lib/ocf/resource.d/heartbeat/nfsserver - it is capable of 
handling systemd-based systems, but it starts nfs-lock and nfs-server itself and 
therefore hits the same problem in my case.



Cheers,



Stefan





-----Original Message-----
From: Ulrich Windl ulrich.wi...@rz.uni-regensburg.de

> [root@centos7-n1 ~]# time systemctl start nfs-server
>
> real    1m0.480s

Probably time to look into syslog. I suspect a name/address resolving
issue...
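
One way to check that, assuming the systemd journal is in use (standard journalctl 
options only):

# show messages from the current boot for the involved units
journalctl -b -u nfs-server -u nfs-lock -u rpcbind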



[ClusterLabs] group resources without order behavior / monitor timeout smaller than interval?

2016-09-14 Thread Stefan Bauer
Hi,

I'm trying to understand some cluster internals and would be happy to get some 
best practice recommendations:


Monitor interval and timeout: shouldn't the timeout value always be smaller than 
the interval, to avoid starting another check while the previous one has not finished yet?

Additionally, I would like to use the group feature to put all my VMs 
(ocf:heartbeat:VirtualDomain) into one group and colocate the group with the VIP 
and my LVM volume. Unfortunately, a group starts its resources in the listed 
order, so if I stop one VM, the following VMs are also stopped.

Right now I have the following configuration and want to make it less 
redundant:


# never let the stonith_service run on the host to stonith

location l_st_srv20 st_ipmi_srv20 -inf: srv20
location l_st_srv21 st_ipmi_srv21 -inf: srv21


# do not run resources on quorum only node
location loc_r_lvm_vg-storage_quorum_only_node r_lvm_vg-storage -inf: 
quorum_only_node
location loc_r_vm_ado01_quorum_only_node r_vm_ado01 -inf: quorum_only_node
location loc_r_vm_bar01_quorum_only_node r_vm_bar01 -inf: quorum_only_node
location loc_r_vm_cmt01_quorum_only_node r_vm_cmt01 -inf: quorum_only_node
location loc_r_vm_con01_quorum_only_node r_vm_con01 -inf: quorum_only_node
location loc_r_vm_con02_quorum_only_node r_vm_con02 -inf: quorum_only_node
location loc_r_vm_dsm01_quorum_only_node r_vm_dsm01 -inf: quorum_only_node
location loc_r_vm_jir01_quorum_only_node r_vm_jir01 -inf: quorum_only_node
location loc_r_vm_jir02_quorum_only_node r_vm_jir02 -inf: quorum_only_node
location loc_r_vm_prx02_quorum_only_node r_vm_prx02 -inf: quorum_only_node
location loc_r_vm_src01_quorum_only_node r_vm_src01 -inf: quorum_only_node


# colocate ip with lvm storage
colocation col_r_Failover_IP_r_lvm_vg-storage inf: r_Failover_IP 
r_lvm_vg-storage


# colocate each VM with lvm storage
colocation col_r_vm_ado01_r_lvm_vg-storage inf: r_vm_ado01 r_lvm_vg-storage
colocation col_r_vm_bar01_r_lvm_vg-storage inf: r_vm_bar01 r_lvm_vg-storage
colocation col_r_vm_cmt01_r_lvm_vg-storage inf: r_vm_cmt01 r_lvm_vg-storage
colocation col_r_vm_con01_r_lvm_vg-storage inf: r_vm_con01 r_lvm_vg-storage
colocation col_r_vm_con02_r_lvm_vg-storage inf: r_vm_con02 r_lvm_vg-storage
colocation col_r_vm_dsm01_r_lvm_vg-storage inf: r_vm_dsm01 r_lvm_vg-storage
colocation col_r_vm_jir01_r_lvm_vg-storage inf: r_vm_jir01 r_lvm_vg-storage
colocation col_r_vm_jir02_r_lvm_vg-storage inf: r_vm_jir02 r_lvm_vg-storage
colocation col_r_vm_prx02_r_lvm_vg-storage inf: r_vm_prx02 r_lvm_vg-storage
colocation col_r_vm_src01_r_lvm_vg-storage inf: r_vm_src01 r_lvm_vg-storage


# start lvm storage before VIP

order ord_r_lvm_vg-storage_r_Failover_IP inf: r_lvm_vg-storage r_Failover_IP


# start lvm storage before each VM
order ord_r_lvm_vg-storage_r_vm_ado01 inf: r_lvm_vg-storage r_vm_ado01
order ord_r_lvm_vg-storage_r_vm_bar01 inf: r_lvm_vg-storage r_vm_bar01
order ord_r_lvm_vg-storage_r_vm_cmt01 inf: r_lvm_vg-storage r_vm_cmt01
order ord_r_lvm_vg-storage_r_vm_con01 inf: r_lvm_vg-storage r_vm_con01
order ord_r_lvm_vg-storage_r_vm_con02 inf: r_lvm_vg-storage r_vm_con02
order ord_r_lvm_vg-storage_r_vm_dsm01 inf: r_lvm_vg-storage r_vm_dsm01
order ord_r_lvm_vg-storage_r_vm_jir01 inf: r_lvm_vg-storage r_vm_jir01
order ord_r_lvm_vg-storage_r_vm_jir02 inf: r_lvm_vg-storage r_vm_jir02
order ord_r_lvm_vg-storage_r_vm_prx02 inf: r_lvm_vg-storage r_vm_prx02
order ord_r_lvm_vg-storage_r_vm_src01 inf: r_lvm_vg-storage r_vm_src01

Any help is greatly appreciated.

Thank you.

Stefan



Re: [ClusterLabs] group resources without order behavior / monitor timeout smaller than interval?

2016-09-15 Thread Stefan Bauer
Hi Ken,



thank you very much. I will test it!



Cheers,



Stefan



-----Original Message-----
From: Ken Gaillot <kgail...@redhat.com>
Sent: Wed, 14 September 2016 21:59
To: users@clusterlabs.org
Subject: Re: [ClusterLabs] group resources without order behavior / monitor 
timeout smaller than interval?


On 09/14/2016 03:01 AM, Stefan Bauer wrote:
> Hi,
> 
> I'm trying to understand some cluster internals and would be happy to
> get some best practice recommendations:
> 
> monitor interval and timeout: shouldn't timeout value always be smaller
> than interval to avoid another check even though the first is not over yet?

The cluster handles it intelligently. If the previous monitor is still
in progress when the interval expires, it won't run another one.

It certainly makes sense that the timeout will generally be smaller than
the interval, but there may be cases where a monitor on rare occasions
takes a long time, and the user wants the high timeout for those
occasions, but a shorter interval that will be used most of the time.
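As an illustration only (not taken from your configuration), an operation
definition like the following is valid:

# illustrative snippet: check every 30s, but tolerate a slow check of up to 120s
primitive r_example ocf:heartbeat:Dummy \
    op monitor interval=30s timeout=120s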

> Additionally i would like to use the group function to put all my VMS
> (ocf:heartbeat:VirtualDomain) in one group and colocate the group with
> the VIP and my LVM-volume. Unfortunately group function starts the
> resources in the listed order. So if i stop one VM, the following VMs
> are also stopped.
> 
> Right now I'm having the following configuration and want to make it
> less redundant:

You can use one ordering constraint and one colocation constraint, each
with two resource sets, one containing the IP and volume with
sequential=true, and the other containing the VMs with sequential=false.
See:

http://clusterlabs.org/doc/en-US/Pacemaker/1.1-pcs/html-single/Pacemaker_Explained/index.html#s-resource-sets
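
A rough sketch of that in crm shell syntax, reusing the resource names from your
configuration (untested):

# storage first, then the VIP, then all VMs in parallel (sequential=false set)
order ord_storage_ip_vms inf: r_lvm_vg-storage r_Failover_IP \
    ( r_vm_ado01 r_vm_bar01 r_vm_cmt01 r_vm_con01 r_vm_con02 \
      r_vm_dsm01 r_vm_jir01 r_vm_jir02 r_vm_prx02 r_vm_src01 )
# keep all VMs (non-sequential set) together with the VIP and the storage
colocation col_vms_ip_storage inf: \
    ( r_vm_ado01 r_vm_bar01 r_vm_cmt01 r_vm_con01 r_vm_con02 \
      r_vm_dsm01 r_vm_jir01 r_vm_jir02 r_vm_prx02 r_vm_src01 ) \
    r_Failover_IP r_lvm_vg-storage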


> 
> # never let the stonith_service run on the host to stonith
> 
> location l_st_srv20 st_ipmi_srv20 -inf: srv20
> location l_st_srv21 st_ipmi_srv21 -inf: srv21
> 
> 
> # do not run resources on quorum only node
> location loc_r_lvm_vg-storage_quorum_only_node r_lvm_vg-storage -inf:
> quorum_only_node
> location loc_r_vm_ado01_quorum_only_node r_vm_ado01 -inf: quorum_only_node
> location loc_r_vm_bar01_quorum_only_node r_vm_bar01 -inf: quorum_only_node
> location loc_r_vm_cmt01_quorum_only_node r_vm_cmt01 -inf: quorum_only_node
> location loc_r_vm_con01_quorum_only_node r_vm_con01 -inf: quorum_only_node
> location loc_r_vm_con02_quorum_only_node r_vm_con02 -inf: quorum_only_node
> location loc_r_vm_dsm01_quorum_only_node r_vm_dsm01 -inf: quorum_only_node
> location loc_r_vm_jir01_quorum_only_node r_vm_jir01 -inf: quorum_only_node
> location loc_r_vm_jir02_quorum_only_node r_vm_jir02 -inf: quorum_only_node
> location loc_r_vm_prx02_quorum_only_node r_vm_prx02 -inf: quorum_only_node
> location loc_r_vm_src01_quorum_only_node r_vm_src01 -inf: quorum_only_node
> 
> 
> # colocate ip with lvm storage
> colocation col_r_Failover_IP_r_lvm_vg-storage inf: r_Failover_IP
> r_lvm_vg-storage
> 
> 
> # colocate each VM with lvm storage
> colocation col_r_vm_ado01_r_lvm_vg-storage inf: r_vm_ado01 r_lvm_vg-storage
> colocation col_r_vm_bar01_r_lvm_vg-storage inf: r_vm_bar01 r_lvm_vg-storage
> colocation col_r_vm_cmt01_r_lvm_vg-storage inf: r_vm_cmt01 r_lvm_vg-storage
> colocation col_r_vm_con01_r_lvm_vg-storage inf: r_vm_con01 r_lvm_vg-storage
> colocation col_r_vm_con02_r_lvm_vg-storage inf: r_vm_con02 r_lvm_vg-storage
> colocation col_r_vm_dsm01_r_lvm_vg-storage inf: r_vm_dsm01 r_lvm_vg-storage
> colocation col_r_vm_jir01_r_lvm_vg-storage inf: r_vm_jir01 r_lvm_vg-storage
> colocation col_r_vm_jir02_r_lvm_vg-storage inf: r_vm_jir02 r_lvm_vg-storage
> colocation col_r_vm_prx02_r_lvm_vg-storage inf: r_vm_prx02 r_lvm_vg-storage
> colocation col_r_vm_src01_r_lvm_vg-storage inf: r_vm_src01 r_lvm_vg-storage
> 
> # start lvm storage before VIP
> 
> order ord_r_lvm_vg-storage_r_Failover_IP inf: r_lvm_vg-storage r_Failover_IP
> 
> 
> # start lvm storage before each VM
> order ord_r_lvm_vg-storage_r_vm_ado01 inf: r_lvm_vg-storage r_vm_ado01
> order ord_r_lvm_vg-storage_r_vm_bar01 inf: r_lvm_vg-storage r_vm_bar01
> order ord_r_lvm_vg-storage_r_vm_cmt01 inf: r_lvm_vg-storage r_vm_cmt01
> order ord_r_lvm_vg-storage_r_vm_con01 inf: r_lvm_vg-storage r_vm_con01
> order ord_r_lvm_vg-storage_r_vm_con02 inf: r_lvm_vg-storage r_vm_con02
> order ord_r_lvm_vg-storage_r_vm_dsm01 inf: r_lvm_vg-storage r_vm_dsm01
> order ord_r_lvm_vg-storage_r_vm_jir01 inf: r_lvm_vg-storage r_vm_jir01
> order ord_r_lvm_vg-storage_r_vm_jir02 inf: r_lvm_vg-storage r_vm_jir02
> order ord_r_lvm_vg-storage_r_vm_prx02 inf: r_lvm_vg-storage r_vm_prx02
> order ord_r_lvm_vg-storage_r_vm_src01 inf: r_lvm_vg-storage r_vm_src01
> 
> any help is greatly appreciated.
> 
> thank you.
> 
>

[ClusterLabs] best practice fencing with IPMI in 2-node setups / clone resource/monitor/timeout

2016-09-20 Thread Stefan Bauer
Hi,



I run a 2-node cluster and want to be safe in split-brain scenarios. For this I 
set up external/ipmi to stonith the other node.



Some possible issues came to mind and I would like to find the best-practice 
solution:



- I have a stonith primitive for each node. Many documents and guides recommend 
never letting such a primitive run on the host it is supposed to fence. I would 
set up clone resources instead, to avoid dealing with location constraints, which 
would also influence scoring. Does that make sense? (For comparison, the 
location-constraint variant is sketched below.)



- The monitor operation on the stonith primitive seems dangerous. I read that if 
monitor operations fail for the stonith device, a stonith action is triggered. I 
think it is not clever to give the cluster the option to fence a node just 
because that node has trouble monitoring a fence device; that should not be a 
reason to shut down a node. What is your opinion on this? Can I simply set the 
primitive's monitor operation to disabled?
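
For reference, the location-constraint variant in crm shell syntax would look 
roughly like this (an untested sketch: the IPMI addresses and credentials are 
placeholders, the parameter names depend on the external/ipmi plugin version, and 
enabled=false on the monitor operation is only meant as an illustration of 
switching monitoring off, not a recommendation):

# untested sketch - IPMI addresses and credentials are placeholders
primitive st_ipmi_srv20 stonith:external/ipmi \
    params hostname=srv20 ipaddr=192.0.2.20 userid=admin passwd=secret interface=lanplus \
    op monitor interval=3600s timeout=60s enabled=false
primitive st_ipmi_srv21 stonith:external/ipmi \
    params hostname=srv21 ipaddr=192.0.2.21 userid=admin passwd=secret interface=lanplus \
    op monitor interval=3600s timeout=60s enabled=false
# never run a fencing device on the node it is meant to fence
location l_st_srv20 st_ipmi_srv20 -inf: srv20
location l_st_srv21 st_ipmi_srv21 -inf: srv21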



Thank you.



Stefan
