Re: [ClusterLabs] Announcing ClusterLabs Summit 2020

2019-11-05 Thread Valentin Vidić
On Mon, Nov 04, 2019 at 08:07:51PM -0600, Ken Gaillot wrote: > A reminder: We are still interested in ideas for talks, and rough > estimates of potential attendees. "Maybe" is perfectly fine at this > stage. It will let us negotiate hotel rates and firm up the location > details. Not sure if I

Re: [ClusterLabs] Announcing ClusterLabs Summit 2020

2019-11-05 Thread Valentin Vidić
On Tue, Nov 05, 2019 at 09:55:33PM +0100, Jehan-Guillaume de Rorthais wrote: > There's the Cluster Test Suite (CTS) provided with Pacemaker source code. It > can exercice any Pacemaker cluster you are able to build, with predefined > scenarios. > > Sadly, there's no way (yet) to extend it with

[ClusterLabs] SBD build problem

2019-11-10 Thread Valentin Vidić
Hi, I have some problems building the latest sbd from the repo, it seems like the file 'tests-opt.m4' might be missing? dpkg-buildpackage - Command: dpkg-buildpackage -us -uc -rfakeroot dpkg-buildpackage: info: source package sbd dpkg-buildpackage: info: source version

Re: [ClusterLabs] Antw: Re: Concept of a Shared ipaddress/resource for generic applicatons

2019-12-03 Thread Valentin Vidić
On Tue, Dec 03, 2019 at 08:03:18AM +0100, Ulrich Windl wrote: > Probably while doing so, also provide better documentation (e.g. hardware > requirements). I could not get the cluster IP working when I tried several > years ago. Maybe it was due to our networking equipment, but actually I never >

Re: [ClusterLabs] Concept of a Shared ipaddress/resource for generic applicatons

2019-12-03 Thread Valentin Vidić
On Tue, Dec 03, 2019 at 03:06:14PM +0100, Jan Pokorný wrote: > You likely refer to > > https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=43270b1bc5f1e33522dacf3d3b9175c29404c36c > > however this extension is activelly maintained to this day, so don't > see any

Re: [ClusterLabs] Concept of a Shared ipaddress/resource for generic applicatons

2019-12-03 Thread Valentin Vidić
On Tue, Dec 03, 2019 at 08:38:06PM +0100, Valentin Vidić wrote: > The module might still work but the iptables command from the agent fails: > > [ 842.536916] ipt_CLUSTERIP: ClusterIP Version 0.8 loaded successfully > [ 842.539215] ipt_CLUSTERIP: cannot use CLUSTERIP target from nft

Re: [ClusterLabs] Concept of a Shared ipaddress/resource for generic applicatons

2019-12-03 Thread Valentin Vidić
On Tue, Dec 03, 2019 at 11:14:41PM +0100, Jan Pokorný wrote: > The conclusion is hence that even with bleeding edge software > collection, there's no real problem in using ipt_CLUSTERIP > (when compiled in or alongside kernel) when a proper interface > is used, which may boil down to using an

Re: [ClusterLabs] Safe way to stop pacemaker on both nodes of a two node cluster

2019-10-20 Thread Valentin Vidić
On Sun, Oct 20, 2019 at 09:24:31PM +0530, Dileep V Nair wrote: > I am confused about the best way to stop pacemaker on both nodes of a > two node cluster. The options I know of are > 1. Put the cluster in Maintenance Mode, stop the applications manually and > then stop pacemaker on both

Re: [ClusterLabs] Debian 10 pacemaker - CIB did not pass schema validation

2020-03-02 Thread Valentin Vidić
On Mon, Mar 02, 2020 at 11:22:55AM +, Bala Mutyam wrote: > I'm trying to setup Pacemaker cluster with 2 VIPs and a group with the VIPs > and service for squid proxy. But the CIB verification is failing with below > errors. Could someone help me with this please? > > Errors: > > crm_verify

Re: [ClusterLabs] clusterlabs.org upgrade done

2020-03-03 Thread Valentin Vidić
On Sat, Feb 29, 2020 at 03:44:50PM -0600, Ken Gaillot wrote: > The clusterlabs.org server OS upgrade is (mostly) done. > > Services are back up, with the exception of some cosmetic issues and > the source code continuous integration testing for ClusterLabs github > projects (ci.kronosnet.org).

Re: [ClusterLabs] Antw: [EXT] Re: clusterlabs.org upgrade done

2020-03-04 Thread Valentin Vidić
On Wed, Mar 04, 2020 at 10:05:50AM +0200, Strahil Nikolov wrote: > Maybe I will be unsubscribed every 10th email instead of every 5th one. AFAICT from the reports, the mail I send to the list might not get delivered, perhaps this is causing the unsubscribe too: 78.46.95.29 2

Re: [ClusterLabs] Antw: [EXT] Re: clusterlabs.org upgrade done

2020-03-05 Thread Valentin Vidić
On Thu, Mar 05, 2020 at 11:07:04PM +0200, Strahil Nikolov wrote: > After random amount of e-mails, I got a notification that I'm > unsubscribed due to maximum ammount of bounces reached, but I got no > e-mail about that from yahoo. > > Actually I have no clue about the reason. Yep, you probably

Re: [ClusterLabs] Antw: [EXT] Re: clusterlabs.org upgrade done

2020-03-05 Thread Valentin Vidić
On Wed, Mar 04, 2020 at 10:05:50AM +0200, Strahil Nikolov wrote: > Maybe I will be unsubscribed every 10th email instead of every 5th one. In the default Mailman config unsubscribe score seems to be 5.0, but you can only get 1.0 per day if there are bounces. Also score is reset to 0 if there

Re: [ClusterLabs] Two-node cluster stops resources when second node is running alone

2020-02-22 Thread Valentin Vidić
On Thu, Feb 20, 2020 at 05:05:58PM +, Reynolds, John F - San Mateo, CA - Contractor wrote: > There is one anomalous entry in cib.xml, the line: > > > > That syntax is wrong, and there should be an opening and closing constraint, > shouldn't there? Nope, this is fine: = As for the

Re: [ClusterLabs] Making xt_cluster IP load-sharing work with IPv6 (Was: Concept of a Shared ipaddress/resource for generic applicatons)[

2020-01-03 Thread Valentin Vidić
On Thu, Jan 02, 2020 at 09:52:09PM +0100, Jan Pokorný wrote: > What you've used appears to be akin to what this chunk of manpage > suggests (amongst others): > https://git.netfilter.org/iptables/tree/extensions/libxt_cluster.man > > which is (yet another) indicator to me that xt_cluster extension

Re: [ClusterLabs] Concept of a Shared ipaddress/resource for generic applicatons

2019-12-27 Thread Valentin Vidić
On Wed, Dec 04, 2019 at 02:44:49PM +0100, Jan Pokorný wrote: > For the record, based on my feedback, iptables-extensions man page is > headed to (finally) align with the actual in-kernel deprecation > message: > https://lore.kernel.org/netfilter-devel/20191204130921.2914-1-p...@nwl.cc/ >From a

Re: [ClusterLabs] Antw: [EXT] Re: clusterlabs.org upgrade done

2020-03-05 Thread Valentin Vidić
On Thu, Mar 05, 2020 at 11:44:55AM -0600, Ken Gaillot wrote: > What sort of issue are you seeing exactly? Is your account being > unsubscribed from the list automatically, or are you not receiving some > of the emails sent by the list? He is on yahoo and based on this Mailman page it seems yahoo

Re: [ClusterLabs] Antw: [EXT] Re: clusterlabs.org upgrade done

2020-03-05 Thread Valentin Vidić
On Thu, Mar 05, 2020 at 11:46:16AM -0600, Ken Gaillot wrote: > Hmm, not sure what the best approach is. I think some people like > having the [ClusterLabs] tag in the subject line. If anyone has > suggested config changes for mailman 2, I can take a look. In that case it would be best to rewrite

Re: [ClusterLabs] Two-node Pacemaker cluster with "fence_aws" fence agent

2020-09-04 Thread Valentin Vidić
On Fri, Sep 04, 2020 at 05:24:00PM -0400, Digimer wrote: > It would depend on AWS, and I don't believe it's a good idea to design a > solution that depends on a third party's behaviour. It would be strange if AWS control API for node1 would be serialized with control for node2. In fact fence_aws

Re: [ClusterLabs] Antw: [EXT] Re: Removing DRBD w/out Data Loss?

2020-09-10 Thread Valentin Vidić
On Thu, Sep 10, 2020 at 09:39:09AM +0200, Ulrich Windl wrote: > But doesn't hat make resizing the volume a big mess (similar to > resizing GPT disks)? Yes, it is tricky but should be possible by extending the LVM or shrinking the filesystem by the required DRBD size. Also I think create-md checks

Re: [ClusterLabs] Removing DRBD w/out Data Loss?

2020-09-09 Thread Valentin Vidić
On Wed, Sep 09, 2020 at 12:10:54PM +, Eric Robinson wrote: > With DRBD stopped, wipefs only showed one signature... > > [root@001db01 ~]# wipefs /dev/vg0/lv0 > offset type > > 0x438ext4

Re: [ClusterLabs] Removing DRBD w/out Data Loss?

2020-09-08 Thread Valentin Vidić
On Tue, Sep 08, 2020 at 02:33:37PM +, Eric Robinson wrote: > I checked the DRBD manual for this, but didn't see an answer. We need to > convert a DRBD cluster node into standalone server and remove DRBD without > losing the data. Is that possible? I asked on the DRBD list but it didn't get

Re: [ClusterLabs] Open Source Linux Load Balancer with HA and Split Brain Prevention?

2020-10-04 Thread Valentin Vidić
On Sun, Oct 04, 2020 at 09:28:40PM +, Eric Robinson wrote: > I don't want to proxy the services. I just want NAT redirection at > Layer 4, using LVS. Basically, all I need is a good health-checker > that works with LVS, like ldirectord does (except newer technology). keepalived is one

Re: [ClusterLabs] Open Source Linux Load Balancer with HA and Split Brain Prevention?

2020-10-05 Thread Valentin Vidić
On Sun, Oct 04, 2020 at 11:34:52PM +, Eric Robinson wrote: > I've been experimenting with keepalived. It relies on VRRP, but VRRP > does not have split-brain prevention. Perhaps keepalived can be configured to only setup IPVS (no VRRP) and than added to pacemaker as a systemd service. --

Re: [ClusterLabs] VirtualDomain does not stop via "crm resource stop" - modify RA ?

2020-10-23 Thread Valentin Vidić
On Fri, Oct 23, 2020 at 08:08:31PM +0200, Lentes, Bernd wrote: > But when the timeout has run out the RA tries to kill the machine with a > "virsh destroy". > And if that does not work (what is occasionally my problem) because the domain > is in uninterruptable sleep (D state) the RA gives a

Re: [ClusterLabs] Antw: [EXT] Re: Setting up HA cluster on Raspberry pi4 with ubuntu 20.04 aarch64 architecture

2020-06-15 Thread Valentin Vidić
On Mon, Jun 15, 2020 at 09:44:49AM +0200, Ulrich Windl wrote: > I wonder what "resource-agents-deps.target" > (Description=resource-agents dependencies) is for (in SLES12 SP5). Seems to be for external services that resource-agents might need:

Re: [ClusterLabs] Redudant Ring Network failure

2020-06-11 Thread Valentin Vidić
On Thu, Jun 11, 2020 at 09:46:14AM +0200, Jan Friesse wrote: > > Jan, > > > > actually we using this. > > > > [root@lvm-nfscpdata-05ct::~ 100 ]# apt show corosync > > Package: corosync > > Version: 3.0.1-2+deb10u1 > > > > [root@lvm-nfscpdata-05ct::~]# apt show libknet1 > > Package: libknet1 > >

Re: [ClusterLabs] Setting up HA cluster on Raspberry pi4 with ubuntu 20.04 aarch64 architecture

2020-06-11 Thread Valentin Vidić
On Thu, Jun 11, 2020 at 11:40:22AM +0530, Jayadeva DB wrote: > I have installed ubuntu 20.04 aarch64 OS on raspberry pi4. > I want to set up HA cluster using pacemaker corosync and crm . > I have been following this link > https://clusterlabs.org/quickstart-ubuntu.html . > I am able to install

Re: [ClusterLabs] Beginner Question about VirtualDomain

2021-01-21 Thread Valentin Vidić
On Wed, Aug 19, 2020 at 01:10:08AM -0400, Digimer wrote: > 3. We changed DRBD from v8.4 to 9.0, and this meant a few things had to > change. We will integrate support for short-throw DR hosts (async "third > node" in DRBD that is outside pacemaker). We run the resources to only > allow a single

Re: [ClusterLabs] "Error: unable to fence '001db02a'" but It got fenced anyway

2021-02-28 Thread Valentin Vidić
On Sun, Feb 28, 2021 at 03:34:20PM +, Eric Robinson wrote: > 001db02b rebooted. After it came back up, I tried it in the other direction. > > On node 001db02b, the command... > > # pcs stonith fence 001db02a > > ...produced output... > > Error: unable to fence '001db02a'. > > However,

Re: [ClusterLabs] Filesystem Resource Move Fails Because Underlying DRBD Resource Won't Move

2021-02-28 Thread Valentin Vidić
On Sun, Feb 28, 2021 at 12:45:55PM +, Eric Robinson wrote: > Colocation Constraints: > p_fs_clust03 with ms_drbd0 (score:INFINITY) > (id:colocation-p_fs_clust03-ms_drbd0-INFINITY) > p_fs_clust04 with ms_drbd1 (score:INFINITY) > (id:colocation-p_fs_clust04-ms_drbd1-INFINITY) This

Re: [ClusterLabs] "Error: unable to fence '001db02a'" but It got fenced anyway

2021-02-28 Thread Valentin Vidić
On Sun, Feb 28, 2021 at 05:54:26PM +, Eric Robinson wrote: > I made the changes and tried again. Fencing took about 3.5 minutes and > did not throw an error. Which raises the question, what happens if > fencing takes more than 900 seconds? Will Pacemaker on the survivor > node refuse to start

Re: [ClusterLabs] Our 2-Node Cluster with a Separate Qdevice Went Down Anyway?

2021-02-28 Thread Valentin Vidić
On Sun, Feb 28, 2021 at 07:45:27AM +, Strahil Nikolov wrote: > As this is in Asure and they support shared disks , I think that a simple SBD > could solve the stonith case. Also fence_azure_arm: Azure Resource Manager :) -- Valentin ___ Manage

Re: [ClusterLabs] @ - unsupported char - bug or something to improve upon?

2021-02-20 Thread Valentin Vidić
On Sat, Feb 20, 2021 at 09:35:33AM +, lejeczek wrote: > -> $ pcs resource create "dropbox\@me" systemd:"dropbox\@me" > > as you can see I've been trying to escape '@' too, in various ways, to no > avail. Right, but just changing the resource name should work: $ pcs resource create

Re: [ClusterLabs] resource start after network reconnected

2021-11-20 Thread Valentin Vidić via Users
On Sat, Nov 20, 2021 at 08:33:26PM +, Strahil Nikolov via Users wrote: > You can also use this 3rd node to provide iSCSI and then the SBD will > be disk-full :D . The good thing about this type of setup is that you > do won't need to put location constraints for the 3rd node. Wouldn't that

Re: [ClusterLabs] resource start after network reconnected

2021-11-18 Thread Valentin Vidić via Users
On Thu, Nov 18, 2021 at 02:33:28PM -0500, john tillman wrote: > preamble: RHEL8, PCS 0.10.8, COROSYNC 3.1.0, PACEMAKER 2.0.5 > > I have a mysql resource, cloned, that is behaving the way I wanted. When > the node it is on is unplugged from the network quorum is lost and the > mysqld service

Re: [ClusterLabs] resource start after network reconnected

2021-11-18 Thread Valentin Vidić via Users
On Thu, Nov 18, 2021 at 03:42:48PM -0500, john tillman wrote: > I don't believe I can since I do not have a fencing device available. As this page explains, fencing is required for the cluster to behave correctly: https://www.alteeve.com/w/The_2-Node_Myth Can you share what kind of nodes are

Re: [ClusterLabs] resource start after network reconnected

2021-11-19 Thread Valentin Vidić via Users
On Fri, Nov 19, 2021 at 11:26:01AM -0500, john tillman wrote: > Anyone have any other ideas for a configuration setting that will > effectively do whatever 'pcs resource refresh' is doing when quorum is > restored? Since you have three nodes you may want to use the third node as QDevice instead:

Re: [ClusterLabs] How many nodes redhat cluster does supports

2022-04-27 Thread Valentin Vidić via Users
On Thu, Apr 28, 2022 at 12:25:37AM +0500, Umar Draz wrote: > * sharedfs1_start_0 on g2fs-1 'error' (1): call=158, status='complete', > exitreason='Couldn't mount device [/dev/shared_vg1/shared_lv1] as > /mnt/webgfs', last-rc-change='Tue Apr 26 01:07:45 2022', queued=0ms, > exec=806ms Maybe the

Re: [ClusterLabs] [External] : Re: Fence Agent tests

2022-11-05 Thread Valentin Vidić via Users
On Sat, Nov 05, 2022 at 06:47:59PM +, Robert Hayden wrote: > That was my impression as well...so I may have something wrong. My > expectation was that SBD daemon > should be writing to the /dev/watchdog within 20 seconds and the kernel > watchdog would self fence. I don't see anything

Re: [ClusterLabs] [External] : Re: Fence Agent tests

2022-11-05 Thread Valentin Vidić via Users
On Sat, Nov 05, 2022 at 05:20:47PM +, Robert Hayden wrote: > The OCI compute instances don't have a hardware watchdog, only the software > watchdog. > So, when the network goes completely hung (e.g. firewall-cmd panic-on), all > network > traffic stops which implies that IO to the SBD

Re: [ClusterLabs] [External] : Re: Fence Agent tests

2022-11-06 Thread Valentin Vidić via Users
On Sun, Nov 06, 2022 at 09:08:19PM +, Robert Hayden wrote: > When SBD_PACEMAKER was set to "yes", the lack of network connectivity to the > node > would be seen and acted upon by the remote nodes (evicts and takes > over ownership of the resources). But the impacted node would just > sit

Re: [ClusterLabs] Problem with MariaDB cluster

2023-02-01 Thread Valentin Vidić via Users
On Tue, Jan 31, 2023 at 02:45:46PM +, Thomas CAS wrote: > What solution can I use while waiting for a fix for this bug? > Modify RA? AFAICT this is not a bug in RA and notify variables are also set for start/stop/promote/demote actions:

Re: [ClusterLabs] Planning for Pacemaker 3

2024-01-03 Thread Valentin Vidić via Users
On Wed, Jan 03, 2024 at 11:06:27AM -0600, Ken Gaillot wrote: > I'd like to release Pacemaker 3.0.0 around the middle of this year. > I'm gathering proposed changes here: > > https://projects.clusterlabs.org/w/projects/pacemaker/pacemaker_3.0_changes/ > > Please review for anything that might