Hi Qusay,

Please find my responses in-line.

On 10/6/23 14:53, Jibrail, Qusay (GfK) via Users wrote:
Hi,

May I get an answer please?

Kind regards,

*––*

*Qusay Jibrail*

Senior Infrastructure Engineer – Linux | GfK IT Services
GfK – an NIQ company | The Netherlands
Krijgsman 22-25 | Amstelveen | 1186 DM
T: +31 88 435 1232 | M: +31 628 927 686

*From:*Jibrail, Qusay (GfK)
*Sent:* Wednesday, 4 October 2023 11:11
*To:* Cluster Labs - All topics related to open-source clustering welcomed <users@clusterlabs.org>
*Subject:* RE: [ClusterLabs] Upgrade to OLE8 + Pacemaker

Hi Tomas,

OK, it is getting a little bit complicated.

What about this approach:

  * pcs cluster stop “*server3*”, upgrade to OLE8 and update pacemaker,
    corosync and pcs, check that postfix is working, wait 1 day.
  * pcs cluster stop “*server4*”, upgrade to OLE8 and update pacemaker,
    corosync and pcs, check that postfix is working, wait 1 day. Now both
    servers run the same versions of the OS, pacemaker, corosync and pcs.
  * Then pcs cluster *start* “server3” and pcs cluster *start* “server4”.

Will the above work?

If you expect postfix to start in the cluster on the upgraded node
"*server3*", it will not start, because the cluster will wait until it
forms quorum with the other node. The other node will not be able to
communicate until it is upgraded. One possible way to start the postfix
resource would be removing the second node from the cluster
configuration... This brings us back to the one-node cluster solution
which Tomas proposed.
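
A minimal way to see this from the upgraded node itself (a sketch;
hostnames server3/server4 come from the status output later in this
thread):

```
# quorum state as corosync sees it; with the peer unreachable, a plain
# two-node cluster will report "Quorate: No" here
corosync-quorumtool -s

# pacemaker's view: this node is the only member and resources stay
# Stopped until quorum is formed
crm_mon -1
```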

Your solution could work after solving some obstacles, but you would not
end up with updated cluster configuration files. For example, you would
not benefit from the new corosync network transport layer, knet.
Therefore I think a new cluster is a better solution.

I would create a new one-node cluster with the resources, and then I
would add the second node with `pcs cluster node add` after the second
node is upgraded. This makes sure the cluster configuration files are in
the current format.

An example procedure could look like this:
* back up configuration files (`pcs config backup`, /etc files...)
* put one node into standby mode with `pcs cluster standby` and
  check that all resources have moved to the other node (`pcs status`)
* stop the cluster on that node (`pcs cluster stop`)
* remove the cluster configuration from the node (`pcs cluster destroy`)
* upgrade to OLE8 or do a fresh install (which will be easier)
* after the reboot, set up a new one-node cluster (`pcs host auth`,
  `pcs cluster setup`)
* start the cluster (`pcs cluster start`)
* set up resources (`pcs stonith create`, `pcs resource create`)
* check that the resources are working
* back up the configuration files on the second node
* stop the cluster on the second node (`pcs cluster stop`)
* remove the cluster configuration from the second node (`pcs cluster
  destroy`)
* upgrade or fresh install + reboot
* on the first node, add the second node (`pcs host auth`, `pcs cluster
  node add`)
* check that the added node is running resources (`pcs status`)
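
Put together as commands, the procedure could look roughly like the
sketch below. The cluster name (`newcluster`), the backup paths and the
postfix resource agent (`systemd:postfix`) are assumptions; replace them
with your real configuration:

```
# --- on server3, still OLE7 (pcs 0.9 syntax) ---
pcs config backup /root/server3-backup   # also copy /etc/corosync, /etc/postfix
pcs cluster standby server3              # resources move to server4
pcs status                               # confirm server4 now runs everything
pcs cluster stop server3
pcs cluster destroy                      # wipes cluster config on server3 only

# --- upgrade server3 to OLE8 (or fresh install), then reboot ---

# --- on server3, now OLE8 (pcs 0.10 syntax) ---
pcs host auth server3
pcs cluster setup newcluster server3     # one-node cluster on knet transport
pcs cluster start
# recreate stonith and resources as you had them; the agent here is an
# assumption based on the smtpout-postfix-res clone in your status output
pcs resource create smtpout-postfix-res systemd:postfix clone
pcs status                               # clone should be Started on server3

# --- on server4: backup, stop, destroy, then upgrade + reboot ---
pcs config backup /root/server4-backup
pcs cluster stop server4
pcs cluster destroy

# --- after server4 is on OLE8, from server3 ---
pcs host auth server4
pcs cluster node add server4 --start
pcs status                               # both nodes online, clone on both
```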


We will have 2 days without load balancing.

As the server will be rebooted during the OLE8 upgrade, is it better to disable the pacemaker, corosync and pcs services from starting after the reboot?

Or will they not start until I run the commands pcs cluster *start* “server3” and pcs cluster *start* “server4”?

To prevent the cluster from starting after a reboot, you need to disable
the pacemaker and corosync services with `pcs cluster disable`.

Your status output shows that they are enabled:
```
Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: active/enabled
```
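
A minimal sketch of the disable/enable cycle (node names taken from this
thread; the command is the same on pcs 0.9 and 0.10):

```
# keep corosync/pacemaker from starting at boot, per node or cluster-wide
pcs cluster disable server3
pcs cluster disable --all

# verify: the Daemon Status lines should now read "active/disabled"
pcs status

# re-enable autostart once the upgrade is finished
pcs cluster enable --all
```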

Kind regards,

*Qusay Jibrail*

*From:* Users <users-boun...@clusterlabs.org> *On Behalf Of* Tomas Jelinek
*Sent:* Tuesday, 3 October 2023 16:50
*To:* users@clusterlabs.org
*Subject:* Re: [ClusterLabs] Upgrade to OLE8 + Pacemaker

On 03. 10. 23 at 16:24, Jibrail, Qusay (GfK) via Users wrote:

    Hi Reid,

    Thank you for the answer.

    So my plan will be:

     1. pcs config backup  /root/"Server Name"
     2. create a backup of /etc/corosync/
     3. create a backup of /etc/postfix
     4. pcs cluster stop “server3”, just to fail over to server4.
        The command pcs cluster stop “server3” will stop corosync and
        pacemaker, right?

Hi,

Yes, 'pcs cluster stop' command stops both pacemaker and corosync.
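
If you want to verify that, a quick check could look like this (a
sketch, with hostnames from this thread):

```
# on server3: stop the cluster stack on this node only
pcs cluster stop server3

# both services should report "inactive"; they remain enabled for boot
# unless you also run pcs cluster disable
systemctl is-active corosync pacemaker

# on server4: server3 should now be reported as OFFLINE
pcs status
```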

     5. Run pcs status on server3, which should give an error message,
        and on server4, which should show one offline node and one
        online node.
     6. Upgrade server3 to OLE8, which will upgrade these 3 packages to:

        corosync   x86_64   3.1.7-1.el8
        pacemaker  x86_64   2.1.5-9.3.0.1.el8_8
        pcs        x86_64   0.10.15-4.0.1.el8_8.1

     7. Then run crm_verify to check the configuration. If the
        verification is OK, then:
     8. pcs cluster start “server3”
     9. Run pcs status on both nodes.

    Please see the versions of the currently installed software.

    [root@server3 ~]# corosync -v
    Corosync Cluster Engine, version '2.4.5'
    Copyright (c) 2006-2009 Red Hat, Inc.

    [root@server3 ~]# pacemakerd --version
    Pacemaker 1.1.23-1.0.1.el7_9.1
    Written by Andrew Beekhof

    [root@server3 ~]# pcs --version
    0.9.169

    Did I miss anything?

Corosync 3 is not compatible with Corosync 2. So once you update server3 to OLE8, it won't be able to join server4 in the cluster and take over cluster resources.

If you are restricted to two nodes, you may remove server3 from the cluster, update server3 to OLE8 and create a one node cluster on server3. Once you have two one-node clusters, move resources from server4 cluster to server3 cluster manually. Then you destroy cluster on server4, update server4 to OLE8, and add server4 to the new cluster.
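
The hand-over between the two one-node clusters could look like the
sketch below. The resource names come from the pcs status output later
in this mail; the agent and its options are assumptions to be replaced
by your real configuration:

```
# on server4 (old OLE7 one-node cluster): take the postfix resource down
pcs resource disable smtpout-postfix-res-clone

# on server3 (new OLE8 one-node cluster): bring up the equivalent resource
pcs resource create smtpout-postfix-res systemd:postfix clone
pcs status   # the clone should be Started on server3

# once server4 is destroyed, upgraded and rebooted, from server3:
pcs host auth server4
pcs cluster node add server4 --start
```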

Regards,
Tomas

    Kind regards,

    *Qusay Jibrail*

    *From:* Reid Wahl <nw...@redhat.com>
    *Sent:* Tuesday, 3 October 2023 09:03
    *To:* Cluster Labs - All topics related to open-source clustering
    welcomed <users@clusterlabs.org>
    *Cc:* Jibrail, Qusay (GfK) <qusay.jibr...@gfk.com>
    *Subject:* Re: [ClusterLabs] Upgrade to OLE8 + Pacemaker


    On Mon, Oct 2, 2023 at 10:51 PM Jibrail, Qusay (GfK) via Users
    <users@clusterlabs.org> wrote:

        Hello,

        I am aiming to upgrade one of the cluster nodes to OLE8
        (current version OLE7) and test whether postfix is working fine.

        If yes, then upgrade the second node to OLE8.

        My questions:

        Will the Pacemaker configuration work after the upgrade?

    Hi,

    It should. Pacemaker supports rolling upgrades from 1.1.11 (and
    above) to 2.x.x. Other components besides Pacemaker may break, so
    I'd suggest having a backout plan before any upgrade activity.

        Do I need to make any changes before or after the upgrade to OLE8?

        So server3 will be done first and then server4. Is that the
        right order?

        Do I need to stop any services before the upgrade?

    For Pacemaker, follow the procedure at
    https://clusterlabs.org/pacemaker/doc/2.1/Pacemaker_Administration/singlehtml/#upgrading-a-pacemaker-cluster

    Based on your plan, you should focus particularly on the rolling
    upgrade section.

    Refer to your OS vendor's documentation for any other steps you
    should take during an OS upgrade.

        *[root@server3 ~]# pacemakerd --version*
        Pacemaker 1.1.23-1.0.1.el7_9.1
        Written by Andrew Beekhof

        *[root@server3 ~]# pcs status*
        Cluster name: xxxxxxxxxx
        Stack: corosync
        Current DC: *server3* (version 1.1.23-1.0.1.el7_9.1-9acf116022)
        - partition with quorum
        Last updated: Tue Oct  3 07:29:46 2023
        Last change: Sun May  1 17:02:03 2022 by hacluster via crmd on server3
        2 nodes configured
        2 resource instances configured

        Online: [ server3 server4 ]

        Full list of resources:

         Clone Set: smtpout-postfix-res-clone [smtpout-postfix-res]
             Started: [ server3 server4 ]

        Daemon Status:
          corosync: active/enabled
          pacemaker: active/enabled
          pcsd: active/enabled

        *[root@server3 ~]# postconf -d | grep mail_version*
        mail_version = 2.10.1
        milter_macro_v = $mail_name $mail_version

        *[root@server3 ~]# lsb_release -a*
        LSB Version:    :core-4.1-amd64:core-4.1-noarch
        Distributor ID: OracleServer
        Description:    Oracle Linux Server release 7.9
        Release:        7.9
        Codename:       n/a

        Kind regards,

        *Qusay Jibrail*




    --
    Regards,
    Reid Wahl (He/Him)
    Senior Software Engineer, Red Hat
    RHEL High Availability - Pacemaker




Regards,
Miroslav

_______________________________________________
Manage your subscription:
https://lists.clusterlabs.org/mailman/listinfo/users

ClusterLabs home: https://www.clusterlabs.org/
