system shutdown - FIX

lejeczek via Users Fri, 10 Nov 2023 03:27:41 -0800


On 07/11/2023 17:57, lejeczek via Users wrote:

hi guys
Having 3-node pgSQL cluster with PAF - when all threesystems are shutdown at virtually the same time then PAFfails to start when HA cluster is operational again.
from status:
...
Migration Summary:
  * Node: ubusrv2 (2):
* PGSQL-PAF-5433: migration-threshold=1000000fail-count=1000000 last-failure='Tue Nov 7 17:52:38 2023'
  * Node: ubusrv3 (3):
* PGSQL-PAF-5433: migration-threshold=1000000fail-count=1000000 last-failure='Tue Nov 7 17:52:38 2023'
  * Node: ubusrv1 (1):
* PGSQL-PAF-5433: migration-threshold=1000000fail-count=1000000 last-failure='Tue Nov 7 17:52:38 2023'
Failed Resource Actions:
* PGSQL-PAF-5433_stop_0 on ubusrv2 'error' (1): call=90,status='complete', exitreason='Unexpected state forinstance "PGSQL-PAF-5433" (returned 1)',last-rc-change='Tue Nov 7 17:52:38 2023', queued=0ms,exec=84ms * PGSQL-PAF-5433_stop_0 on ubusrv3 'error' (1): call=82,status='complete', exitreason='Unexpected state forinstance "PGSQL-PAF-5433" (returned 1)',last-rc-change='Tue Nov 7 17:52:38 2023', queued=0ms,exec=82ms * PGSQL-PAF-5433_stop_0 on ubusrv1 'error' (1): call=86,status='complete', exitreason='Unexpected state forinstance "PGSQL-PAF-5433" (returned 1)',last-rc-change='Tue Nov 7 17:52:38 2023', queued=0ms,exec=108ms
and all three pgSQLs show virtually identical logs:
...
2023-11-07 16:54:45.532 UTC [24936] LOG: startingPostgreSQL 14.9 (Ubuntu 14.9-0ubuntu0.22.04.1) onx86_64-pc-linux-gnu, compiled by gcc (Ubuntu11.4.0-1ubuntu1~22.04) 11.4.0, 64-bit2023-11-07 16:54:45.532 UTC [24936] LOG: listening onIPv4 address "0.0.0.0", port 54332023-11-07 16:54:45.532 UTC [24936] LOG: listening onIPv6 address "::", port 54332023-11-07 16:54:45.535 UTC [24936] LOG: listening onUnix socket "/var/run/postgresql/.s.PGSQL.5433"2023-11-07 16:54:45.547 UTC [24938] LOG: database systemwas interrupted while in recovery at log time 2023-11-0715:30:56 UTC2023-11-07 16:54:45.547 UTC [24938] HINT: If this hasoccurred more than once some data might be corrupted andyou might need to choose an earlier recovery target.2023-11-07 16:54:45.819 UTC [24938] LOG: entering standbymode2023-11-07 16:54:45.824 UTC [24938] FATAL: could not opendirectory "/var/run/postgresql/14-paf.pg_stat_tmp": Nosuch file or directory2023-11-07 16:54:45.825 UTC [24936] LOG: startup process(PID 24938) exited with exit code 12023-11-07 16:54:45.825 UTC [24936] LOG: aborting startupdue to startup process failure2023-11-07 16:54:45.826 UTC [24936] LOG: database systemis shut down
Is this "test" case's result, as I showed above, expected?It reproduces every time.
If not - what might it be I'm missing?

many thanks, L.

_______________________________________________

to share my "fix" for it - perhaps it was introduced byOS/packages (Ubuntu 22) updates - ? - as oppose to resourceagent itself.

As the logs point out - pg_stat_tmp - is missing and fromwhat I see it's only the master, within a cluster, doingthose stats.That appeared, I use the word for I did not put it intoconfigs, on all nodes.

fix = to not use _pg_stat_tmp_ directive/option at all.

_______________________________________________
Manage your subscription:
https://lists.clusterlabs.org/mailman/listinfo/users

ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] PAF / pgSQL fails after OS/system shutdown - FIX

Reply via email to