Using ocf-tester I get:

ocf-tester -n pgsql -o repuser="ha" -o pgdba="postgres" -o
restart_on_promote="true" -o pgctl="/usr/lib/postgresql/9.1/bin/pg_ctl" -o
psql="/usr/lib/postgresql/9.1/bin/psql" -o
pgdata="/var/lib/postgresql/9.1/main/" -o
config="/etc/postgresql/9.1/main/postgresql.conf" -o rep_mode="async" -o
node_list="pz01 pz02" -o restore_command="cp
/var/lib/postgresql/9.1/main/archive/%f %p" -o
primary_conninfo_opt="keepalives_idle=60 keepalives_interval=5
keepalives_count=5" -o master_ip="10.10.10.80" -o stop_escalate="0"
/usr/lib/ocf/resource.d/heartbeat/pgsql
Beginning tests for /usr/lib/ocf/resource.d/heartbeat/pgsql...
/usr/sbin/ocf-tester: 268: export: /var/lib/postgresql/9.1/main/archive/%f:
bad variable name

Is this the reason why I get `invalid parameter` ? Do you know what is
wrong there?

And without restore_command:

ocf-tester -n msPgsql -o repuser="ha" -o pgdba="postgres" -o
restart_on_promote="true" -o pgctl="/usr/lib/postgresql/9.1/bin/pg_ctl" -o
psql="/usr/lib/postgresql/9.1/bin/psql" -o
pgdata="/var/lib/postgresql/9.1/main/" -o
config="/etc/postgresql/9.1/main/postgresql.conf" -o rep_mode="async" -o
node_list="pz01 pz02" -o primary_conninfo_opt="keepalives_idle=60
keepalives_interval=5 keepalives_count=5" -o master_ip="10.10.10.80" -o
stop_escalate="0" /usr/lib/ocf/resource.d/heartbeat/pgsql
Beginning tests for /usr/lib/ocf/resource.d/heartbeat/pgsql...
* rc=6: Validation failed.  Did you supply enough options with -o ?
pgsql[57665]: ERROR: Replication(rep_mode=async or sync) requires
Master/Slave configuration.
Aborting tests

 Thanks in advanced

2014-09-11 12:07 GMT-03:00 Federico Castro <fcaste...@gmail.com>:

> Takehiro, Ulrich, thanks for your responses.
> I have some interesting information.
>
> If I only run pz02, then it keeps on starting as Postgres Slave, it is not
> promoted to Master:
>
> Online: [ pz02 ]
> OFFLINE: [ pz01 ]
>
> Full list of resources:
>
>  Master/Slave Set: msPostgresql [pgsql]
>      Slaves: [ pz02 ]
>      Stopped: [ pgsql:1 ]
>  Resource Group: master-group
>      vip-master (ocf::heartbeat:IPaddr2):       Stopped
>      vip-rep    (ocf::heartbeat:IPaddr2):       Stopped
>
> Node Attributes:
> * Node pz02:
>     + master-pgsql:0                    : -INFINITY
>     + pgsql-data-status                 : DISCONNECT
>     + pgsql-status                      : HS:alone
>
>
> Logs:
>
> Sep 11 11:36:00 pz02 pengine: [42962]: notice: unpack_config: On loss of
> CCM Quorum: Ignore
> Sep 11 11:36:00 pz02 pengine: [42962]: notice: LogActions: Start   pgsql:0
> (pz02)
> Sep 11 11:36:00 pz02 crmd: [42963]: notice: do_state_transition: State
> transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS
> cause=C_IPC_MESSAGE origin=handle_response ]
> Sep 11 11:36:00 pz02 crmd: [42963]: info: do_te_invoke: Processing graph 0
> (ref=pe_calc-dc-1410446160-7) derived from
> /var/lib/pengine/pe-input-3875.bz2
> Sep 11 11:36:00 pz02 crmd: [42963]: info: te_rsc_command: Initiating
> action 4: monitor pgsql:0_monitor_0 on pz02 (local)
> Sep 11 11:36:00 pz02 lrmd: [42960]: info: rsc:pgsql:0 probe[2] (pid 42970)
> Sep 11 11:36:00 pz02 pengine: [42962]: notice: process_pe_message:
> Transition 0: PEngine Input stored in: /var/lib/pengine/pe-input-3875.bz2
> Sep 11 11:36:00 pz02 crmd: [42963]: info: te_rsc_command: Initiating
> action 5: monitor vip-master_monitor_0 on pz02 (local)
> Sep 11 11:36:00 pz02 lrmd: [42960]: info: rsc:vip-master probe[3] (pid
> 42972)
> Sep 11 11:36:00 pz02 crmd: [42963]: info: te_rsc_command: Initiating
> action 6: monitor vip-rep_monitor_0 on pz02 (local)
> Sep 11 11:36:00 pz02 lrmd: [42960]: info: rsc:vip-rep probe[4] (pid 42973)
> Sep 11 11:36:00 pz02 lrmd: [42960]: info: operation monitor[3] on
> vip-master for client 42963: pid 42972 exited with return code 7
> Sep 11 11:36:00 pz02 lrmd: [42960]: info: operation monitor[4] on vip-rep
> for client 42963: pid 42973 exited with return code 7
> Sep 11 11:36:00 pz02 crmd: [42963]: info: process_lrm_event: LRM operation
> vip-master_monitor_0 (call=3, rc=7, cib-update=26, confirmed=true) not
> running
> Sep 11 11:36:00 pz02 crmd: [42963]: info: process_lrm_event: LRM operation
> vip-rep_monitor_0 (call=4, rc=7, cib-update=27, confirmed=true) not running
> Sep 11 11:36:00 pz02 lrmd: [42960]: info: operation monitor[2] on pgsql:0
> for client 42963: pid 42970 exited with return code 7
> Sep 11 11:36:00 pz02 crmd: [42963]: info: process_lrm_event: LRM operation
> pgsql:0_monitor_0 (call=2, rc=7, cib-update=28, confirmed=true) not running
> Sep 11 11:36:00 pz02 crmd: [42963]: info: te_rsc_command: Initiating
> action 3: probe_complete probe_complete on pz02 (local) - no waiting
> Sep 11 11:36:00 pz02 crmd: [42963]: info: te_rsc_command: Initiating
> action 7: start pgsql:0_start_0 on pz02 (local)
> Sep 11 11:36:00 pz02 attrd: [42961]: notice: attrd_trigger_update: Sending
> flush op to all hosts for: probe_complete (true)
> Sep 11 11:36:00 pz02 attrd: [42961]: notice: attrd_perform_update: Sent
> update 4: probe_complete=true
> Sep 11 11:36:00 pz02 lrmd: [42960]: info: rsc:pgsql:0 start[5] (pid 43082)
> Sep 11 11:36:00 pz02 crm_attribute: [43131]: info: Invoked:
> /usr/sbin/crm_attribute -l reboot -N pz02 -n pgsql-status -v STOP
> Sep 11 11:36:00 pz02 attrd: [42961]: notice: attrd_trigger_update: Sending
> flush op to all hosts for: pgsql-status (STOP)
> Sep 11 11:36:00 pz02 attrd: [42961]: notice: attrd_perform_update: Sent
> update 7: pgsql-status=STOP
> Sep 11 11:36:00 pz02 crmd: [42963]: info: abort_transition_graph:
> te_update_diff:176 - Triggered transition abort (complete=0, tag=nvpair,
> id=status-pz02-pgsql-status, name=pgsql-status, value=STOP, magic=NA,
> cib=0.287.18) : Transient attribute: update
> Sep 11 11:36:00 pz02 crm_attribute: [43132]: info: Invoked:
> /usr/sbin/crm_attribute -l reboot -N pz02 -n pgsql-master-baseline -D
> Sep 11 11:36:00 pz02 crm_attribute: [43136]: info: Invoked: crm_attribute
> -N pz02 -n master-pgsql:0 -l reboot -v -INFINITY
> Sep 11 11:36:00 pz02 attrd: [42961]: notice: attrd_trigger_update: Sending
> flush op to all hosts for: master-pgsql:0 (-INFINITY)
> Sep 11 11:36:00 pz02 attrd: [42961]: notice: attrd_perform_update: Sent
> update 10: master-pgsql:0=-INFINITY
> Sep 11 11:36:00 pz02 crmd: [42963]: info: abort_transition_graph:
> te_update_diff:176 - Triggered transition abort (complete=0, tag=nvpair,
> id=status-pz02-master-pgsql.0, name=master-pgsql:0, value=-INFINITY,
> magic=NA, cib=0.287.19) : Transient attribute: update
> Sep 11 11:36:00 pz02 crm_attribute: [43146]: info: Invoked:
> /usr/sbin/crm_attribute -l reboot -N pz02 -n pgsql-xlog-loc -D
> Sep 11 11:36:00 pz02 lrmd: [42960]: info: RA output:
> (pgsql:0:start:stderr) psql: could not connect to server: No such file or
> directory
> Is the server running locally and accepting
> connections on Unix domain socket "/var/run/postgresql/.s.PGSQL.5432"?
>
>
> Sep 11 11:36:02 pz02 crm_attribute: [43259]: info: Invoked:
> /usr/sbin/crm_attribute -l reboot -N pz02 -n pgsql-status -v HS:alone
> Sep 11 11:36:02 pz02 attrd: [42961]: notice: attrd_trigger_update: Sending
> flush op to all hosts for: pgsql-status (HS:alone)
> Sep 11 11:36:02 pz02 lrmd: [42960]: info: operation start[5] on pgsql:0
> for client 42963: pid 43082 exited with return code 0
> Sep 11 11:36:02 pz02 attrd: [42961]: notice: attrd_perform_update: Sent
> update 12: pgsql-status=HS:alone
> Sep 11 11:36:02 pz02 crmd: [42963]: info: process_lrm_event: LRM operation
> pgsql:0_start_0 (call=5, rc=0, cib-update=29, confirmed=true) ok
> Sep 11 11:36:02 pz02 crmd: [42963]: info: abort_transition_graph:
> te_update_diff:176 - Triggered transition abort (complete=0, tag=nvpair,
> id=status-pz02-pgsql-status, name=pgsql-status, value=HS:alone, magic=NA,
> cib=0.287.20) : Transient attribute: update
> Sep 11 11:36:02 pz02 crmd: [42963]: info: te_rsc_command: Initiating
> action 46: notify pgsql:0_post_notify_start_0 on pz02 (local)
> Sep 11 11:36:02 pz02 lrmd: [42960]: info: rsc:pgsql:0 notify[6] (pid 43260)
> Sep 11 11:36:02 pz02 lrmd: [42960]: info: operation notify[6] on pgsql:0
> for client 42963: pid 43260 exited with return code 0
> Sep 11 11:36:02 pz02 crmd: [42963]: info: process_lrm_event: LRM operation
> pgsql:0_notify_0 (call=6, rc=0, cib-update=0, confirmed=true) ok
> Sep 11 11:36:02 pz02 crmd: [42963]: notice: run_graph: ==== Transition 0
> (Complete=13, Pending=0, Fired=0, Skipped=1, Incomplete=0,
> Source=/var/lib/pengine/pe-input-3875.bz2): Stopped
> But it is correctly DC: [42963]: info: update_dc: Set DC to pz02 (3.0.6)
>
>
>
>
>
> *I’ve found that using "rep_mode=sync” with my actual configuration then I
> get “unknown error” when pgsl try to start Postgres:*
>
>
>
> Sep 11 11:50:12 pz01 pengine: [54416]: notice: LogActions: Start   pgsql:0
> (pz01)
> Sep 11 11:50:12 pz01 crmd: [54417]: notice: do_state_transition: State
> transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS
> cause=C_IPC_MESSAGE origin=handle_response ]
> Sep 11 11:50:12 pz01 crmd: [54417]: info: do_te_invoke: Processing graph 0
> (ref=pe_calc-dc-1410450612-7) derived from
> /var/lib/pengine/pe-input-1198.bz2
> Sep 11 11:50:12 pz01 crmd: [54417]: info: te_rsc_command: Initiating
> action 4: monitor pgsql:0_monitor_0 on pz01 (local)
> Sep 11 11:50:12 pz01 lrmd: [54414]: info: rsc:pgsql:0 probe[2] (pid 54424)
> Sep 11 11:50:12 pz01 crmd: [54417]: info: te_rsc_command: Initiating
> action 5: monitor vip-master_monitor_0 on pz01 (local)
> Sep 11 11:50:12 pz01 lrmd: [54414]: info: rsc:vip-master probe[3] (pid
> 54425)
> Sep 11 11:50:12 pz01 crmd: [54417]: info: te_rsc_command: Initiating
> action 6: monitor vip-rep_monitor_0 on pz01 (local)
> Sep 11 11:50:12 pz01 lrmd: [54414]: info: rsc:vip-rep probe[4] (pid 54427)
> Sep 11 11:50:12 pz01 pengine: [54416]: notice: process_pe_message:
> Transition 0: PEngine Input stored in: /var/lib/pengine/pe-input-1198.bz2
> Sep 11 11:50:12 pz01 lrmd: [54414]: info: operation monitor[3] on
> vip-master for client 54417: pid 54425 exited with return code 7
> Sep 11 11:50:12 pz01 crmd: [54417]: info: process_lrm_event: LRM operation
> vip-master_monitor_0 (call=3, rc=7, cib-update=26, confirmed=true) not
> running
> Sep 11 11:50:12 pz01 lrmd: [54414]: info: operation monitor[4] on vip-rep
> for client 54417: pid 54427 exited with return code 7
> Sep 11 11:50:12 pz01 crmd: [54417]: info: process_lrm_event: LRM operation
> vip-rep_monitor_0 (call=4, rc=7, cib-update=27, confirmed=true) not running
> Sep 11 11:50:12 pz01 lrmd: [54414]: info: operation monitor[2] on pgsql:0
> for client 54417: pid 54424 exited with return code 7
> Sep 11 11:50:12 pz01 crmd: [54417]: info: process_lrm_event: LRM operation
> pgsql:0_monitor_0 (call=2, rc=7, cib-update=28, confirmed=true) not running
> Sep 11 11:50:12 pz01 crmd: [54417]: info: te_rsc_command: Initiating
> action 3: probe_complete probe_complete on pz01 (local) - no waiting
> Sep 11 11:50:12 pz01 crmd: [54417]: info: te_rsc_command: Initiating
> action 7: start pgsql:0_start_0 on pz01 (local)
> Sep 11 11:50:12 pz01 attrd: [54415]: notice: attrd_trigger_update: Sending
> flush op to all hosts for: probe_complete (true)
> Sep 11 11:50:12 pz01 lrmd: [54414]: info: rsc:pgsql:0 start[5] (pid 54537)
> Sep 11 11:50:12 pz01 attrd: [54415]: notice: attrd_perform_update: Sent
> update 4: probe_complete=true
> Sep 11 11:50:12 pz01 crm_attribute: [54586]: info: Invoked:
> /usr/sbin/crm_attribute -l reboot -N pz01 -n pgsql-status -v STOP
> Sep 11 11:50:12 pz01 attrd: [54415]: notice: attrd_trigger_update: Sending
> flush op to all hosts for: pgsql-status (STOP)
> Sep 11 11:50:12 pz01 attrd: [54415]: notice: attrd_perform_update: Sent
> update 7: pgsql-status=STOP
> Sep 11 11:50:12 pz01 crmd: [54417]: info: abort_transition_graph:
> te_update_diff:176 - Triggered transition abort (complete=0, tag=nvpair,
> id=status-pz01-pgsql-status, name=pgsql-status, value=STOP, magic=NA,
> cib=0.330.18) : Transient attribute: update
> Sep 11 11:50:12 pz01 crm_attribute: [54587]: info: Invoked:
> /usr/sbin/crm_attribute -l reboot -N pz01 -n pgsql-master-baseline -D
> Sep 11 11:50:12 pz01 crm_attribute: [54591]: info: Invoked: crm_attribute
> -N pz01 -n master-pgsql:0 -l reboot -v -INFINITY
> Sep 11 11:50:12 pz01 attrd: [54415]: notice: attrd_trigger_update: Sending
> flush op to all hosts for: master-pgsql:0 (-INFINITY)
> Sep 11 11:50:12 pz01 attrd: [54415]: notice: attrd_perform_update: Sent
> update 10: master-pgsql:0=-INFINITY
> Sep 11 11:50:12 pz01 crmd: [54417]: info: abort_transition_graph:
> te_update_diff:176 - Triggered transition abort (complete=0, tag=nvpair,
> id=status-pz01-master-pgsql.0, name=master-pgsql:0, value=-INFINITY,
> magic=NA, cib=0.330.19) : Transient attribute: update
> Sep 11 11:50:12 pz01 crm_attribute: [54601]: info: Invoked:
> /usr/sbin/crm_attribute -l reboot -N pz01 -n pgsql-xlog-loc -D
> *Sep 11 11:50:12 pz01 lrmd: [54414]: info: operation start[5] on pgsql:0
> for client 54417: pid 54537 exited with return code 1*
> *Sep 11 11:50:12 pz01 crmd: [54417]: info: process_lrm_event: LRM
> operation pgsql:0_start_0 (call=5, rc=1, cib-update=29, confirmed=true)
> unknown error*
> Sep 11 11:50:12 pz01 crmd: [54417]: WARN: status_from_rc: Action 7
> (pgsql:0_start_0) on pz01 failed (target: 0 vs. rc: 1): Error
> Sep 11 11:50:12 pz01 crmd: [54417]: WARN: update_failcount: Updating
> failcount for pgsql:0 on pz01 after failed start: rc=1 (update=INFINITY,
> time=1410450612)
> Sep 11 11:50:12 pz01 crmd: [54417]: info: abort_transition_graph:
> match_graph_event:277 - Triggered transition abort (complete=0,
> tag=lrm_rsc_op, id=pgsql:0_last_failure_0,
> magic=0:1;7:0:0:af61405d-b520-4e01-8f85-0443bd508a04, cib=0.330.20) : Event
> failed
> Sep 11 11:50:12 pz01 crmd: [54417]: info: te_rsc_command: Initiating
> action 46: notify pgsql:0_post_notify_start_0 on pz01 (local)
> Sep 11 11:50:12 pz01 attrd: [54415]: notice: attrd_trigger_update: Sending
> flush op to all hosts for: fail-count-pgsql:0 (INFINITY)
> Sep 11 11:50:12 pz01 lrmd: [54414]: info: rsc:pgsql:0 notify[6] (pid 54616)
> Sep 11 11:50:12 pz01 attrd: [54415]: notice: attrd_perform_update: Sent
> update 13: fail-count-pgsql:0=INFINITY
> Sep 11 11:50:12 pz01 crmd: [54417]: info: abort_transition_graph:
> te_update_diff:176 - Triggered transition abort (complete=0, tag=nvpair,
> id=status-pz01-fail-count-pgsql.0, name=fail-count-pgsql:0, value=INFINITY,
> magic=NA, cib=0.330.21) : Transient attribute: update
> Sep 11 11:50:12 pz01 attrd: [54415]: notice: attrd_trigger_update: Sending
> flush op to all hosts for: last-failure-pgsql:0 (1410450612)
> Sep 11 11:50:12 pz01 attrd: [54415]: notice: attrd_perform_update: Sent
> update 16: last-failure-pgsql:0=1410450612
> Sep 11 11:50:12 pz01 crmd: [54417]: info: abort_transition_graph:
> te_update_diff:176 - Triggered transition abort (complete=0, tag=nvpair,
> id=status-pz01-last-failure-pgsql.0, name=last-failure-pgsql:0,
> value=1410450612, magic=NA, cib=0.330.22) : Transient attribute: update
> Sep 11 11:50:12 pz01 lrmd: [54414]: info: operation notify[6] on pgsql:0
> for client 54417: pid 54616 exited with return code 0
> Sep 11 11:50:12 pz01 crmd: [54417]: info: process_lrm_event: LRM operation
> pgsql:0_notify_0 (call=6, rc=0, cib-update=0, confirmed=true) ok
> Sep 11 11:50:12 pz01 crmd: [54417]: notice: run_graph: ==== Transition 0
> (Complete=13, Pending=0, Fired=0, Skipped=1, Incomplete=0,
> Source=/var/lib/pengine/pe-input-1198.bz2): Stopped
> Sep 11 11:50:12 pz01 crmd: [54417]: notice: do_state_transition: State
> transition S_TRANSITION_ENGINE -> S_POLICY_ENGINE [ input=I_PE_CALC
> cause=C_FSA_INTERNAL origin=notify_crmd ]
> Sep 11 11:50:12 pz01 pengine: [54416]: notice: unpack_config: On loss of
> CCM Quorum: Ignore
> Sep 11 11:50:12 pz01 pengine: [54416]: WARN: unpack_rsc_op: Processing
> failed op pgsql:0_last_failure_0 on pz01: unknown error (1)
> Sep 11 11:50:12 pz01 pengine: [54416]: WARN: common_apply_stickiness:
> Forcing msPostgresql away from pz01 after 1000000 failures (max=1)
> Sep 11 11:50:12 pz01 pengine: [54416]: WARN: common_apply_stickiness:
> Forcing msPostgresql away from pz01 after 1000000 failures (max=1)
> Sep 11 11:50:12 pz01 pengine: [54416]: notice: LogActions: Stop    pgsql:0
> (pz01)
>
>
>
>
> *But If I change to “rep_mode=async” then I get  'invalid parameter’ from *
> *pgsql:0_monitor_0*
>
> Sep 11 11:54:27 pz01 pengine: [54884]: notice: LogActions: Start   pgsql:0
> (pz01)
> Sep 11 11:54:27 pz01 crmd: [54885]: notice: do_state_transition: State
> transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS
> cause=C_IPC_MESSAGE origin=handle_response ]
> Sep 11 11:54:27 pz01 crmd: [54885]: info: do_te_invoke: Processing graph 0
> (ref=pe_calc-dc-1410450867-7) derived from
> /var/lib/pengine/pe-input-1202.bz2
> Sep 11 11:54:27 pz01 crmd: [54885]: info: te_rsc_command: Initiating
> action 4: monitor pgsql:0_monitor_0 on pz01 (local)
> Sep 11 11:54:27 pz01 lrmd: [54882]: info: rsc:pgsql:0 probe[2] (pid 54892)
> Sep 11 11:54:27 pz01 pengine: [54884]: notice: process_pe_message:
> Transition 0: PEngine Input stored in: /var/lib/pengine/pe-input-1202.bz2
> Sep 11 11:54:27 pz01 crmd: [54885]: info: te_rsc_command: Initiating
> action 5: monitor vip-master_monitor_0 on pz01 (local)
> Sep 11 11:54:27 pz01 lrmd: [54882]: info: rsc:vip-master probe[3] (pid
> 54894)
> Sep 11 11:54:27 pz01 crmd: [54885]: info: te_rsc_command: Initiating
> action 6: monitor vip-rep_monitor_0 on pz01 (local)
> Sep 11 11:54:27 pz01 lrmd: [54882]: info: rsc:vip-rep probe[4] (pid 54895)
> Sep 11 11:54:27 pz01 lrmd: [54882]: info: operation monitor[3] on
> vip-master for client 54885: pid 54894 exited with return code 7
> Sep 11 11:54:27 pz01 crmd: [54885]: info: process_lrm_event: LRM operation
> vip-master_monitor_0 (call=3, rc=7, cib-update=26, confirmed=true) not
> running
> Sep 11 11:54:27 pz01 lrmd: [54882]: info: operation monitor[4] on vip-rep
> for client 54885: pid 54895 exited with return code 7
> Sep 11 11:54:27 pz01 crmd: [54885]: info: process_lrm_event: LRM operation
> vip-rep_monitor_0 (call=4, rc=7, cib-update=27, confirmed=true) not running
> *Sep 11 11:54:27 pz01 lrmd: [54882]: info: RA output:
> (pgsql:0:probe:stderr) /usr/lib/ocf/resource.d//heartbeat/pgsql: 1646:
> /usr/lib/ocf/resource.d//heartbeat/pgsql: Bad substitution*
>
> *Sep 11 11:54:27 pz01 lrmd: [54882]: info: operation monitor[2] on pgsql:0
> for client 54885: pid 54892 exited with return code 2*
> *Sep 11 11:54:27 pz01 crmd: [54885]: info: process_lrm_event: LRM
> operation pgsql:0_monitor_0 (call=2, rc=2, cib-update=28, confirmed=true)
> invalid parameter*
> *Sep 11 11:54:27 pz01 crmd: [54885]: WARN: status_from_rc: Action 4
> (pgsql:0_monitor_0) on pz01 failed (target: 7 vs. rc: 2): Error*
> Sep 11 11:54:27 pz01 crmd: [54885]: info: abort_transition_graph:
> match_graph_event:277 - Triggered transition abort (complete=0,
> tag=lrm_rsc_op, id=pgsql:0_last_failure_0,
> magic=0:2;4:0:7:58ccffc4-db11-44a2-aa1a-bc79cd4924ae, cib=0.331.16) : Event
> failed
> Sep 11 11:54:27 pz01 crmd: [54885]: info: te_rsc_command: Initiating
> action 3: probe_complete probe_complete on pz01 (local) - no waiting
> Sep 11 11:54:27 pz01 crmd: [54885]: notice: run_graph: ==== Transition 0
> (Complete=7, Pending=0, Fired=0, Skipped=3, Incomplete=4,
> Source=/var/lib/pengine/pe-input-1202.bz2): Stopped
> Sep 11 11:54:27 pz01 crmd: [54885]: notice: do_state_transition: State
> transition S_TRANSITION_ENGINE -> S_POLICY_ENGINE [ input=I_PE_CALC
> cause=C_FSA_INTERNAL origin=notify_crmd ]
> Sep 11 11:54:27 pz01 attrd: [54883]: notice: attrd_trigger_update: Sending
> flush op to all hosts for: probe_complete (true)
> Sep 11 11:54:27 pz01 pengine: [54884]: notice: unpack_config: On loss of
> CCM Quorum: Ignore
> Sep 11 11:54:27 pz01 pengine: [54884]: ERROR: unpack_rsc_op: Preventing
> msPostgresql from re-starting on pz01: operation monitor failed 'invalid
> parameter' (rc=2)
> Sep 11 11:54:27 pz01 crmd: [54885]: notice: do_state_transition: State
> transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS
> cause=C_IPC_MESSAGE origin=handle_response ]
> Sep 11 11:54:27 pz01 crmd: [54885]: info: do_te_invoke: Processing graph 1
> (ref=pe_calc-dc-1410450867-12) derived from
> /var/lib/pengine/pe-input-1203.bz2
> Sep 11 11:54:27 pz01 crmd: [54885]: info: te_rsc_command: Initiating
> action 3: probe_complete probe_complete on pz01 (local) - no waiting
> Sep 11 11:54:27 pz01 crmd: [54885]: notice: run_graph: ==== Transition 1
> (Complete=1, Pending=0, Fired=0, Skipped=0, Incomplete=0,
> Source=/var/lib/pengine/pe-input-1203.bz2): Complete
> Sep 11 11:54:27 pz01 crmd: [54885]: notice: do_state_transition: State
> transition S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS
> cause=C_FSA_INTERNAL origin=notify_crmd ]
> Sep 11 11:54:27 pz01 attrd: [54883]: notice: attrd_perform_update: Sent
> update 4: probe_complete=true
> Sep 11 11:54:27 pz01 pengine: [54884]: notice: process_pe_message:
> Transition 1: PEngine Input stored in: /var/lib/pengine/pe-input-1203.bz2
> Sep 11 11:54:27 pz01 attrd: [54883]: notice: attrd_trigger_update: Sending
> flush op to all hosts for: probe_complete (true)
> Sep 11 11:54:27 pz01 attrd: [54883]: notice: attrd_perform_update: Sent
> update 6: probe_complete=true
>
>
>
> *Some questions:*
>
> 1. Could my issue be related with some erroneous postgres configuration?
> (I’ll check my postgres conf and I’ll post it)
> 2. How can I debug values from pgsl?
>     For example to check values in pgsl:L1646
>          sed -i "/${rep_mode_string//\//\\/}/d" $OCF_RESKEY_config
>
>     If I use:  *ocf_log err "$OCF_RESKEY_config”*   in pgsql
>     Where do I have to check this print? Because I’m not seeing it in
> corosync.log.
>     Do I have to enable anything else?
>
> Thanks for your help!!
>
>
>
>
> 2014-09-11 3:06 GMT-03:00 Ulrich Windl <ulrich.wi...@rz.uni-regensburg.de>
> :
>
> >>> Takehiro Matsushima <takehiro.dream...@gmail.com> schrieb am
>> 11.09.2014 um
>> 01:15 in Nachricht
>> <caabhu_dtkdgfj425j3wwruu7ex_oxs--cmcia-ugvjcnvqh...@mail.gmail.com>:
>> > Hi.
>> >
>> > Let me ask some things,
>> >
>> > Can pz01 run under pacemker, without pz02?
>> > Something differed from running only pz02?
>> > Could you find some hints in the logs of each postgresql(and
>> pgstartup.log)?
>>
>> Sometimes it's handy to write a script for ocf-tester (many params to
>> pass) and run it in verbose and RA debugging mode (hoping the RA says
>> something reasonable). But stop the RA in the cluster first. Try the script
>> on both nodes looking for differences.
>>
>> >
>> > Good-luck,
>> >
>> > Takehiro Matsushima
>> > _______________________________________________
>> > Linux-HA mailing list
>> > Linux-HA@lists.linux-ha.org
>> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
>> > See also: http://linux-ha.org/ReportingProblems
>>
>>
>>
>> _______________________________________________
>> Linux-HA mailing list
>> Linux-HA@lists.linux-ha.org
>> http://lists.linux-ha.org/mailman/listinfo/linux-ha
>> See also: http://linux-ha.org/ReportingProblems
>>
>
>
_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to