On 24/02/2011 18:17, Micka wrote:
> Yes it was, thanks!
>
> Now the replication functionality works well!
>
> But when the main database server goes down I get:
>
> 2011-02-24 18:06:42 DEBUG: pid 1313: starting health checking
> 2011-02-24 18:06:42 DEBUG: pid 1313: health_check: 0 th DB node status: 2
> 2011-02-24 18:06:42 DEBUG: pid 1313: health_check: 1 th DB node status: 2
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: T
> 2011-02-24 18:06:44 DEBUG: pid 1347: num_fileds: 1
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: D
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: C
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: Z
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: T
> 2011-02-24 18:06:44 DEBUG: pid 1347: num_fileds: 1
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: D
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: C
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: Z
> 2011-02-24 18:06:52 DEBUG: pid 1313: starting health checking
> 2011-02-24 18:06:52 DEBUG: pid 1313: health_check: 0 th DB node status: 2
> 2011-02-24 18:06:52 DEBUG: pid 1313: health_check: 1 th DB node status: 2
> 2011-02-24 18:06:54 ERROR: pid 1347: pool_flush_it: write failed to backend (0). reason: Broken pipe offset: 0 wlen: 39
> 2011-02-24 18:06:54 ERROR: pid 1347: check_replication_time_lag: SELECT pg_current_xlog_location() failed
> 2011-02-24 18:07:02 DEBUG: pid 1313: starting health checking
> 2011-02-24 18:07:02 DEBUG: pid 1313: health_check: 0 th DB node status: 2
> 2011-02-24 18:07:02 ERROR: pid 1313: connect_unix_domain_socket_by_port: connect() failed: No such file or directory
> 2011-02-24 18:07:02 ERROR: pid 1313: health check failed. 0 th host at port 5432 is down
> 2011-02-24 18:07:02 LOG: pid 1313: set 0 th backend down status
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler called
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: starting to select new master node
> 2011-02-24 18:07:02 LOG: pid 1313: starting degeneration. shutdown host (5432)
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1314
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1315
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1316
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1317
>
> and:
>
> 011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1334
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1335
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1336
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1337
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1338
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1339
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1340
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1341
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1342
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1343
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1344
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1345
> 2011-02-24 18:07:02 LOG: pid 1313: execute command: /usr/local/etc/failover.sh 0 "" 5432 /usr/local/pgsql/data 1 0 "" 0
> 2011-02-24 18:07:02 DEBUG: pid 1314: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 1315: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 1316: child received shutdown request signal 3
> *touch: cannot touch `/var/log/pgpool/trigger/trigger_file1': Permission denied*
> 2011-02-24 18:07:02 LOG: pid 1313: failover_handler: set new master node: 1
> 2011-02-24 18:07:02 DEBUG: pid 1317: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 1318: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 1319: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 1320: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 12178: I am 12178
> 2011-02-24 18:07:02 DEBUG: pid 12179: I am 12179
> 2011-02-24 18:07:02 DEBUG: pid 12180: I am 12180
> 2011-02-24 18:07:02 DEBUG: pid 12177: I am 12177
> 2011-02-24 18:07:02 DEBUG: pid 12181: I am 12181
> 2011-02-24 18:07:02 DEBUG: pid 1322: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 12176: I am 12176
>
> 011-02-24 18:07:02 DEBUG: pid 12205: I am 12205
> 2011-02-24 18:07:02 DEBUG: pid 1338: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 12206: I am 12206
> 2011-02-24 18:07:02 LOG: pid 1313: failover done. shutdown host (5432)
> 2011-02-24 18:07:02 DEBUG: pid 1339: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 1313: pool_ssl: SSL requested but SSL support is not available
> 2011-02-24 18:07:02 DEBUG: pid 1340: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 1341: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: auth kind: 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: backend key data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: transaction state: I
> 2011-02-24 18:07:02 DEBUG: pid 1342: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 1345: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 1344: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 1343: child received shutdown request signal 3
> 2011-02-24 18:07:02 DEBUG: pid 1313: do_query: kind: T
> 2011-02-24 18:07:02 DEBUG: pid 1313: num_fileds: 1
> 2011-02-24 18:07:02 DEBUG: pid 1313: do_query: kind: D
> 2011-02-24 18:07:02 DEBUG: pid 1313: do_query: kind: C
> 2011-02-24 18:07:02 DEBUG: pid 1313: do_query: kind: Z
> 2011-02-24 18:07:02 LOG: pid 1313: find_primary_node: primary node id is 1
>
>
> 2011-02-24 18:07:02 DEBUG: pid 1313: child 1340 exits with status 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: child 1341 exits with status 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: child 1342 exits with status 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: child 1343 exits with status 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: child 1344 exits with status 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: child 1345 exits with status 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: reap_handler: normally exited
> 2011-02-24 18:07:12 DEBUG: pid 1313: starting health checking
> 2011-02-24 18:07:12 DEBUG: pid 1313: health_check: 0 th DB node status: 3
> 2011-02-24 18:07:12 DEBUG: pid 1313: health_check: 1 th DB node status: 2
> 2011-02-24 18:07:22 DEBUG: pid 1313: starting health checking
>
>
> What does it mean?
>
> In pgpoolAdmin, it says:
>
> 5432   Down                                             0.500
> 5433   Up. Connected. Running as standby server         0.500
>
> Shouldn't 5433 take the primary role?
>
Well, "running as standby server" says it all: it didn't take the primary role. The reason is that the failover script couldn't create the trigger file (/var/log/pgpool/trigger/trigger_file1), as the "touch: cannot touch ... Permission denied" line in your log shows. So you first need to fix this.

-- 
Guillaume
http://www.postgresql.fr
http://dalibo.com
_______________________________________________
Pgpool-general mailing list
http://pgfoundry.org/mailman/listinfo/pgpool-general
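
For illustration, here is a minimal sketch of a trigger-file step that fails loudly. In the log above, pgpool reports "failover done" even though touch failed, so the problem is easy to miss. This is not the failover.sh from this thread, which was never posted. The function name create_trigger and its return codes are made up for the example; only the trigger path comes from the log.

```shell
#!/bin/sh
# Hypothetical helper, NOT the poster's actual failover.sh.
# create_trigger PATH
#   Creates the trigger file at PATH.
#   Returns 1 if the parent directory does not exist,
#   2 if the current user cannot write to it,
#   and otherwise the exit status of touch.
create_trigger() {
    trigger_file=$1
    trigger_dir=$(dirname "$trigger_file")

    if [ ! -d "$trigger_dir" ]; then
        echo "failover: directory $trigger_dir does not exist" >&2
        return 1
    fi

    if [ ! -w "$trigger_dir" ]; then
        # Name the user so the permission problem is obvious in the pgpool log.
        echo "failover: user $(id -un) cannot write to $trigger_dir" >&2
        return 2
    fi

    touch "$trigger_file"
}
```

Called from a failover script as `create_trigger /var/log/pgpool/trigger/trigger_file1 || exit 1`, a permission problem would show up as an explicit message naming the user and the directory. Assuming the standby is promoted through the trigger_file setting in its recovery.conf, the path given here has to be the same one configured there, and the directory has to be writable by the user pgpool runs as.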
