On Thu, Nov 22, 2012 at 8:25 AM, Andres Freund <and...@2ndquadrant.com>wrote:

> I really don't understand whats going on here then. Youve said you made
> sure that there is a catalog snapshot. Which means you would need
> something like:
> WARNING:  connecting to postgres
> WARNING:  Initiating logical rep
> LOG:  computed new xmin: 16566894
> LOG:  start reading from 3/E62457C0, scrolled back to 3/E6244000
> LOG:  found initial snapshot (via running xacts). Done: 1
> WARNING:  reached consistent point, stopping!
> WARNING:  Starting logical replication
> LOG:  start reading from 3/E62457C0, scrolled back to 3/E6244000
> LOG:  found initial snapshot (via running xacts). Done: 1
>
> in the log *and* it means that snapbuild->state has to be
> CONSISTENT. But the backtrace youve posted:
>
> #3  0x000000000070c409 in SnapBuildTxnIsRunning
> (snapstate=0x19e4f10,xid=0) at snapbuild.c:877
> #4  0x000000000070b8e4 in SnapBuildProcessChange
> (reorder=0x19e4e80,snapstate=0x19e4f10, xid=0, buf=0x1a0a368,
> relfilenode=0x1a0a450) at snapbuild.c:388
> #5  0x000000000070c088 in SnapBuildDecodeCallback
> (reorder=0x19e4e80,snapstate=0x19e4f10, buf=0x1a0a368) at snapbuild.c:732
>
> shows pretty clearly that snapstate *can't* be consistent because line
> 387ff is:
>       else if (snapstate->state < SNAPBUILD_CONSISTENT &&
>                SnapBuildTxnIsRunning(snapstate, xid))
>                 ;
> so #3 #4 can't happen at those line numbers with state == CONSISTENT.
>
Still this *impossible* thing happens.
Here are some more information on the logs I get on server side:

Yes I have the logical replication correctly initialized:
[629 0] LOG:  database system was shut down at 2012-11-22 09:02:42 JST
[628 0] LOG:  database system is ready to accept connections
[633 0] LOG:  autovacuum launcher started
[648 0] WARNING:  connecting to postgres
[648 0] WARNING:  Initiating logical rep
[648 0] LOG:  computed new xmin: 684
[648 0] LOG:  start reading from 0/178C1B8, scrolled back to 0/178C000

And I am also getting logs of this type with pg_receivellog:
BEGIN 698
table "pgbench_accounts": UPDATE: aid[int4]:759559 bid[int4]:8
abalance[int4]:-3641 filler[bpchar]:
table "pgbench_tellers": UPDATE: tid[int4]:93 bid[int4]:10
tbalance[int4]:-3641 filler[bpchar]:(null)
table "pgbench_branches": UPDATE: bid[int4]:10 bbalance[int4]:-3641
filler[bpchar]:(null)
table "pgbench_history": INSERT: tid[int4]:93 bid[int4]:10 aid[int4]:759559
delta[int4]:-3641 mtime[timestamp]:2012-11-22 09:05:34.535651
filler[bpchar]:(null)
COMMIT 698

Until the assertion failure:
TRAP: FailedAssertion("!(((xid) >= ((TransactionId) 3)) &&
((snapstate->xmin_running) >= ((TransactionId) 3)))", File: "snapbuild.c",
Line: 878)
I still have the core file and its binary at hand if you want, so can send
them at will.
I have not been able to read your code yet, but there should be something
you are missing.

Thanks,
-- 
Michael Paquier
http://michael.otacoo.com

Reply via email to