Re: [HACKERS] eXtensible Transaction Manager API (v2)

2016-03-11 Thread Konstantin Knizhnik

On 03/11/2016 11:35 PM, Tom Lane wrote:

Robert Haas  writes:

On Fri, Mar 11, 2016 at 1:11 PM, David Steele  wrote:

Is anyone willing to volunteer a review or make an argument for the
importance of this patch?

There's been a lot of discussion on another thread about this patch.
The subject is "The plan for FDW-based sharding", but the thread kind
of got partially hijacked by this issue.  The net-net of that is that
I don't think we have a clear enough idea about where we're going with
global transaction management to make it a good idea to adopt an API
like this.  For example, if we later decide we want to put the
functionality in core, will we keep the hooks around for the sake of
alternative non-core implementations?  I just don't believe this
technology is nearly mature enough to commit to at this point.
Konstantin does not agree with my assessment, perhaps unsurprisingly.

I re-read the original thread,

http://www.postgresql.org/message-id/flat/f2766b97-555d-424f-b29f-e0ca0f6d1...@postgrespro.ru

I think there is no question that this is an entirely immature patch.
Not coping with subtransactions is alone sufficient to make it not
credible for production.


Lack of subtransaction support is not a limitation of the XTM API.
It is a limitation of the current pg_dtm implementation. Another DTM
implementation, pg_tsdtm, does support subtransactions.



Even if the extension API were complete and clearly stable, I have doubts
that there's any great value in integrating it into 9.6, rather than some
later release series.  The above thread makes it clear that pg_dtm is very
much WIP and has easily a year's worth of work before anybody would think
of wanting to deploy it.  So end users don't need this patch in 9.6, and
developers working on pg_dtm shouldn't really have much of a problem
applying the patch locally --- how likely is it that they'd be using a
perfectly stock build of the database apart from this patch?


I agree with you that pg_dtm is very far from production.
But I want to note two things:

1. pg_dtm and pg_tsdtm are not complete cluster solutions; they are just one 
(relatively small) part of such a solution.
pg_tsdtm seems to be the more "mature" of the two, maybe because it is simpler 
and does not have many of the limitations pg_dtm has (such as lack of 
subtransaction support).

2. They can be integrated quite easily with other (existing) cluster solutions. 
We have integrated both of them with postgres_fdw and pg_shard.
postgres_fdw is also not a complete solution, just a mechanism which can also 
be used for sharding.
But pg_shard & CitusDB are quite popular solutions for distributed execution 
of queries, which provide good performance for analytic and single-node OLTP 
queries.
Integration with DTMs adds ACID semantics for distributed transactions and 
makes it possible to support more complex OLTP and OLAP queries involving 
multiple nodes.

Such integration is already done and its performance has been evaluated, so it 
is not quite correct to say that we need a year or more to make pg_dtm/pg_tsdtm 
ready to deploy.




But my real takeaway from that thread is that there's no great reason
to believe that this API definition *is* stable.  The single existing
use-case is very far from completion, and it's hardly unlikely that
what it needs will change.


Sorry, maybe I am completely wrong, but I do not think it is possible to 
develop a stable API if nobody is using it.
It is like filling the pool with water only after you have learned how to swim.



I also took a very quick look at the patch itself:

1. No documentation.  For something that purports to be an API
specification, really the documentation should have been written *first*.


Sorry, it was my fault. I have already written the documentation and it will be 
included in the next version of the patch.
But please note that when we started work on the DTM, we did not have a good 
understanding of which PostgreSQL TM features had to be changed.
The current view of XTM was formed only during the work on pg_dtm, pg_tsdtm, 
and multimaster.

And one more point: we have not introduced new abstractions in XTM.
We just override existing PostgreSQL functions.
Certainly, when internal functions become part of an API, they should be much 
better documented.



2. As noted in the cited thread, it's not clear that
Get/SetTransactionStatus are a useful cutpoint; they don't provide any
real atomicity guarantees.


I wonder how such guarantees can be provided at the API level?
Atomicity means that all other transactions see this transaction either as 
committed or as uncommitted.
So transaction commit has to be coordinated with the visibility check.
In the case of pg_dtm, atomicity is simply enforced by the fact that the 
decision whether to commit a transaction is taken by the central coordinator.
When it decides that a transaction is committed, it marks it as committed in 
all subsequently obtained snapshots.

In the case of pg_tsdtm there is no central arbiter, so we have to introduce 
an "in-doubt" state of 

Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Amit Kapila
On Sat, Mar 12, 2016 at 5:01 AM, Jim Nasby  wrote:
>
> On 3/10/16 8:36 PM, Robert Haas wrote:
>>
>> 1. We make it true only for heavyweight lock waits, and false for
>> other kinds of waits.  That's pretty strange.
>> 2. We make it true for all kinds of waits that we now know how to
>> report.  That still breaks compatibility.
>
>
> I would absolutely vote for 2 here. You could even argue that it's a bug
fix, since those were waits we technically should have been indicating.
>

I see it the other way around.  I think waiting=true for only heavyweight
locks makes sense in existing versions, as the user can still find what's
actually going on in the system either by looking at "query" in
pg_stat_activity or by referring to pg_locks; but OTOH if waiting is true for
all kinds of waits (lwlock, heavyweight lock, I/O, etc.) then I think it will
be difficult for the user to make any sense out of it.  So I see going for
option 2 as likely to confuse users rather than simplify anything.

>
> Another random thought... changes like this would probably be easier to
handle if we provided backwards compatibility extensions that created views
> that mimicked the catalog for a specific Postgres version.
>

That makes sense to me if other people agree to it; I think there will be
some maintenance overhead for it, but I see that as worth the effort in
terms of user convenience.

With Regards,
Amit Kapila.
EnterpriseDB: http://www.enterprisedb.com


Re: [HACKERS] Perl's newSViv() versus 64-bit ints?

2016-03-11 Thread Salvador Fandiño

On 03/12/2016 12:49 AM, Tom Lane wrote:

Anybody know what will happen when passing a uint64 to newSViv()?


On 64-bit platforms, it is just interpreted as a signed integer; any 
number with the upper bit set will become negative. Perl provides 
newSVuv() for unsigned numbers.


On 32-bit platforms, and with Perls compiled with 32-bit IVs, the number is 
truncated. My module Math::Int64 can be used to add support for 64-bit 
numbers there. It has a C API[*] which allows calling it from C code 
easily. Another possibility is to just use newSVnv(), but NVs are not 
able to represent the full uint64 range precisely (IEEE doubles can 
represent integers exactly only up to 53 bits).


Well, and then you can always use some bigint module, but AFAIK, the 
ones distributed in the Perl core do not provide a C API.



* https://metacpan.org/pod/Math::Int64#C-API
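
A minimal sketch of the approach described above, assuming a Perl XS
compilation environment; uint64_to_sv is an illustrative name, not an
existing Perl or Math::Int64 API:

/* Sketch: convert a uint64 to a Perl SV without sign or truncation
 * problems.  Assumes a Perl XS environment (EXTERN.h/perl.h). */
#include "EXTERN.h"
#include "perl.h"
#include <stdint.h>
#include <stdio.h>

static SV *
uint64_to_sv(pTHX_ uint64_t val)
{
	/* On perls with 64-bit UVs, newSVuv() holds the full range. */
	if (sizeof(UV) >= sizeof(uint64_t))
		return newSVuv((UV) val);
	else
	{
		/* On 32-bit-UV perls, hand over a decimal string instead;
		 * Perl converts it numerically on demand without losing
		 * the exact textual value. */
		char		buf[24];

		snprintf(buf, sizeof(buf), "%llu", (unsigned long long) val);
		return newSVpv(buf, 0);
	}
}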




[HACKERS] [PATCH] Use MemoryContextAlloc() in the MemoryContextAllocZero() and MemoryContextAllocZeroAligned()

2016-03-11 Thread Alexander Kuleshov
Hello all,

The attached patch simplifies MemoryContextAllocZero() and
MemoryContextAllocZeroAligned().
These two functions do almost the same thing that MemoryContextAlloc() does;
additionally, they fill the allocated memory with zeros via MemSetAligned()
and MemSetLoop(). Let's call MemoryContextAlloc() in these functions instead
of duplicating its logic (setting isReset to false, calling the context's
alloc() callback, and so on), to prevent code duplication.
diff --git a/src/backend/utils/mmgr/mcxt.c b/src/backend/utils/mmgr/mcxt.c
index 2bfd364..2845089 100644
--- a/src/backend/utils/mmgr/mcxt.c
+++ b/src/backend/utils/mmgr/mcxt.c
@@ -788,26 +788,7 @@ MemoryContextAllocZero(MemoryContext context, Size size)
 {
 	void	   *ret;
 
-	AssertArg(MemoryContextIsValid(context));
-	AssertNotInCriticalSection(context);
-
-	if (!AllocSizeIsValid(size))
-		elog(ERROR, "invalid memory alloc request size %zu", size);
-
-	context->isReset = false;
-
-	ret = (*context->methods->alloc) (context, size);
-	if (ret == NULL)
-	{
-		MemoryContextStats(TopMemoryContext);
-		ereport(ERROR,
-(errcode(ERRCODE_OUT_OF_MEMORY),
- errmsg("out of memory"),
- errdetail("Failed on request of size %zu.", size)));
-	}
-
-	VALGRIND_MEMPOOL_ALLOC(context, ret, size);
-
+	ret = MemoryContextAlloc(context, size);
 	MemSetAligned(ret, 0, size);
 
 	return ret;
@@ -825,26 +806,7 @@ MemoryContextAllocZeroAligned(MemoryContext context, Size size)
 {
 	void	   *ret;
 
-	AssertArg(MemoryContextIsValid(context));
-	AssertNotInCriticalSection(context);
-
-	if (!AllocSizeIsValid(size))
-		elog(ERROR, "invalid memory alloc request size %zu", size);
-
-	context->isReset = false;
-
-	ret = (*context->methods->alloc) (context, size);
-	if (ret == NULL)
-	{
-		MemoryContextStats(TopMemoryContext);
-		ereport(ERROR,
-(errcode(ERRCODE_OUT_OF_MEMORY),
- errmsg("out of memory"),
- errdetail("Failed on request of size %zu.", size)));
-	}
-
-	VALGRIND_MEMPOOL_ALLOC(context, ret, size);
-
+	ret = MemoryContextAlloc(context, size);
 	MemSetLoop(ret, 0, size);
 
 	return ret;



Re: [HACKERS] Explain [Analyze] produces parallel scan for select Into table statements.

2016-03-11 Thread Amit Kapila
On Fri, Mar 11, 2016 at 3:34 PM, Mithun Cy  wrote:
>
> On Thu, Mar 10, 2016 at 9:39 PM, Robert Haas  wrote:
> >I guess there must not be an occurrence of this pattern in the
> >regression tests, or previous force_parallel_mode testing would have
> >found this problem.  Perhaps this patch should add one?
>
> I have added the test to select_into.sql. Added Explain select into
statement.
>

I don't see how this test will fail with force_parallel_mode=regress and
max_parallel_degree > 0, even without the patch proposed to fix the issue at
hand.  In short, I don't think this test would have caught the issue, so I
don't see much advantage in adding such a test.  Even if we want to add
such a test case, I think as proposed it will substantially increase the
timing of the "Select Into" test, which might not be an acceptable test
addition, especially for testing one corner case.

>
> Explain Analyze produces planning time and execution time even with
TIMING OFF
> so not adding the same to regress tests.
>

Yeah, that makes the addition of test for this functionality difficult.
Robert, do you have any idea what kind of test would have caught this issue?


With Regards,
Amit Kapila.
EnterpriseDB: http://www.enterprisedb.com


Re: [HACKERS] raw output from copy

2016-03-11 Thread Pavel Stehule
Hi

2016-03-09 18:41 GMT+01:00 Corey Huinker :

>
>>> The regression tests seem to adequately cover all new functionality,
>>> though I wonder if we should add some cases that highlight situations where
>>> BINARY mode is insufficient.
>>>
>>>
> One thing I tried to test RAW was to load an existing json file.
>
> My own personal test was to load an existing .json file into a 1x1 bytea
> table, which worked. From there I was able to
> select encode(col_name,'escape')::text::jsonb from test_table
> and the json was correctly converted.
>
> A similar test copying binary failed.
>
> A write up of the test looks like this:
>
>
> \copy (select '{"foo": "bar"}') to '/tmp/raw_test.jsonb' (format raw);
> COPY 1
> create temporary table raw_byte (b bytea);
> CREATE TABLE
> create temporary table raw_text (t text);
> CREATE TABLE
> \copy raw_jsonb from '/tmp/raw_test.blob' (format raw);
> psql:/home/ubuntu/raw_test.sql:9: ERROR:  relation "raw_jsonb" does not
> exist
> \copy raw_byte from '/tmp/raw_test.blob' (format raw);
> COPY 1
> select encode(b,'escape')::text::json from raw_byte;
>  encode
> 
>  {"foo": "bar"}
> (1 row)
>
> \copy raw_text from '/tmp/raw_test.blob' (format raw);
> COPY 1
> select t::jsonb from raw_text;
>t
> 
>  {"foo": "bar"}
> (1 row)
>
> create temporary table binary_byte (b bytea);
> CREATE TABLE
> create temporary table binary_text (t text);
> CREATE TABLE
> \copy binary_byte from '/tmp/raw_test.blob' (format binary);
> psql:/home/ubuntu/raw_test.sql:22: ERROR:  COPY file signature not
> recognized
> select encode(b,'escape')::jsonb from binary_byte;
>  encode
> 
> (0 rows)
>
> \copy binary_text from '/tmp/raw_test.blob' (format binary);
> psql:/home/ubuntu/raw_test.sql:26: ERROR:  COPY file signature not
> recognized
> select t::jsonb from binary_text;
>  t
> ---
> (0 rows)
>
>
> So, *if* we want to add a regression test to demonstrate to posterity why
> we need RAW for cases that BINARY can't handle, I offer the attached file.
>

I don't think the regression tests should do this demonstration. It is
clear that COPY BINARY will fail every time, so there is no benefit from
it in the regression tests. There has been a lot of discussion in this
thread, and we don't need to inject more "garbage" into the regression tests.


>
> Does anyone else see value in adding that to the regression tests?
>

>
>
>> Before I give my approval, I want to read it again more closely to make
>>> sure that no cases were skipped with regard to the  (binary || raw) and
>>> (binary || !raw) tests. Also, I want to use it on some of my problematic
>>> files. Maybe I'll find a good edge case. Probably not.
>>>
>>
> I don't know why I thought this, but when I looked at the patch, I assumed
> that the ( binary || raw ) tests were part of a large if/elseif/else
> waterfall. They are not. They stand alone. There are no edge cases to find.
>

This is organized into files out of the necessity of working with external
files. The regression tests for COPY RAW are about 100 lines, so there is no
need for special files and infrastructure; the COPY RAW and COPY BINARY tests
share infrastructure well.


>
> Review complete and passed. I can re-review if we want to add the
> additional test.
>
>
Great, thank you very much. I hope this feature will be really useful. It
allows simple export/import of XML docs in different encodings, and of JSON,
and it can be enhanced in the future via options. A nice feature (but not for
this release) would be additional cast info for import -- like "COPY
table(jsonb_column) FROM stdin (FORMAT RAW, CAST json_2_jsonb)". Because the
options are there, there is plenty of room for further enhancement.

Regards

Pavel


Re: [HACKERS] Relation extension scalability

2016-03-11 Thread Dilip Kumar
On Sat, Mar 12, 2016 at 8:37 AM, Amit Kapila  wrote:

> Can you post the numbers for 1, 5, 10, 15, 25 or whatever other multiplier
> you have tried, so that it is clear that 20 is best?


I tried with 1, 10, 20 and 50.

1. With 1, it was almost the same as the base code.

2. With 10, the data matches my previous group-extend patch data posted
upthread:
http://www.postgresql.org/message-id/CAFiTN-tyEu+Wf0-jBc3TGfCoHdEAjNTx=wvuxpoa1vddyst...@mail.gmail.com

3. Beyond 20 I did not see any extra benefit in the performance numbers
(I compared 50 against 20 with 4-byte COPY only; I did not test other data
sizes with 50).

-- 
Regards,
Dilip Kumar
EnterpriseDB: http://www.enterprisedb.com


Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Joel Jacobson
On Sat, Mar 12, 2016 at 6:31 AM, Jim Nasby  wrote:
> On 3/10/16 8:36 PM, Robert Haas wrote:
>>
>> 1. We make it true only for heavyweight lock waits, and false for
>> other kinds of waits.  That's pretty strange.
>> 2. We make it true for all kinds of waits that we now know how to
>> report.  That still breaks compatibility.
>
>
> I would absolutely vote for 2 here. You could even argue that it's a bug
> fix, since those were waits we technically should have been indicating.
>
> The only way I can see #2 breaking anything is if you're using waiting=true
> to determine whether you look at pg_locks and your code will blow up if you
> get no rows back, but that seems like a pretty limited use case to me
> (Hello, LEFT JOIN).
>
> Dropping the column entirely though would break tons of things.

That was a very good point I hadn't thought about. In that case, +1 to keep it.

I also came to think of the renaming of pg_stat_activity.procpid ->
pg_stat_activity.pid,
which was renamed because "backwards compatibility was broken anyway"
(due to current_query -> query).

I wouldn't rule out there were users who didn't use the
"current_query" column but *did* use "procpid",
who wouldn't have been affected if the procpid column hadn't been
renamed as well.

Apparently in this case, it was OK to break even more things than
necessary, since another thing was already broken.

I don't have any opinion on whether doing so was a bad or good decision;
it all depends on what the project's objectives are.
Unfortunately, it feels like the case-by-case decision model
leads to conflicting arguments when they are compared side-by-side.

I really think the project needs a policy here on how, why and when
it's OK to break backwards compatibility.
Such a policy won't cover all cases of course, but if a consensus can
be reached on the basic principles,
then discussion can be focused on the details and cases not covered by
the basic principles,
instead of ending up "hashing it out on a case-by-case basis".




Re: [HACKERS] pl/pgSQL, get diagnostics and big data

2016-03-11 Thread Petr Jelinek

On 12/03/16 04:30, Tom Lane wrote:


1. I found two places (marked XXX in this patch) that are using strtoul()
to parse a tuple count back out of a command tag.  That won't do anymore.
pg_stat_statements has a messy hack for the same problem (look for
HAVE_STRTOULL), which is probably what we want to do, but not by
copy-and-pasting #ifdef HAVE_STRTOULL into multiple places.  I'd be
inclined to provide a utility function "pg_strtouint64" or some such
to encapsulate that.  (numutils.c might be a good place for it.)



Hmm, I thought that solution is not really portable for 64-bit numbers 
and is only allowed in pg_stat_statements because, in the worst case, it will 
cut the number to a 32-bit int and misreport, but won't break anything there. 
For example, Windows IIRC needs _strtoui64 for this.


I once wrote (well, copy-pasted from BSD plus some #define wrappers) a 
portable version of that; see the 0004 and the bottom of 0003 in 
http://www.postgresql.org/message-id/557d9ded.2080...@2ndquadrant.com (I 
think at minimum what the 0003 does in c.h is needed).
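
For illustration, such a wrapper might look roughly like this (a sketch
assuming the usual configure-provided macros; not the version from those
patches):

#include "postgres.h"

/* Sketch of a portable pg_strtouint64(); the #ifdef branches assume
 * configure's HAVE_STRTOULL/SIZEOF_LONG macros and MSVC's _strtoui64(). */
uint64
pg_strtouint64(const char *str, char **endptr, int base)
{
#ifdef _MSC_VER
	/* MSVC has no strtoull(), but offers an equivalent */
	return _strtoui64(str, endptr, base);
#elif defined(HAVE_STRTOULL) && SIZEOF_LONG < 8
	return strtoull(str, endptr, base);
#else
	/* on LP64 platforms, plain strtoul() already returns 64 bits */
	return strtoul(str, endptr, base);
#endif
}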


--
  Petr Jelinek  http://www.2ndQuadrant.com/
  PostgreSQL Development, 24x7 Support, Training & Services




Re: [HACKERS] Relation extension scalability

2016-03-11 Thread Petr Jelinek

On 12/03/16 01:01, Jim Nasby wrote:

On 3/11/16 5:14 PM, Petr Jelinek wrote:

I don't really understand this part about concurrent DDL.  If there
were concurrent DDL going on, presumably other backends would be
blocked on the relation lock, not the relation extension lock - and it
doesn't seem likely that you'd often have a huge pile-up of inserters
waiting on concurrent DDL.  But I guess it could happen.



Yeah, I was thinking about the latter part, and as I said it's a very rare
case, but I did see something similar a couple of times in the wild. It's
not an objection against committing this patch, though; in fact I think it
can be committed as is.


FWIW, this is definitely a real possibility in any shop that has very
high downtime costs and high transaction rates.

I also think some kind of clamp is a good idea. It's not that uncommon
to run max_connections significantly higher than 100, so the extension
could be way larger than 16MB. In those cases this patch could actually
make things far worse as everyone backs up waiting on the OS to extend
many MB when all you actually needed were a couple dozen more pages.



Well, yeah, I've seen 10k, but not everything will write to the same table; 
I wanted to stay in the realm of something that has a realistic chance of 
happening.



BTW, how was *20 arrived at? ISTM that if you have a lot of concurrent
demand for extension that means you're running lots of small DML
operations, not really big ones. I'd think that would make *1 more
appropriate.


The benchmarks I've seen showed you want at least *10, and *20 was better.

--
  Petr Jelinek  http://www.2ndQuadrant.com/
  PostgreSQL Development, 24x7 Support, Training & Services




Re: [HACKERS] Relation extension scalability

2016-03-11 Thread Petr Jelinek

On 12/03/16 03:46, Dilip Kumar wrote:


On Sat, Mar 12, 2016 at 5:31 AM, Jim Nasby  wrote:

FWIW, this is definitely a real possibility in any shop that has
very high downtime costs and high transaction rates.

I also think some kind of clamp is a good idea. It's not that
uncommon to run max_connections significantly higher than 100, so
the extension could be way larger than 16MB. In those cases this
patch could actually make things far worse as everyone backs up
waiting on the OS to extend many MB when all you actually needed
were a couple dozen more pages.


I agree. We can have some max limit on the number of extra pages. What do
others think?



Well, that's what I meant by clamping originally. I don't know what a good 
value is, though.


--
  Petr Jelinek  http://www.2ndQuadrant.com/
  PostgreSQL Development, 24x7 Support, Training & Services




Re: [HACKERS] Parallel Aggregate

2016-03-11 Thread David Rowley
On 11 March 2016 at 03:39, David Rowley  wrote:
> A couple of things which I'm not 100% happy with.
>
> 1. make_partialgroup_input_target() doing lookups to the syscache.
> Perhaps this job can be offloaded to a new function in a more suitable
> location. Ideally the Aggref would already store the required
> information, but I don't see a great place to be looking that up.

I've made some changes and moved this work off to a new function in tlist.c.

> 2. I don't really like how I had to add tlist to
> create_grouping_paths(), but I didn't want to go to the trouble of
> calculating the partial agg PathTarget if Parallel Aggregation is not
> possible, as this does do syscache lookups, so it's not free, so I'd
> rather only do it if we're actually going to add some parallel paths.

This is now fixed. The solution was much easier after 49635d7b.

> 3. Something about the force_parallel_mode GUC. I'll think about this
> when I start to think about how to test this, as likely I'll need
> that, else I'd have to create tables bigger than we'd want to in the
> regression tests.

On further analysis it seems that this GUC does not do what I thought
it did, which must be why Robert said that I don't need to think about
it here. The GUC just seems to add a Gather node at the base of the
plan tree, when possible. Which leaves me a bit lost when it comes to
how to write tests for this... It seems like I need to add at least
136k rows to a test table to get a Parallel Aggregate plan, which I
think is a no-go for the regression test suite... and that's with
parallel_setup_cost = 0.

It would be nice if that GUC just, when enabled, preferred the
cheapest parallel plan (when available), rather than hacking in a
Gather node into the plan's root. This should have the same result in
many cases anyway, and would allow me to test this without generating
oversized tables in the regression tests.

I've attached an updated patch which is based on commit 7087166. Things
are really changing fast in the grouping path area at the moment, but
hopefully the dust is starting to settle now.

This patch also fixes a couple of bugs: one in the cost estimates for
the number of groups that will be produced by the final aggregate; also,
there was a missing copyObject() in the setrefs.c code which
caused a "Var not found in targetlist" problem in setrefs.c for plans
with more than one partial aggregate node... I had to modify the planner
to get it to add an additional aggregate node to test this (a separate
test patch for this is attached).

Comments/Reviews/Testing all welcome.

Thanks

-- 
 David Rowley   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services


parallel_aggregation_8a26089_2016-03-12.patch
Description: Binary data


parallel_aggregation_extra_combine_node.patch
Description: Binary data



Re: [HACKERS] pl/pgSQL, get diagnostics and big data

2016-03-11 Thread Tom Lane
I wrote:
> I'll take it from here, unless I find bigger issues.

Hmm ... so the more I pulled on this string, the more stuff I found.
The attached updated patch fixes several additional significant
areas:

* call_cntr and max_calls in FuncCallContext are now uint64

* Result widths for PortalRunFetch and allied routines are now uint64

* count arguments to ExecutorRun and related routines are now uint64

* Maximum number of tuples in a SPITupleTable is now uint64

* Management of portal (cursor) positions fixed.  I promoted portalPos to
uint64, and got rid of posOverflow, which seems not especially necessary
if we're using 64-bit counters.

* A whole lot of places that were comparing integer-width variables
to SPI_processed, and would therefore go into infinite loops with
result sizes above 2G tuples, now use uint64 counters instead.

However, I drew the line at changing FetchStmt.howMany, which means that
the count-limit inputs to PortalRunFetch et al didn't change; they're
still signed long.  Changing that would require messing with the lexer
which I did not feel like doing.  So the maximum offset/position in a
FETCH or MOVE is (still) platform-dependent.  I'm not sure that this is
worth changing, as you will get a clean syntax error if you try to exceed
the limit.

BTW, if anyone thinks the above changes represent too much API churn,
speak up.  This is definitely going to cause some pain for extensions.

There's a fair amount of work yet to do:

1. I found two places (marked XXX in this patch) that are using strtoul()
to parse a tuple count back out of a command tag.  That won't do anymore.
pg_stat_statements has a messy hack for the same problem (look for
HAVE_STRTOULL), which is probably what we want to do, but not by
copy-and-pasting #ifdef HAVE_STRTOULL into multiple places.  I'd be
inclined to provide a utility function "pg_strtouint64" or some such
to encapsulate that.  (numutils.c might be a good place for it.)

2. As I was just complaining to -hackers, plpython plperl and pltcl
all now contain attempts to pass uint64 values (from SPI_processed)
into language-specific functions.  We need to figure out whether
that will overflow and whether it's worth doing something about.

3. This patch still needs a lot of review as I may have missed
something.

So I'm bouncing this back to Waiting on Author.

regards, tom lane

diff --git a/contrib/auto_explain/auto_explain.c b/contrib/auto_explain/auto_explain.c
index 76d1831..6708d81 100644
*** a/contrib/auto_explain/auto_explain.c
--- b/contrib/auto_explain/auto_explain.c
*** void		_PG_fini(void);
*** 61,67 
  static void explain_ExecutorStart(QueryDesc *queryDesc, int eflags);
  static void explain_ExecutorRun(QueryDesc *queryDesc,
  	ScanDirection direction,
! 	long count);
  static void explain_ExecutorFinish(QueryDesc *queryDesc);
  static void explain_ExecutorEnd(QueryDesc *queryDesc);
  
--- 61,67 
  static void explain_ExecutorStart(QueryDesc *queryDesc, int eflags);
  static void explain_ExecutorRun(QueryDesc *queryDesc,
  	ScanDirection direction,
! 	uint64 count);
  static void explain_ExecutorFinish(QueryDesc *queryDesc);
  static void explain_ExecutorEnd(QueryDesc *queryDesc);
  
*** explain_ExecutorStart(QueryDesc *queryDe
*** 257,263 
   * ExecutorRun hook: all we need do is track nesting depth
   */
  static void
! explain_ExecutorRun(QueryDesc *queryDesc, ScanDirection direction, long count)
  {
  	nesting_level++;
  	PG_TRY();
--- 257,263 
   * ExecutorRun hook: all we need do is track nesting depth
   */
  static void
! explain_ExecutorRun(QueryDesc *queryDesc, ScanDirection direction, uint64 count)
  {
  	nesting_level++;
  	PG_TRY();
diff --git a/contrib/pg_stat_statements/pg_stat_statements.c b/contrib/pg_stat_statements/pg_stat_statements.c
index 9ce60e6..29e12cd 100644
*** a/contrib/pg_stat_statements/pg_stat_statements.c
--- b/contrib/pg_stat_statements/pg_stat_statements.c
*** static void pgss_post_parse_analyze(Pars
*** 289,295 
  static void pgss_ExecutorStart(QueryDesc *queryDesc, int eflags);
  static void pgss_ExecutorRun(QueryDesc *queryDesc,
   ScanDirection direction,
!  long count);
  static void pgss_ExecutorFinish(QueryDesc *queryDesc);
  static void pgss_ExecutorEnd(QueryDesc *queryDesc);
  static void pgss_ProcessUtility(Node *parsetree, const char *queryString,
--- 289,295 
  static void pgss_ExecutorStart(QueryDesc *queryDesc, int eflags);
  static void pgss_ExecutorRun(QueryDesc *queryDesc,
   ScanDirection direction,
!  uint64 count);
  static void pgss_ExecutorFinish(QueryDesc *queryDesc);
  static void pgss_ExecutorEnd(QueryDesc *queryDesc);
  static void pgss_ProcessUtility(Node *parsetree, const char *queryString,
*** pgss_ExecutorStart(QueryDesc *queryDesc,
*** 866,872 
   * ExecutorRun hook: all we need do is track nesting depth
   */
  static void
! pgss_ExecutorRun(QueryDesc 

[HACKERS] Re: [COMMITTERS] pgsql: Only try to push down foreign joins if the user mapping OIDs mat

2016-03-11 Thread Andres Freund
Hi,

On 2016-01-28 19:09:01 +, Robert Haas wrote:
> Only try to push down foreign joins if the user mapping OIDs match.
> 
> Previously, the foreign join pushdown infrastructure left the question
> of security entirely up to individual FDWs, but it would be easy for
> a foreign data wrapper to inadvertently open up subtle security holes
> that way.  So, make it the core code's job to determine which user
> mapping OID is relevant, and don't attempt join pushdown unless it's
> the same for all relevant relations.
> 
> Per a suggestion from Tom Lane.  Shigeru Hanada and Ashutosh Bapat,
> reviewed by Etsuro Fujita and KaiGai Kohei, with some further
> changes by me.

I noticed that this breaks some citus regression tests in a minor
manner. Namely previously file_fdw worked without a user mapping, now it
doesn't appear to anymore.

This is easy enough to fix, and it's perfectly ok for us to fix this,
but I do wonder if that's not going to cause trouble for others.

Andres




Re: [HACKERS] Relation extension scalability

2016-03-11 Thread Amit Kapila
On Sat, Mar 12, 2016 at 8:16 AM, Dilip Kumar  wrote:
>
>
> On Sat, Mar 12, 2016 at 5:31 AM, Jim Nasby  wrote:
>>
>> FWIW, this is definitely a real possibility in any shop that has very
high downtime costs and high transaction rates.
>>
>> I also think some kind of clamp is a good idea. It's not that uncommon
to run max_connections significantly higher than 100, so the extension
could be way larger than 16MB. In those cases this patch could actually
make things far worse as everyone backs up waiting on the OS to extend many
MB when all you actually needed were a couple dozen more pages.
>
>
> I agree. We can have some max limit on the number of extra pages. What do
> others think?
>
>
>>
>> BTW, how was *20 arrived at? ISTM that if you have a lot of concurrent
demand for extension that means you're running lots of small DML
operations, not really big ones. I'd think that would make *1 more
appropriate.
>
>
> *1 will not solve this problem.  Here the main problem was that many
> backends sleep/wake up on the extension lock, and that was causing the
> bottleneck.  So if we do *1, this will satisfy only the current requesters
> that have already waited on the lock; but our goal is to stop backends
> from requesting this lock in the first place.
>
> The idea is to count the requesters to get statistics on this lock (the
> load on the lock) and extend by a multiple of the load, so that this
> situation is avoided for a long time; when it happens again, we again
> extend by a multiple of the load.
>
> How did 20 come about?
>   I tested with multiple client loads (1..64), with multiple record sizes
> (4-byte to 1KB records), using COPY/INSERT, and found 20 works best.
>

Can you post the numbers for 1, 5, 10, 15, 25 or whatever other multiplier
you have tried, so that it is clear that 20 is best?


With Regards,
Amit Kapila.
EnterpriseDB: http://www.enterprisedb.com


Re: [HACKERS] Proposal: BSD Authentication support

2016-03-11 Thread Peter Eisentraut
On 3/11/16 4:38 PM, Thomas Munro wrote:
> It looks like this needs review from an OpenBSD user specifically.
> FreeBSD and NetBSD use PAM instead of BSD auth.

FreeBSD has man pages for this stuff, so maybe they also have it now.





Re: [HACKERS] Proposal: BSD Authentication support

2016-03-11 Thread Peter Eisentraut
On 1/7/16 9:40 PM, Marisa Emerson wrote:
> There's a port for PAM, but we would prefer to use BSD Auth as its quite
> a lot cleaner and is standard on OpenBSD.
> 
> I've attached an updated patch that includes documentation. It has been
> tested against OpenBSD 5.8. I'll add this thread to the commitfest.

(Not a BSD user, just reviewing the code.)

configure.in has "build with BSD support", which should be "build with
BSD Authentication support".

There should be some documentation of the new configure option in
installation.sgml.

The documentation in client-auth.sgml speaks of a postgresql user and an
auth group.  Maybe that's clear to users of BSD, but I don't know
whether these are OS entities or groups that I need to create or what.

The auth_userokay() call hardcodes a "type" of "pg-auth".  That seems
important and should probably be documented.  Extrapolating from PAM, I
think that should perhaps be an option in pg_hba.conf.
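
For reference, the OpenBSD call in question looks roughly like this (a
sketch, not the patch's actual code; the hardcoded "pg-auth" type is the
value suggested above for exposure in pg_hba.conf):

#include <sys/types.h>
#include <login_cap.h>
#include <bsd_auth.h>
#include <stdbool.h>

/* Sketch of a BSD Authentication check.  auth_userokay() runs the
 * authentication script for the given style; a NULL style means "use
 * the default style for this user". */
static bool
check_bsd_auth(const char *user, const char *password)
{
	return auth_userokay((char *) user, NULL, "pg-auth",
						 (char *) password) != 0;
}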





Re: [HACKERS] Relation extension scalability

2016-03-11 Thread Dilip Kumar
On Sat, Mar 12, 2016 at 5:31 AM, Jim Nasby  wrote:

> FWIW, this is definitely a real possibility in any shop that has very high
> downtime costs and high transaction rates.
>
> I also think some kind of clamp is a good idea. It's not that uncommon to
> run max_connections significantly higher than 100, so the extension could
> be way larger than 16MB. In those cases this patch could actually make
> things far worse as everyone backs up waiting on the OS to extend many MB
> when all you actually needed were a couple dozen more pages.
>

I agree. We can have some max limit on the number of extra pages. What do
others think?



> BTW, how was *20 arrived at? ISTM that if you have a lot of concurrent
> demand for extension that means you're running lots of small DML
> operations, not really big ones. I'd think that would make *1 more
> appropriate.
>

*1 will not solve this problem.  Here the main problem was that many
backends sleep/wake up on the extension lock, and that was causing the
bottleneck.  So if we do *1, this will satisfy only the current requesters
that have already waited on the lock; but our goal is to stop backends
from requesting this lock in the first place.

The idea is to count the requesters to get statistics on this lock (the
load on the lock) and extend by a multiple of the load, so that this
situation is avoided for a long time; when it happens again, we again
extend by a multiple of the load.

How did 20 come about?
  I tested with multiple client loads (1..64), with multiple record sizes
(4-byte to 1KB records), using COPY/INSERT, and found 20 works best.
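
For illustration, the multiplier-plus-clamp idea under discussion would look
something like this (the names GetExtensionLockWaiterCount and
MAX_EXTEND_BLOCKS are made up for this sketch, not taken from the patch):

#include "postgres.h"
#include "utils/rel.h"

#define EXTEND_MULTIPLIER	20
#define MAX_EXTEND_BLOCKS	512		/* example clamp: 4MB of 8kB blocks */

static int
blocks_to_extend_by(Relation relation)
{
	/* assumed helper returning the number of waiters on the lock */
	int			waiters = GetExtensionLockWaiterCount(relation);

	/* extend by a multiple of the observed load, but never unboundedly */
	return Min(waiters * EXTEND_MULTIPLIER, MAX_EXTEND_BLOCKS);
}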


-- 
Regards,
Dilip Kumar
EnterpriseDB: http://www.enterprisedb.com


Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Tomas Vondra

On Fri, 2016-03-11 at 16:40 -0800, Peter Geoghegan wrote:
> On Fri, Mar 11, 2016 at 4:17 PM, Peter Geoghegan 
> wrote:
> > 
> > If you want the tool to limp on when it finds an error, that can be
> > done by changing the constant for the CORRUPTION macro in
> > amcheck.c.
> > But having that be dynamically configurable is not really
> > compatible
> > with the goal of having amcheck be run routinely.
> Also, it's just really hard to reason about what remains OK to check
> after the first problem is encountered in the general case. It's
> "unreasonable" for any of the checks to ever fail. So, by that
> standard, assuming that they might fail at all could be called
> paranoid. Who can say what "paranoid enough" should be? I think it's
> useful to have a low-impact, generic check function for B-Tree
> indexes, but I don't think we need to hold back on being exhaustive
> elsewhere.

Right, but isn't there a difference between the two functions in this
respect? Once you find corruption involving relationship between
multiple pages, then I agree it's complicated to do any reasoning about
what additional checks are safe.

But does that problem also apply to bt_index_check, which checks pages
independently?

Admittedly, this also depends on the use case. If all we need to do is
answer the question "Is the index corrupted?" then sure, bailing out
on the first error is perfectly appropriate.

But perhaps this would be useful for some recovery/forensics tasks?

From time to time we need to investigate corruption in a database, i.e.
see how much of the data is actually corrupted, list pages that need to
be zeroed to get the cluster up to salvage as much as possible, etc.
Currently this is tedious because we essentially find/fix the pages one
by one. It'd be very useful to list all broken pages in one go and then
fix all of them.

Obviously, that's about heapam checks, but perhaps it would be useful
for an index too?

regards

-- 
Tomas Vondra  http://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services





Re: [HACKERS] Perl's newSViv() versus 64-bit ints?

2016-03-11 Thread Andrew Dunstan



On 03/11/2016 06:49 PM, Tom Lane wrote:

Anybody know what will happen when passing a uint64 to newSViv()?





A perl IV is guaranteed large enough to hold a pointer, if that's any 
help. But for an unsigned value you might be better off calling newSVuv()


cheers

andrew





Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Peter Geoghegan
On Fri, Mar 11, 2016 at 4:30 PM, Jim Nasby  wrote:
> Right, but you still have the option to enable them if you don't want to
> swamp your IO system. That's why CIC obeys it too. If I was running a
> consistency check on a production system I'd certainly want the option to
> throttle it. Without that option, I don't see running this on production
> systems as being an option. If that's not a goal then fine, but if it is a
> goal I think it needs to be there.
>
> Isn't it just a few extra lines of code to support it?

I see your point.

I'll add that if people like the interface you propose. (Overloading
the VACUUM cost delay functions to cause a delay for amcheck
functions, too). Note that the functions already use an appropriate
buffer access strategy (it avoids blowing shared_buffers, much like
VACUUM itself).

-- 
Peter Geoghegan




Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Peter Geoghegan
On Fri, Mar 11, 2016 at 4:17 PM, Peter Geoghegan  wrote:
> If you want the tool to limp on when it finds an error, that can be
> done by changing the constant for the CORRUPTION macro in amcheck.c.
> But having that be dynamically configurable is not really compatible
> with the goal of having amcheck be run routinely.

Also, it's just really hard to reason about what remains OK to check
after the first problem is encountered in the general case. It's
"unreasonable" for any of the checks to ever fail. So, by that
standard, assuming that they might fail at all could be called
paranoid. Who can say what "paranoid enough" should be? I think it's
useful to have a low-impact, generic check function for B-Tree
indexes, but I don't think we need to hold back on being exhaustive
elsewhere.

-- 
Peter Geoghegan




Re: [HACKERS] Is there a way around function search_path killing SQL function inlining? - and backup / restore issue

2016-03-11 Thread Jim Nasby

On 3/10/16 3:29 PM, Regina Obe wrote:

Take, for example, tiger geocoder, which relies on fuzzystrmatch.  I have 
no idea where someone will install fuzzystrmatch, so I can't schema-qualify 
those calls.  And I use those dependent functions to build indexes on tables.


This is something I've thought about as well, and I think the real 
problem is search_path just isn't the right way to handle this. I think 
there needs to be some way to definitively reference something that's 
part of an extension; a method that doesn't depend on whatever schema 
the extension happens to be installed in.

--
Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX
Experts in Analytics, Data Architecture and PostgreSQL
Data in Trouble? Get it in Treble! http://BlueTreble.com




Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Jim Nasby

On 3/11/16 6:17 PM, Peter Geoghegan wrote:

Not sure about the cost delay thing. Delays are disabled by default
for manually issued VACUUM, so have doubts that that's useful.


Right, but you still have the option to enable them if you don't want to 
swamp your IO system. That's why CIC obeys it too. If I was running a 
consistency check on a production system I'd certainly want the option 
to throttle it. Without that option, I don't see running this on 
production systems as being an option. If that's not a goal then fine, 
but if it is a goal I think it needs to be there.


Isn't it just a few extra lines of code to support it?
--
Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX
Experts in Analytics, Data Architecture and PostgreSQL
Data in Trouble? Get it in Treble! http://BlueTreble.com




Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Peter Geoghegan
On Fri, Mar 11, 2016 at 3:50 PM, Jim Nasby  wrote:
> I also agree that the module name isn't very clear. If this is meant to be
> the start of a generic consistency checker, let's call it that. Otherwise, it
> should be marked as being specific to btrees, because presumably we might
> eventually want similar tools for GIN, etc. (FWIW I'd vote for a general
> consistency checker).

It's a generic consistency checker -- that's the intent. Robert wanted
to go that way about a year ago, and I think it makes sense (this
module started out life as btreecheck). As I said, I don't really care
what the name ends up being. amcheck seems fine to me, but I'll go
with any reasonable suggestion that reflects the scope of the tool as
a generic consistency checker for AMs, including heapam.

I don't know that I understand the concern about the naming of
bt_index_parent_check(). I called it something that accurately
reflects what it does. That particular SQL-callable function is more
orientated towards helping hackers than towards helping users detect
routine problems, as the docs say, but it could help users working
with hackers, and it might even help users acting on their own. I
don't want to make it sound scary, because it isn't. I don't know what
problems will be detected by this tool most often, and I don't think
it would be wise to try to predict how that will work out. And so,
concealing the internal workings of each function/check by giving them
all totally non-technical names seems like a bad idea. It's not that
hard to understand that a B-Tree has multiple levels, and checking the
levels against each other requires more restrictive/stronger
heavyweight locks. I've only added 2 SQL-callable functions, each of
which has one argument, and the first of which has a generic name and
very light locking requirements, like a SELECT. It's not that hard to
get the general idea, as an ordinary user.

> I know the vacuum race condition would be very rare, but I don't think it
> can be ignored. Last thing you want out of a consistency checker is false
> negatives/positives.

That's what I said. And the docs also say that there should never be a
false positive. That's definitely an important design goal here,
because it enables routine testing.

> I do think it would be reasonable to just wholesale
> block against concurrent vacuums, but I don't think there's any reasonable
> way to do that.

Actually, that's exactly what bt_index_parent_check() does already.
VACUUM requires a SHARE UPDATE EXCLUSIVE lock on the heap relation,
which cannot be concurrently acquired due to bt_index_parent_check()'s
acquisition of a SHARE lock. The locking for bt_index_parent_check()
is almost the same as REINDEX, except that that acquires an ACCESS
EXCLUSIVE lock on the index relation; bt_index_parent_check() only
requires an EXCLUSIVE lock on the index relation.
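
Sketched out, the locking described above is roughly the following
(illustrative, not the actual amcheck code):

#include "postgres.h"
#include "access/genam.h"
#include "access/heapam.h"
#include "storage/lock.h"

/* Sketch: acquire the locks bt_index_parent_check() needs, per the
 * description above.  ShareLock on the heap conflicts with VACUUM's
 * ShareUpdateExclusiveLock; ExclusiveLock on the index is one step
 * weaker than REINDEX's AccessExclusiveLock, so plain SELECTs can
 * still use the index. */
static void
lock_for_parent_check(Oid heapoid, Oid indoid,
					  Relation *heaprel, Relation *indrel)
{
	*heaprel = heap_open(heapoid, ShareLock);
	*indrel = index_open(indoid, ExclusiveLock);
}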

Not sure about the cost delay thing. Delays are disabled by default
for manually issued VACUUM, so have doubts that that's useful.

If you want the tool to limp on when it finds an error, that can be
done by changing the constant for the CORRUPTION macro in amcheck.c.
But having that be dynamically configurable is not really compatible
with the goal of having amcheck be run routinely.
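
The compile-time knob being referred to is roughly the following (an
illustrative sketch; see amcheck.c for the real definition):

/* Demoting the elevel makes the checks report each fault and limp on
 * instead of stopping at the first one. */
#define CORRUPTION	ERROR		/* default: stop at the first problem */
/* #define CORRUPTION	WARNING		-- to limp on instead */

/* each check site then reports through it, for example: */
ereport(CORRUPTION,
		(errcode(ERRCODE_INDEX_CORRUPTED),
		 errmsg("item order invariant violated for index \"%s\"",
				RelationGetRelationName(rel))));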

-- 
Peter Geoghegan




Re: [HACKERS] 2016-03 Commitfest

2016-03-11 Thread David Steele
We are now one-third of the way through the 2016-03 Commitfest.

There are still some patches left that need review but have no reviewer
(https://commitfest.postgresql.org/9/?reviewer=-2) though a lot have
been picked up in the last week.

Needs review: 56
Needs *reviewer*: 15 (was 58 last week)

There were a number of patches that were marked as "needs review" but
were actually "waiting for author" as far as I could see.  If your patch
is in this state it would be good if you could give an idea when you
will be able to supply a new patch or otherwise address concerns raised
in the thread.

Waiting on Author: 29

The closed patches are up significantly since last week and the good
news is that most of them were committed.  36% of the patches are now
closed.

Committed: 46
Rejected: 4
Returned with Feedback: 4

I will continue to respond individually to threads that seem idle or
have issues that need to be resolved.  Please do not hesitate to contact
me about any concerns that you might have regarding the Commitfest
process.

-- 
-David
da...@pgmasters.net



signature.asc
Description: OpenPGP digital signature


Re: [HACKERS] Relation extension scalability

2016-03-11 Thread Jim Nasby

On 3/11/16 5:14 PM, Petr Jelinek wrote:

I don't really understand this part about concurrent DDL.  If there
were concurrent DDL going on, presumably other backends would be
blocked on the relation lock, not the relation extension lock - and it
doesn't seem likely that you'd often have a huge pile-up of inserters
waiting on concurrent DDL.  But I guess it could happen.



Yeah I was thinking about the latter part and as I said it's very rare
case, but I did see something similar couple of times in the wild. It's
not objection against committing this patch though, in fact I think it
can be committed as is.


FWIW, this is definitely a real possibility in any shop that has very 
high downtime costs and high transaction rates.


I also think some kind of clamp is a good idea. It's not that uncommon 
to run max_connections significantly higher than 100, so the extension 
could be way larger than 16MB. In those cases this patch could actually 
make things far worse as everyone backs up waiting on the OS to extend 
many MB when all you actually needed were a couple dozen more pages.


BTW, how was *20 arrived at? ISTM that if you have a lot of concurrent 
demand for extension that means you're running lots of small DML 
operations, not really big ones. I'd think that would make *1 more 
appropriate.

--
Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX
Experts in Analytics, Data Architecture and PostgreSQL
Data in Trouble? Get it in Treble! http://BlueTreble.com




Re: [HACKERS] PREPARE dynamic SQL in plpgsql

2016-03-11 Thread David G. Johnston
On Fri, Mar 11, 2016 at 4:45 PM, Koichi Suzuki  wrote:

> Hi,
>
> Does someone know how to prepare a dynamic SQL statement in plpgsql?
>
> All the examples in the PG documents describe only preparing static SQL
> statements.
>
>
This is not an appropriate question for the -hackers list.  It is better
asked on either -general or, more correctly, -novice.

It is also somewhat lacking in detail - including an example of what you've
tried or read would be a nice touch.

David J.


Re: [HACKERS] Perl's newSViv() versus 64-bit ints?

2016-03-11 Thread Tom Lane
I wrote:
> Anybody know what will happen when passing a uint64 to newSViv()?

Oh, and how about Python's PyInt_FromLong()?
Or PyList_New()?
Or the second argument of PyList_SetItem()?

regards, tom lane




Re: [HACKERS] PREPARE dynamic SQL in plpgsql

2016-03-11 Thread David G. Johnston
On Fri, Mar 11, 2016 at 4:45 PM, Koichi Suzuki  wrote:

> Hi,
>
> Does someone know how to prepare a dynamic SQL statement in plpgsql?
>
> All the examples in the PG documents describe only preparing static SQL
> statements.
>
>
You might want to rephrase the question.  From the pl/pgsql documentation:

"Note: The PL/pgSQL EXECUTE statement is not related to the EXECUTE SQL
statement supported by the PostgreSQL server. The server's EXECUTE
statement cannot be used directly within PL/pgSQL functions (and is not
needed)."

http://www.postgresql.org/docs/9.5/interactive/plpgsql-statements.html#PLPGSQL-STATEMENTS-EXECUTING-DYN

So even if you could SQL-PREPARE you wouldn't be able to execute it.

The linked section describes how to construct and execute dynamic queries
in pl/pgsql.  The short answer is that you create a variable of type text
and then EXECUTE it:

stmt := format('SELECT * FROM %I', in_tbl_name);
EXECUTE stmt [...]

David J.


Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Jim Nasby

On 3/11/16 3:31 PM, Peter Geoghegan wrote:

> Can we come up with names that more clearly identify the difference
> between those two functions? I mean, _parent_ does not make it
> particularly obvious that the second function acquires exclusive lock
> and performs more thorough checks.

Dunno about that. Its defining characteristic is that it checks child
pages against their parent IMV. Things are not often defined in terms
of their locking requirements.


First, thanks for your work on this. I've wanted it in the past.

I agree the name isn't very clear. Perhaps _recurse?

I also agree that the module name isn't very clear. If this is meant to 
be the start of a generic consistency checker, let's call it that. 
Otherwise, it should be marked as being specific to btrees, because 
presumably we might eventually want similar tools for GIN, etc. (FWIW 
I'd vote for a general consistency checker).


I know the vacuum race condition would be very rare, but I don't think 
it can be ignored. Last thing you want out of a consistency checker is 
false negatives/positives. I do think it would be reasonable to just 
wholesale block against concurrent vacuums, but I don't think there's 
any reasonable way to do that.


I would prefer the ability to do something other than raising an error 
when corruption is found, so that you could find all corruption in an 
index. Obviously could log to a different level. Another option would be 
SRFs that return info about all the corruption found, but that's 
probably overkill.


It'd be nice if you had the option to obey vacuum_cost_delay when 
running this, but that's clearly just a nice-to-have (or maybe just obey 
it all the time, since it defaults to 0).

--
Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX
Experts in Analytics, Data Architecture and PostgreSQL
Data in Trouble? Get it in Treble! http://BlueTreble.com




[HACKERS] Perl's newSViv() versus 64-bit ints?

2016-03-11 Thread Tom Lane
Anybody know what will happen when passing a uint64 to newSViv()?

regards, tom lane




Re: [HACKERS] Background Processes and reporting

2016-03-11 Thread Andres Freund
On 2016-03-12 02:24:33 +0300, Alexander Korotkov wrote:
> Idea of individual time measurement of every wait event met criticism
> because it might have high overhead [1].

Right. And that's actually one of the points I meant by "didn't
listen to criticism". There've been a lot of examples, on and off list,
where taking timings triggers significant slowdowns.  Yes, in some
bare-metal environments, with a coherent TSC, the overhead can be
low. But that doesn't make it OK to have a high overhead on a lot of
other systems.

Just claiming that that's not a problem will only lead to your position
not being taken seriously.


> This is really so at least for Windows [2].

Measuring timing overhead for a simplistic workload on a single system
doesn't show that.  Try doing such a test on a VMware ESX virtualized
Windows machine, on a multi-socket server; in a lot of instances you'll
see two to three orders of magnitude longer average times, with peaks going
into 4-5 orders of magnitude.  And, sad as it is, realistically most
postgres instances will run in virtualized environments.


> But accessing only current values wouldn't be very useful.  We
> anyway need to gather some statistics.  Gathering it by sampling would be
> both more expensive and less accurate for majority of systems.  This is why
> I proposed hooks to make possible platform dependent extensions.  Robert
> rejects hook because he is "not a big fan of hooks as a way of resolving
> disagreements about the design" [3].

I think I agree with Robert here. Providing hooks into very low level
places tends to lead to problems in my experience; tight control over
what happens is often important - I certainly don't want any external
code to run while we're waiting for an lwlock.


> Besides, those are actually not design issues but platform issues...

I don't see how that's the case.


> Another question is wait parameters.  We want to expose wait events with
> some parameters.  Robert rejects that because it *might* add additional
> overhead [3]. When I proposed to fit something useful into the hard-won
> 4 bytes, Robert claims that it is "too clever" [4].

I think stopping to treat this as "Robert/EDB vs. pgpro" would be a good
first step to make progress here.


It seems entirely possible to extend the current API in an incremental
fashion, either allowing the individual pieces to be disabled, or providing
sufficient measurements that disabling them isn't needed.


> So, the situation looks like a dead end.  I have no idea how to convince
> Robert to accept any kind of advanced wait monitoring functionality in
> PostgreSQL.  I'm thinking about implementing a sampling extension over the
> current infrastructure just to make the community see that it sucks. Andres,
> it would be very nice if you have any idea how to move this situation forward.

I've had my share of conflicts with Robert. But if I were in his shoes,
targeted by this kind of rhetoric, I'd be very tempted to just ignore
any further arguments from the origin.  So I think the way forward is
for everyone to cool off, and to see how we can incrementally make
progress from here on.


> Another aspect is that EnterpriseDB offers waits monitoring in its
> proprietary fork [5].

So?

Greetings,

Andres Freund




[HACKERS] PREPARE dynamic SQL in plpgsql

2016-03-11 Thread Koichi Suzuki
Hi,

Does someone know how to prepare a dynamic SQL statement in PL/pgSQL?

All the examples in the PG documentation describe only preparing static SQL
statements.
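
(For reference, a minimal hedged sketch of the usual approach: PL/pgSQL runs
dynamic SQL through EXECUTE, with format() for identifiers and USING for
parameter values; the table and column names here are made up.)

    CREATE OR REPLACE FUNCTION count_big_ids(tbl regclass, min_id int)
    RETURNS bigint
    LANGUAGE plpgsql AS $$
    DECLARE
        result bigint;
    BEGIN
        EXECUTE format('SELECT count(*) FROM %s WHERE id > $1', tbl)
           INTO result
          USING min_id;
        RETURN result;
    END;
    $$;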

Thank you;
--
Koichi Suzuki


Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Jim Nasby

On 3/10/16 8:36 PM, Robert Haas wrote:

1. We make it true only for heavyweight lock waits, and false for
other kinds of waits.  That's pretty strange.
2. We make it true for all kinds of waits that we now know how to
report.  That still breaks compatibility.


I would absolutely vote for 2 here. You could even argue that it's a bug 
fix, since those were waits we technically should have been indicating.


The only way I can see #2 breaking anything is if you're using 
waiting=true to determine whether you look at pg_locks and your code 
will blow up if you get no rows back, but that seems like a pretty 
limited use case to me (Hello, LEFT JOIN).
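
(A hedged sketch of such a monitoring query, with a LEFT JOIN so a waiting
backend without a matching pg_locks row still shows up:)

    SELECT a.pid, a.query, l.locktype, l.relation
      FROM pg_stat_activity a
      LEFT JOIN pg_locks l ON l.pid = a.pid AND NOT l.granted
     WHERE a.waiting;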


Dropping the column entirely though would break tons of things.

Another random thought... changes like this would probably be easier to 
handle if we provided backwards compatibility extensions that created 
views that mimicked the catalog for a specific Postgres version.
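
(A hedged sketch of that idea, assuming the new wait_event_type column; the
schema name is invented for illustration:)

    CREATE SCHEMA compat95;
    CREATE VIEW compat95.pg_stat_activity AS
    SELECT pid, usename, state, query,
           (wait_event_type IS NOT NULL) AS waiting
      FROM pg_stat_activity;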

--
Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX
Experts in Analytics, Data Architecture and PostgreSQL
Data in Trouble? Get it in Treble! http://BlueTreble.com




Re: [HACKERS] WIP: Upper planner pathification

2016-03-11 Thread Andres Freund
On 2016-03-10 23:38:14 -0500, Tom Lane wrote:
> > Would you rather add back the exports or should I?
> 
> I'll do it ... just send me the list.

After exporting make_agg, make_limit, make_sort_from_sortclauses and
making some trivial adjustments due to the pull_var_clause changes,
Citus' tests pass on 9.6, bar some noise.  Pathification made
some plans switch from hash-agg to sort-agg, and the other way round;
but that's obviously ok.

Thanks,

Andres




Re: [HACKERS] Background Processes and reporting

2016-03-11 Thread Alexander Korotkov
On Sat, Mar 12, 2016 at 12:22 AM, Andres Freund  wrote:

> On 2016-03-11 23:53:15 +0300, Vladimir Borodin wrote:
> > It was many times stated in threads about waits monitoring [0, 1, 2]
> > and supported by different people, but ultimately waits information
> > was stored in PgBackendStatus.
>
> Only that it isn't. It's stored in PGPROC.


Yes, and this is good. Robert's original concept of a single-byte wait field
in PgBackendStatus looks incompatible with any further development of wait
monitoring.


> This criticism is true of
> the progress reporting patch, but a quick scan of the thread doesn't
> show authors of the wait events patch participating there.
>
>
> > Can’t we think one more time about the implementation provided by Ildus
> > and Alexander here [3]?
>
> I don't think so. Afaics the proposed patch tried to do too many things
> at once, and its authors didn't listen well to criticism.


The original patch really did too many things at once, but it was good as a
prototype.  We should move forward by splitting it into many smaller parts,
but we didn't because of disagreement about the design.

The idea of individually timing every wait event met criticism because it
might have high overhead [1].  This is really so, at least on Windows [2].
But accessing only current values wouldn't be very useful.  We need to gather
some statistics anyway.  Gathering them by sampling would be both more
expensive and less accurate on the majority of systems.  This is why I
proposed hooks to make platform-dependent extensions possible.  Robert
rejects hooks because he is "not a big fan of hooks as a way of resolving
disagreements about the design" [3].  Besides, those are actually not design
issues but platform issues...

Another question is wait parameters.  We want to expose wait events with
some parameters.  Robert rejects that because it *might* add additional
overhead [3]. When I proposed to fit something useful into the hard-won
4 bytes, Robert claims that it is "too clever" [4].

So, the situation looks like a dead end.  I have no idea how to convince
Robert to accept any kind of advanced wait monitoring functionality in
PostgreSQL.  I'm thinking about implementing a sampling extension over the
current infrastructure just to make the community see that it sucks. Andres,
it would be very nice if you have any idea how to move this situation forward.

Another aspect is that EnterpriseDB offers waits monitoring in its
proprietary fork [5].

> SQL Session/System wait tuning diagnostics
> The Dynamic Runtime Instrumentation Tools Architecture (DRITA) allows a
> DBA to query catalog views to determine the wait events that affect the
> performance of individual sessions or the system as a whole. DRITA records
> the *number of times each event occurs as well as the time spent waiting*;
> you can use this information to diagnose performance problems.

And they claim to measure the time of individual wait events.  This is
exactly the thing which Robert resists so much.  So, I suspect some kind of
hidden motivation here.  Probably, the EDB guys just don't want to lose a
major advantage of their proprietary product over open-source PostgreSQL.

1.
http://www.postgresql.org/message-id/ca+tgmobdc4fbfpa49j6ok-ikazyvhw71vssotpvmf4tdyzr...@mail.gmail.com
2.
http://www.postgresql.org/message-id/capphfdvflevfetzgjprnrexfliyrksgkdu98xo5xx0hw5gu...@mail.gmail.com
3.
http://www.postgresql.org/message-id/ca+tgmoz-8zpoum9bgtbup1u4duqhc-9epedlzyk0dg4pkmd...@mail.gmail.com
4.
http://www.postgresql.org/message-id/ca+tgmobyf2vchghvoks7yrxa2wde+wmmrmxyughvx_w-0wx...@mail.gmail.com
5.
http://www.enterprisedb.com/products-services-training/products/postgres-plus-advanced-server?quicktabs_advanceservertab=3#quicktabs-advanceservertab

--
Alexander Korotkov
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company


Re: [HACKERS] pl/pgSQL, get diagnostics and big data

2016-03-11 Thread Tom Lane
"Andreas 'ads' Scherbaum"  writes:
> On 09.02.2016 20:32, Christian Ullrich wrote:
>> To fix this, I think it will be enough to change the format strings to
>> use "%zu" instead of "%lu".

> Attached is a new version of the patch, with %lu replaced by %zu.

Nonono ... that just moves the portability problem to a different set of
platforms.  "%z" means size_t, and sizeof(size_t) is not any more fixed
than sizeof(long).  The right thing to use is UINT64_FORMAT.
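
(Illustrative sketch of the point: UINT64_FORMAT expands to the correct
conversion specifier for uint64 on each platform, so a call site looks like
this; the variable and value are hypothetical:)

    char   buf[64];
    uint64 processed = 12345678901234;   /* hypothetical counter */

    snprintf(buf, sizeof(buf), UINT64_FORMAT " rows processed", processed);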

+  /* Int64GetDatum() instead of UInt64GetDatum(),
+     because there is no UInt64GetDatum() */
+  Int64GetDatum(estate->eval_processed),

Although in practice you'd get the same conversion anyway, I'm thinking
it's time to fix that omission rather than just averting our eyes.  The
use of int64 data isn't decreasing as time goes on.

These are small enough changes that there's no need for a new patch
submission.  I'll take it from here, unless I find bigger issues.

regards, tom lane




Re: [HACKERS] Relation extension scalability

2016-03-11 Thread Petr Jelinek

On 11/03/16 22:29, Robert Haas wrote:

On Thu, Mar 10, 2016 at 8:54 PM, Petr Jelinek  wrote:

I am not talking about extension locks; the lock queue can be long because
there is concurrent DDL, for example, and then once the DDL finishes, suddenly
100 connections that tried to insert into the table will try to get the
extension lock, and this will add 2000 new pages when far fewer were actually
needed. I guess that's fine, as it's a corner case and it's only 16MB even in
such an extreme case.


I don't really understand this part about concurrent DDL.  If there
were concurrent DDL going on, presumably other backends would be
blocked on the relation lock, not the relation extension lock - and it
doesn't seem likely that you'd often have a huge pile-up of inserters
waiting on concurrent DDL.  But I guess it could happen.



Yeah, I was thinking about the latter part, and as I said it's a very rare 
case, but I did see something similar a couple of times in the wild. It's 
not an objection against committing this patch though; in fact I think it 
can be committed as is.


--
  Petr Jelinek  http://www.2ndQuadrant.com/
  PostgreSQL Development, 24x7 Support, Training & Services




Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Peter Geoghegan
On Fri, Mar 11, 2016 at 8:19 AM, Anastasia Lubennikova
 wrote:
> I do hope that my patch will be accepted in 9.6, so this conflict looks
> really bad.

I hope so too. I'll have to look into this issue.

> I think that error is caused by changes in page layout. To save some space,
> nonkey attributes are truncated when btree copies the indexed value into
> inner pages or into the high key. You can look at the index_reform_tuple()
> calls.

That seems like the kind of problem that would be expected when things
like that change. I think it's going to be hard to add new B-Tree
features without a tool like this, which was a big reason to work on
it; to make these new projects possible to test and review. I see many
opportunities to improve the B-Tree code, just as I imagine you do.
These projects are quite strategically important, because B-Trees are
so frequently used. I think that Postgres B-Trees produce excessive
random I/O for checkpoints for various reasons, and this disadvantages
Postgres when it is compared to other major systems. As things get
better in other areas of Postgres, these hard problems become more
important to solve.

> I wonder, whether you can add some check of catalog version to your module
> or this case requires more changes?

I think that it's just going to be tied to the Postgres version. So,
if your B-Tree patches are committed first, it's on me to make sure
they're handled correctly. Or vice-versa. Not worried that that will
be a problem.

We already take special steps to avoid the "minus infinity" item on
internal pages. I think that in the future, if Postgres B-Trees get
suffix truncation for internal page items, there are new problems for
amcheck (suffix truncation removes unneeded information from internal
page items, sometimes greatly increasing B-Tree fan-out; internal page
items need only be sufficient to guide index scans correctly).

Specifically, with suffix truncation there might be "minus infinity"
*attributes*, too (these could make it safe to completely remove
attributes/columns past the first distinguishing/distinct attribute on
each item on internal pages). That's a case that amcheck then needs to
care about, just like it currently cares about the existing concept of
minus infinity items. That's how it goes for amcheck.

-- 
Peter Geoghegan




Re: [HACKERS] Performance improvement for joins where outer side is unique

2016-03-11 Thread Tom Lane
I wrote:
> I wondered why, instead of inventing an extra semantics-modifying flag,
> we couldn't just change the jointype to *be* JOIN_SEMI when we've
> discovered that the inner side is unique.

BTW, to clarify: I'm not imagining that we'd make this change in the
query jointree, as for example prepjointree.c might do.  That would appear
to add join order constraints, which we don't want.  But I'm thinking that
at the instant where we form a join Path, we could change the Path's
jointype to be JOIN_SEMI or JOIN_SEMI_OUTER if we're able to prove the
inner side unique, rather than annotating the Path with a separate flag.
Then that representation is what propagates forward.

It seems like the major intellectual complexity here is to figure out
how to detect inner-side-unique at reasonable cost.  I see that for
LEFT joins you're caching that in the SpecialJoinInfos, which is probably
fine.  But for INNER joins it looks like you're just doing it over again
for every candidate join, and that seems mighty expensive.

regards, tom lane




Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Pavel Stehule
2016-03-11 23:22 GMT+01:00 Joel Jacobson :

> On Sat, Mar 12, 2016 at 5:09 AM, Pavel Stehule 
> wrote:
> >> What we need is more input on proposed changes from other companies
> >> who are also heavy users of PL/pgSQL.
> >>
> >> Only then can we move forward. It's like Robert is saying, there is a
> >> risk for bikeshedding here,
> >> we must widen our perspectives and get better understanding for how
> >> other heavy users are using PL/pgSQL.
> >
> >
> > I disagree with this opinion - this is community software, not commercial.
> > We can do nothing if we don't find an agreement.
>
> I disagree with your disagreement.
>
> The users are what matters, and many of them are of course not on this
> list (since this is a list for hackers), so we need to reach out to
> the users, and those are companies/websites/nonprofits/governments.
> So discussing proposed changes on this list will take us absolutely
> nowhere, without further input from actual heavy users.
> Once we do have input from the heavy users, then and only then can we
> continue discussing things on this list, but before then it's kind of
> pointless, because we don't know what the most commonly proposed
> changes are, not you, not me. The risk of bikeshedding is just too
> big, like Robert pointed out.
>

I sent a list of requested features. Really, I have not had any request from
the companies I worked for, trained, or consulted with for less verbosity or
significant changes in the language. The people miss features from Oracle and
MSSQL.


>
> >> Pavel, do you know of any such companies?
> > Probably the biggest company with a pretty large PL/pgSQL code base was
> > Skype, but I don't have any info about its current state.
>
> True! I had almost forgotten about them after Microsoft acquired them.
> Let's hope they are still on PostgreSQL. I'll check it out, thanks.
>


Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Joel Jacobson
On Sat, Mar 12, 2016 at 5:09 AM, Pavel Stehule  wrote:
>> What we need is more input on proposed changes from other companies
>> who are also heavy users of PL/pgSQL.
>>
>> Only then can we move forward. It's like Robert is saying, there is a
>> risk for bikeshedding here,
>> we must widen our perspectives and get better understanding for how
>> other heavy users are using PL/pgSQL.
>
>
> I disagree with this opinion - this is community software, not commercial. We
> can do nothing if we don't find an agreement.

I disagree with your disagreement.

The users are what matters, and many of them are of course not on this
list (since this is a list for hackers), so we need to reach out to
the users, and those are companies/websites/nonprofits/governments.
So discussing proposed changes on this list will take us absolutely
nowhere, without further input from actual heavy users.
Once we do have input from the heavy users, then and only then can we
continue discussing things on this list, but before then it's kind of
pointless, because we don't know what the most commonly proposed
changes are, not you, not me. The risk of bikeshedding is just too
big, like Robert pointed out.

>> Pavel, do you know of any such companies?
> Probably the biggest company with a pretty large PL/pgSQL code base was
> Skype, but I don't have any info about its current state.

True! I had almost forgotten about them after Microsoft acquired them.
Let's hope they are still on PostgreSQL. I'll check it out, thanks.




Re: [HACKERS] Background Processes and reporting

2016-03-11 Thread Andres Freund
On 2016-03-12 01:05:43 +0300, Vladimir Borodin wrote:
> > 12 марта 2016 г., в 0:22, Andres Freund  написал(а):
> > Only that it isn't. It's stored in PGPROC.  
> 
> Sorry, I missed that. So monitoring of wait events for auxiliary processes
> could still be implemented?

It's basically a question of where to report the information.

> >> Seems that the current implementation doesn’t give reasonable ways to
> >> implement all those features, and it is really sad.
> > 
> > Why is that?
> 
> Storing information about a wait event in 4 bytes gives the ability to
> store only the wait type and event. There is no way to store duration or
> extra information (e.g. the buffer number for I/O events or buffer manager
> LWLocks). Maybe I’m missing something...

Sure, but that's just incrementally building features?


Greetings,

Andres Freund




Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Pavel Stehule
2016-03-11 22:48 GMT+01:00 Joel Jacobson :

> On Sat, Mar 12, 2016 at 4:41 AM, Pavel Stehule 
> wrote:
> > I'm afraid you are trying to treat your use case as a global/generic
> > issue. PL/SQL, Ada, and PL/pgSQL are verbose languages, and too many
> > shortcuts make a language dirty. On this point we have different opinions.
> >
> > I proposed an enhanced PL/pgSQL API with the possibility to create an
> > extension that can enforce your requested behavior. The implementation
> > should not be hard, and it can cover some special/individual requests well.
>
> I'm not at all interested to discuss any of the proposed changes that
> have already been proposed,
> because we have already had lengthy discussions on them, and I doubt
> neither you nor me have nothing to add.
>
> What we need is more input on proposed changes from other companies
> who are also heavy users of PL/pgSQL.
>
> Only then can we move forward. It's like Robert is saying, there is a
> risk for bikeshedding here,
> we must widen our perspectives and get better understanding for how
> other heavy users are using PL/pgSQL.
>

I disagree with this opinion - this is community software, not commercial. We
can do nothing if we don't find an agreement.


>
> Pavel, do you know of any such companies?
>

I know companies with pretty heavy PostgreSQL loads and critical
applications, but with only tens of thousands of lines of PL/pgSQL. They have
only one request - stability. I talked with Oleg and with other people - and
the common requests are

* session (global) variables
* global temp tables
* better checking of embedded SQL
* usual possibility to specify left part of assign statement

But these requests depend on development style - they are related to
classical usage of stored procedures - and people are interested in them
because they are porting from DB2 or Oracle to Postgres.

Probably the biggest company with a pretty large PL/pgSQL code base was
Skype, but I don't have any info about its current state.

Regards

Pavel


Re: [HACKERS] Background Processes and reporting

2016-03-11 Thread Vladimir Borodin

> 12 марта 2016 г., в 0:22, Andres Freund  написал(а):
> 
> On 2016-03-11 23:53:15 +0300, Vladimir Borodin wrote:
>> It was many times stated in threads about waits monitoring [0, 1, 2]
>> and supported by different people, but ultimately waits information
>> was stored in PgBackendStatus.
> 
> Only that it isn't. It's stored in PGPROC.  

Sorry, I missed that. So monitoring of wait events for auxiliary processes
could still be implemented?

> This criticism is true of
> the progress reporting patch, but a quick scan of the thread doesn't
> show authors of the wait events patch participating there.
> 
> 
>> Can’t we think one more time about the implementation provided by Ildus
>> and Alexander here [3]?
> 
> I don't think so. Afaics the proposed patch tried to do too many things
> at once, and its authors didn't listen well to criticism.  Trying to go
> back to that seems like a surefire way to have nothing in 9.6.

The idea is not to try to implement all of that at once (much less in 9.6) but 
to give the ability to implement all those features eventually. If that is 
still possible, it’s great.

> 
> 
>> Seems that the current implementation doesn’t give reasonable ways to
>> implement all those features, and it is really sad.
> 
> Why is that?

Storing information about a wait event in 4 bytes gives the ability to store 
only the wait type and event. There is no way to store duration or extra 
information (e.g. the buffer number for I/O events or buffer manager LWLocks). 
Maybe I’m missing something...
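
(A hedged sketch of the arithmetic behind this point; the layout below is
illustrative only, not the actual PostgreSQL encoding:)

    /* 8 bits of wait class plus 24 bits of event id fill all 32 bits,
     * leaving no room for parameters such as a buffer number. */
    #define WAIT_CLASS(info)     ((uint32) (info) >> 24)
    #define WAIT_EVENT(info)     ((uint32) (info) & 0x00FFFFFF)
    #define MAKE_WAIT_INFO(c, e) (((uint32) (c) << 24) | ((e) & 0x00FFFFFF))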

> 
> 
> Andres Freund


--
May the force be with you…
https://simply.name



Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Joel Jacobson
On Sat, Mar 12, 2016 at 4:48 AM, Joel Jacobson  wrote:
> neither you nor me have nothing to add.

Correction: neither you nor me have anything to add.




Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Joel Jacobson
On Sat, Mar 12, 2016 at 4:41 AM, Pavel Stehule  wrote:
> I'm afraid you are trying to treat your use case as a global/generic issue.
> PL/SQL, Ada, and PL/pgSQL are verbose languages, and too many shortcuts make
> a language dirty. On this point we have different opinions.
>
> I proposed an enhanced PL/pgSQL API with the possibility to create an
> extension that can enforce your requested behavior. The implementation
> should not be hard, and it can cover some special/individual requests well.

I'm not at all interested to discuss any of the proposed changes that
have already been proposed,
because we have already had lengthy discussions on them, and I doubt
neither you nor me have nothing to add.

What we need is more input on proposed changes from other companies
who are also heavy users of PL/pgSQL.

Only then can we move forward. It's like Robert is saying, there is a
risk for bikeshedding here,
we must widen our perspectives and get better understanding for how
other heavy users are using PL/pgSQL.

Pavel, do you know of any such companies?




Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Peter Geoghegan
On Fri, Mar 11, 2016 at 1:31 PM, Peter Geoghegan  wrote:
> You could have a race, where
> there was a concurrent page deletion of the left sibling of the child
> page, then a concurrent insertion into the newly expanded keyspace of
> the parent. Therefore, the downlink in the parent (which is the
> "target", to use the patch's terminology) would not be a lower bound
> on items in the page.

Excuse me: I meant the newly expanded keyspace of the *child*. (The
parent's keyspace would have covered everything. It's naturally far
larger than either child's keyspace, since it typically has several
hundred pages.)


-- 
Peter Geoghegan




Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Pavel Stehule
2016-03-11 22:32 GMT+01:00 Joel Jacobson :

> On Sat, Mar 12, 2016 at 4:08 AM, Robert Haas 
> wrote:
> >
> > I don't think my experience in this area is as deep as you seem to
> > think.  I can tell you that most of the requests EnterpriseDB gets for
> > PL/pgsql enhancements involve wanting it to be more like Oracle's
> > PL/SQL, which of course has very little overlap with the stuff that
> > you're interested in.
>
> Do you know who could possibly be more experienced
> with companies who are heavy users of PL/pgSQL in the community?
>
> and/or,
>
> Do you know of any companies who officially are heavy users of PL/pgSQL?
>
> The only other company I can think of is Zalando, but of course there
> are many more,
> I just wish I knew their names, because I want to compile a wish list with
> proposed changes from as many companies who are heavy users of
> PL/pgSQL as possible.
>
> That's the only way to push this forward. As you say, we need a
> consensus and input
> from a broad range of heavy users, not just from people on this list
> with feelings
> and opinions who might not actually be heavy users themselves.
>
> Of course almost everybody on this list uses PL/pgSQL from time to
> time or even daily, but it's a completely different thing to write an
> entire backend system in the language; it's only then that you start to
> become really excited about e.g. not having to type at least 30 characters
> of text every time you do an UPDATE/INSERT just to be sure you modified
> exactly one row.
>

I'm afraid you are trying to treat your use case as a global/generic issue.
PL/SQL, Ada, and PL/pgSQL are verbose languages, and too many shortcuts make
a language dirty. On this point we have different opinions.

I proposed an enhanced PL/pgSQL API with the possibility to create an
extension that can enforce your requested behavior. The implementation
should not be hard, and it can cover some special/individual requests well.

Regards

Pavel


Re: [HACKERS] Background Processes and reporting

2016-03-11 Thread Andres Freund

Hi,

On 2016-03-11 11:16:32 -0800, Andres Freund wrote:
> It seems rather worthwhile to think about how we can expand the coverage
> of progress tracking to other types of background processes.

WRT the progress reporting patch, I think we should split off (as afaics was
discussed in the thread for a while) the new part of PgBackendStatus
into its own structure.

That'd not just allow using this from non-backend processes, but would
also have the advantage that the normal PgBackendStatus' changecount
doesn't advance quite so rapidly. E.g. when reporting progress of a
vacuum, the changecount will probably change at quite a rapid rate, but
that's uninteresting for e.g. pg_stat_activity.
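
(A hedged sketch of that split; the struct and field names here are invented
for illustration, not taken from the patch:)

    typedef struct PgBackendProgress
    {
        int     st_changecount;        /* bumped before and after each update */
        Oid     st_progress_command;   /* which command is reporting */
        int64   st_progress_param[10]; /* command-specific counters */
    } PgBackendProgress;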


> Similarly for the wait event stuff - checkpointer, wal writer,
> background writer are in many cases processes that very often are
> blocked on locks, IO and such.  Thus restricting the facility to
> database connected processes seems like a loss.

I think one way to address this would be to not only report
PgBackendStatus type processes in pg_stat_activity. While that'd
obviously be a compatibility break, I think it'd be an improvement.

Regards,

Andres




Re: [HACKERS] Proposal: BSD Authentication support

2016-03-11 Thread Thomas Munro
On Sat, Mar 12, 2016 at 5:14 AM, David Steele  wrote:
> On 1/14/16 11:22 PM, Robert Haas wrote:
>> On Tue, Jan 12, 2016 at 2:27 AM, Marisa Emerson  wrote:
>>> I've attached the latest version of this patch. I've fixed up an issue with
>>> the configuration scripts that I missed.
>> Looks reasonable on a quick read-through.  Can anyone with access to a
>> BSD system review and test?
>
> Is anyone with access to/experience with BSD able to review and test
> this patch?  Seems like it would make a great addition to 9.6.

It looks like this needs review from an OpenBSD user specifically.
FreeBSD and NetBSD use PAM instead of BSD auth.

-- 
Thomas Munro
http://www.enterprisedb.com




Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Pavel Stehule
2016-03-11 22:08 GMT+01:00 Robert Haas :

> On Fri, Mar 11, 2016 at 3:44 PM, Joel Jacobson  wrote:
> > On Fri, Mar 11, 2016 at 11:14 AM, Robert Haas 
> wrote:
> >> I'm not direly opposed to most of what's on that page,
> >> but I'm not excited about most of it, either.
> >
> > May I ask, what improvements of PL/pgSQL would you personally be most
> > excited about,
> > if you or someone else would have unlimited resources to hack on it?
> >
> >> I bet if we canvassed 10 different companies that made heavy use of
> PL/pgsql they'd all have
> >> a list of proposed changes like that, and I bet some of them would
> >> conflict with each other, and I bet if we did all that stuff the
> >> average PL/pgsql user's life would not be much better, but the manual
> >> would be much longer.
> >
> > You as a professional PostgreSQL consultant obviously have a lot of more
> > contact than me with other companies who make heavy use of PL/pgSQL.
> >
> > I'm assuming your bet on these proposed changes in conflict you talk
> about
> > are based on things you've picked up IRL from companies you've been
> > working with.
> >
> > What would you say are the top most commonly proposed changes
> > from companies that make heavy use of PL/pgSQL, and which of those are
> > in conflict?
>
> I don't think my experience in this area is as deep as you seem to
> think.  I can tell you that most of the requests EnterpriseDB gets for
> PL/pgsql enhancements involve wanting it to be more like Oracle's
> PL/SQL, which of course has very little overlap with the stuff that
> you're interested in.  But I'm not really commenting here based on
> that.  I'm just giving you my impression based on the discussion I've
> seen on the mailing list and my own personal feelings.  If there is an
> outcry for STRICT as you have proposed it, I'm not especially opposed
> to that.  I just think it needs a consensus that I haven't seen
> emerge.
>

This proposal is not bad from my perspective - but in detail these points
break compatibility, can have negative performance impacts, or are ugly. I
understand all the points, but I am not sure if the benefits outweigh the
costs (compatibility issues).

Can we talk about these points in a separate thread? I can imagine a few of
the points as extra checks. This is probably a theme for 9.6, so we can
discuss it in the 9.6 release cycle.

Regards

Pavel

>
> --
> Robert Haas
> EnterpriseDB: http://www.enterprisedb.com
> The Enterprise PostgreSQL Company
>


Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Joel Jacobson
On Sat, Mar 12, 2016 at 4:08 AM, Robert Haas  wrote:
>
> I don't think my experience in this area is as deep as you seem to
> think.  I can tell you that most of the requests EnterpriseDB gets for
> PL/pgsql enhancements involve wanting it to be more like Oracle's
> PL/SQL, which of course has very little overlap with the stuff that
> you're interested in.

Do you know who could possibly be more experienced
with companies who are heavy users of PL/pgSQL in the community?

and/or,

Do you know of any companies who officially are heavy users of PL/pgSQL?

The only other company I can think of is Zalando, but of course there
are many more,
I just wish I knew their names, because I want to compile a wish list with
proposed changes from as many companies who are heavy users of
PL/pgSQL as possible.

That's the only way to push this forward. As you say, we need a
consensus and input
from a broad range of heavy users, not just from people on this list
with feelings
and opinions who might not actually be heavy users themselves.

Of course almost everybody on this list uses PL/pgSQL from time to
time or even daily, but it's a completely different thing to write an
entire backend system in the language; it's only then that you start to
become really excited about e.g. not having to type at least 30 characters
of text every time you do an UPDATE/INSERT just to be sure you modified
exactly one row.
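
(A hedged sketch of the boilerplate being described; the table is
hypothetical:)

    DO $$
    DECLARE
        n bigint;
    BEGIN
        UPDATE accounts SET balance = balance - 100 WHERE id = 42;
        GET DIAGNOSTICS n = ROW_COUNT;
        IF n <> 1 THEN
            RAISE EXCEPTION 'expected exactly 1 row, got %', n;
        END IF;
    END
    $$;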




Re: [HACKERS] Performance improvement for joins where outer side is unique

2016-03-11 Thread Tom Lane
So I started re-reading this thread in preparation for looking at the
patch, and this bit in your initial message jumped out at me:

> In all of our join algorithms in the executor, if the join type is SEMI,
> we skip to the next outer row once we find a matching inner row. This is
> because we don't want to allow duplicate rows in the inner side to
> duplicate outer rows in the result set. Obviously this is required per SQL
> spec. I believe we can also skip to the next outer row in this case when
> we've managed to prove that no other row can possibly exist that matches
> the current outer row, due to a unique index or group by/distinct clause
> (for subqueries).
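
(A hedged SQL illustration of that case: with a primary key on the inner
side, each outer row can match at most one inner row, so scanning for further
matches is provably useless; the tables are made up.)

    CREATE TABLE orders    (customer_id int);
    CREATE TABLE customers (customer_id int PRIMARY KEY);

    SELECT *
      FROM orders o
      JOIN customers c ON c.customer_id = o.customer_id;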

I wondered why, instead of inventing an extra semantics-modifying flag,
we couldn't just change the jointype to *be* JOIN_SEMI when we've
discovered that the inner side is unique.

Now of course this only works if the join type was INNER to start with.
If it was a LEFT join, you'd need an "outer semi join" jointype which
we haven't got at the moment.  But I wonder whether inventing that
jointype wouldn't let us arrive at a less messy handling of things in
the executor and EXPLAIN.  I'm not very enamored of plastering this
"match_first_tuple_only" flag on every join, in part because it doesn't
appear to have sensible semantics for other jointypes such as JOIN_RIGHT.
And I'd really be happier to see the information reflected by join type
than a new line in EXPLAIN, also.

regards, tom lane




Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Peter Geoghegan
On Thu, Mar 10, 2016 at 9:18 AM, Tomas Vondra
 wrote:
> I've looked at this patch today, mostly to educate myself, so this
> probably should not count as a full review. Anyway, the patch seems in
> excellent shape - it'd be great if all patches (including those written
> by me) had this level of comments/docs.

Thanks. I try. This patch is something I've been working on slowly and
steadily for over 2 years. It was time to see it through.

>> Note that no function currently checks that the index is consistent
>> with the heap, which would be very useful (that's probably how I'd
>> eventually target the heapam, actually).
>
> I'm afraid I don't understand what "target the heapam" means. Could you
> elaborate?

Just that that's how I'd make amcheck verify that the heap was sane,
if I was going to undertake that project. Or, I'd start there.

> I agree. It'd be nice to have a tool that also checks for data
> corruption at a lower level (e.g. that varlena headers are not
> corrupted etc.) but that seems like a task for some other tool.

I'm not sure that that's a task for another tool. I think that this
tool is scoped at detecting corruption, and that could work well for
the heap, too. There might be fewer interesting things to test about
the heap when indexes aren't involved. Following HOT chains through an
index, and verifying things like that about the heap as we go could
detect a lot of problematic cases.

> Can we come up with names that more clearly identify the difference
> between those two functions? I mean, _parent_ does not make it
> particularly obvious that the second function acquires exclusive lock
> and performs more thorough checks.

Dunno about that. Its defining characteristic is that it checks child
pages against their parent, IMV. Things are not often defined in terms
of their locking requirements.

> This also applies to the name of the contrib module - it's not
> particularly obvious what "amcheck" means unless you're familiar with the
> concept of access methods. Which is quite unlikely for regular users.
> Maybe something like "check_index" would be more illustrative?

I think that given the heap may be targeted in the future, amcheck is
more future-proof. I think Robert might have liked that name a year
ago or something. In general, I'm not too worried about what the
contrib module is called in the end.

> Well, I wouldn't count myself as an expert here, but do we actually need
> such protection? I mean, we only do such checks when holding an
> exclusive lock on the parent relation, no? And even if we don't vacuum
> can only remove entries from the pages - why should that cause
> violations of any invariants?

I think that a more worked out explanation for why the ExclusiveLock
is needed is appropriate. I meant to do that.

Basically, a heavier lock is needed because of page deletion by
VACUUM, which is the major source of complexity (much more than page
splits, I'd say). In general, the key space can be consolidated by
VACUUM in a way that breaks child page checking because the downlink
we followed from our target page is no longer the current downlink.
Page deletion deletes the right sibling's downlink, and the deleted
page's downlink is used for its sibling. You could have a race, where
there was a concurrent page deletion of the left sibling of the child
page, then a concurrent insertion into the newly expanded keyspace of
the parent. Therefore, the downlink in the parent (which is the
"target", to use the patch's terminology) would not be a lower bound
on items in the page.

That's a very low probability race, because it involves deleting a
page representing a range in the keyspace that has no live items, but
then immediately getting one just in time for our check. But, I'm
pretty determined that false positives like that need to be impossible
(or else it's a bug).

I have a paranoid feeling that there is a similar very low probability
race with the left-right keyspace checks, which don't have relation
ExclusiveLock protection (IOW, I think that that might be buggy). I
need to think about that some more, but my current thinking is that it
would hardly matter if we used the high key from the right page rather than
the first data item, which definitely would be correct. And it would
be simpler.

> A few minor comments:

Thanks for catching those issues. Will fix.

-- 
Peter Geoghegan




Re: [HACKERS] Relation extension scalability

2016-03-11 Thread Robert Haas
On Thu, Mar 10, 2016 at 8:54 PM, Petr Jelinek  wrote:
> I am not talking about extension locks; the lock queue can be long because
> there is concurrent DDL, for example, and then once the DDL finishes, suddenly
> 100 connections that tried to insert into the table will try to get the
> extension lock, and this will add 2000 new pages when far fewer were actually
> needed. I guess that's fine, as it's a corner case and it's only 16MB even in
> such an extreme case.

I don't really understand this part about concurrent DDL.  If there
were concurrent DDL going on, presumably other backends would be
blocked on the relation lock, not the relation extension lock - and it
doesn't seem likely that you'd often have a huge pile-up of inserters
waiting on concurrent DDL.  But I guess it could happen.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company




Re: [HACKERS] Background Processes and reporting

2016-03-11 Thread Andres Freund
On 2016-03-11 23:53:15 +0300, Vladimir Borodin wrote:
> It was many times stated in threads about waits monitoring [0, 1, 2]
> and supported by different people, but ultimately waits information
> was stored in PgBackendStatus.

Only that it isn't. It's stored in PGPROC.  This criticism is true of
the progress reporting patch, but a quick scan of the thread doesn't
show authors of the wait events patch participating there.


> Can’t we think one more time about the implementation provided by Ildus
> and Alexander here [3]?

I don't think so. Afaics the proposed patch tried to do too many things
at once, and its authors didn't listen well to criticism.  Trying to go
back to that seems like a surefire way to have nothing in 9.6.


> Seems that the current implementation doesn’t give reasonable ways to
> implement all those features, and it is really sad.

Why is that?


Andres Freund




Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Peter Geoghegan
On Fri, Mar 11, 2016 at 1:09 PM, Peter Geoghegan  wrote:
> Or, you could add code like this to comparetup_index_btree(), to
> simulate a broken opclass:
>
> diff --git a/src/backend/utils/sort/tuplesort.c b/src/backend/utils/sort/tuplesort.c
> index 67d86ed..23712ff 100644
> --- a/src/backend/utils/sort/tuplesort.c
> +++ b/src/backend/utils/sort/tuplesort.c
> @@ -3562,6 +3562,9 @@ comparetup_index_btree(const SortTuple *a, const SortTuple *b,
>      compare = ApplySortComparator(a->datum1, a->isnull1,
>                                    b->datum1, b->isnull1,
>                                    sortKey);
> +
> +    if (random() <= (MAX_RANDOM_VALUE / 1000))
> +        compare = -compare;
>      if (compare != 0)
>          return compare;


Note that this patch that I sketched would make CREATE INDEX produce
corrupt indexes, but the tool's verification itself would not be
affected. Although even if it was (even if _bt_compare() gave the same
wrong answers), it would still very likely detect corruption.

-- 
Peter Geoghegan




Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Peter Geoghegan
On Fri, Mar 11, 2016 at 8:24 AM, Michael Paquier
 wrote:
> You can use for example dd in non-truncate mode to corrupt on-disk
> page data, say that for example:
> dd if=/dev/random bs=8192 count=1 \
> seek=$BLOCK_ID of=base/$DBOID/$RELFILENODE \
> conv=notrunc

Sure, but that would probably fail at the first hurdle -- the page
header would be corrupt. Which is a valid test, but not all that
interesting.

One testing workflow I tried is overwriting some page in a B-Tree
relfilenode with some other page in the same file:

$:~/pgdata/base/12413$ dd if=somefile of=somefile conv=notrunc bs=8192
count=1 skip=2 seek=3

That should fail due to the key space not being in order across pages,
which is slightly interesting. Or, you could selectively change one
item with a hex editor, as Anastasia did.

Or, you could add code like this to comparetup_index_btree(), to
simulate a broken opclass:

diff --git a/src/backend/utils/sort/tuplesort.c b/src/backend/utils/sort/tuplesort.c
index 67d86ed..23712ff 100644
--- a/src/backend/utils/sort/tuplesort.c
+++ b/src/backend/utils/sort/tuplesort.c
@@ -3562,6 +3562,9 @@ comparetup_index_btree(const SortTuple *a, const SortTuple *b,
     compare = ApplySortComparator(a->datum1, a->isnull1,
                                   b->datum1, b->isnull1,
                                   sortKey);
+
+    if (random() <= (MAX_RANDOM_VALUE / 1000))
+        compare = -compare;
     if (compare != 0)
         return compare;

There are many options when you want to produce a corrupt B-Tree index!

-- 
Peter Geoghegan




Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Robert Haas
On Fri, Mar 11, 2016 at 3:44 PM, Joel Jacobson  wrote:
> On Fri, Mar 11, 2016 at 11:14 AM, Robert Haas  wrote:
>> I'm not direly opposed to most of what's on that page,
>> but I'm not excited about most of it, either.
>
> May I ask, what improvements of PL/pgSQL would you personally be most
> excited about,
> if you or someone else would have unlimited resources to hack on it?
>
>> I bet if we canvassed 10 different companies that made heavy use of PL/pgsql 
>> they'd all have
>> a list of proposed changes like that, and I bet some of them would
>> conflict with each other, and I bet if we did all that stuff the
>> average PL/pgsql user's life would not be much better, but the manual
>> would be much longer.
>
> You as a professional PostgreSQL consultant obviously have a lot of more
> contact than me with other companies who make heavy use of PL/pgSQL.
>
> I'm assuming your bet on these proposed changes in conflict you talk about
> are based on things you've picked up IRL from companies you've been
> working with.
>
> What would you say are the top most commonly proposed changes
> from companies that make heavy use of PL/pgSQL, and which of those are
> in conflict?

I don't think my experience in this area is as deep as you seem to
think.  I can tell you that most of the requests EnterpriseDB gets for
PL/pgsql enhancements involve wanting it to be more like Oracle's
PL/SQL, which of course has very little overlap with the stuff that
you're interested in.  But I'm not really commenting here based on
that.  I'm just giving you my impression based on the discussion I've
seen on the mailing list and my own personal feelings.  If there is an
outcry for STRICT as you have proposed it, I'm not especially opposed
to that.  I just think it needs a consensus that I haven't seen
emerge.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company




Re: [HACKERS] Background Processes and reporting

2016-03-11 Thread Vladimir Borodin

> 11 марта 2016 г., в 22:16, Andres Freund  написал(а):
> 
> Hi,
> 
> We now have "Provide much better wait information in pg_stat_activity"
> and "Add a generic command progress reporting facility" making it easier
> to provide insight into the system.
> 
> 
> While working on the writeback control / checkpoint sorting patch I'd
> the following statement in BufferSync()'s main loop:
> 
>    fprintf(stderr, "\33[2K\rto_scan: %d, scanned: %d, %%processed: %.2f, %%writeouts: %.2f",
>            num_to_scan, num_processed,
>            (((double) num_processed) / num_to_scan) * 100,
>            ((double) num_written / num_processed) * 100);
> 
> which basically printed the progress of a checkpoint, and some
> additional detail to stderr. Quite helpful to see whether progress is
> "unsteady".
> 
> Obviously that's not something that could be committed.
> 
> So I'm wondering how we can make it possible to use the aforementioned
> "progress reporting facility" to monitor checkpoint progress. To which
> Robert replied on IM:
> "it wouldn't quite help with that because the checkpointer doesn't show
> up as a regular backend"
> 
> 
> It seems rather worthwhile to think about how we can expand the coverage
> of progress tracking to other types of background processes.
> 
> 
> Similarly for the wait event stuff - checkpointer, wal writer,
> background writer are in many cases processes that very often are
> blocked on locks, IO and such.  Thus restricting the facility to
> database connected processes seems like a loss.

It was many times stated in threads about waits monitoring [0, 1, 2] and 
supported by different people, but ultimately waits information was stored in 
PgBackendStatus. Can’t we think one more time about the implementation provided 
by Ildus and Alexander here [3]? That implementation included 1. waits 
monitoring for all processes, 2. the ability to trace waits of a particular 
process to a file, 3. a wait events history with sampling every N ms, and 4. 
configurable measurement of wait timings. It has many more features, has been 
used in production on 100+ databases (patched 9.4), and gives wide 
opportunities for further development. Of course, huge work would be needed to 
rebase it onto current master and clean it up, but IMHO it is a much better 
approach. Seems that the current implementation doesn’t give reasonable ways to 
implement all those features, and it is really sad.

[0] http://www.postgresql.org/message-id/55a3acc3.6020...@postgrespro.ru
[1] http://www.postgresql.org/message-id/55c3306b.5010...@postgrespro.ru
[2] 
http://www.postgresql.org/message-id/9a99c2a7-1760-419f-bdc9-a2cf99ecd...@simply.name
[3] http://www.postgresql.org/message-id/559d4729.9080...@postgrespro.ru

> 
> 
> Greetings,
> 
> Andres Freund
> 
> 


--
May the force be with you…
https://simply.name



Re: [HACKERS] [COMMITTERS] pgsql: Provide much better wait information in pg_stat_activity.

2016-03-11 Thread Joel Jacobson
On Fri, Mar 11, 2016 at 11:14 AM, Robert Haas  wrote:
> I'm not direly opposed to most of what's on that page,
> but I'm not excited about most of it, either.

May I ask, what improvements of PL/pgSQL would you personally be most
excited about,
if you or someone else would have unlimited resources to hack on it?

> I bet if we canvassed 10 different companies that made heavy use of PL/pgsql 
> they'd all have
> a list of proposed changes like that, and I bet some of them would
> conflict with each other, and I bet if we did all that stuff the
> average PL/pgsql user's life would not be much better, but the manual
> would be much longer.

You as a professional PostgreSQL consultant obviously have a lot of more
contact than me with other companies who make heavy use of PL/pgSQL.

I'm assuming your bet on these proposed changes in conflict you talk about
are based on things you've picked up IRL from companies you've been
working with.

What would you say are the top most commonly proposed changes
from companies that make heavy use of PL/pgSQL, and which of those are
in conflict?




Re: [HACKERS] eXtensible Transaction Manager API (v2)

2016-03-11 Thread Tom Lane
Robert Haas  writes:
> On Fri, Mar 11, 2016 at 1:11 PM, David Steele  wrote:
>> Is anyone willing to volunteer a review or make an argument for the
>> importance of this patch?

> There's been a lot of discussion on another thread about this patch.
> The subject is "The plan for FDW-based sharding", but the thread kind
> of got partially hijacked by this issue.  The net-net of that is that
> I don't think we have a clear enough idea about where we're going with
> global transaction management to make it a good idea to adopt an API
> like this.  For example, if we later decide we want to put the
> functionality in core, will we keep the hooks around for the sake of
> alternative non-core implementations?  I just don't believe this
> technology is nearly mature enough to commit to at this point.
> Konstantin does not agree with my assessment, perhaps unsurprisingly.

I re-read the original thread,

http://www.postgresql.org/message-id/flat/f2766b97-555d-424f-b29f-e0ca0f6d1...@postgrespro.ru

I think there is no question that this is an entirely immature patch.
Not coping with subtransactions is alone sufficient to make it not
credible for production.

Even if the extension API were complete and clearly stable, I have doubts
that there's any great value in integrating it into 9.6, rather than some
later release series.  The above thread makes it clear that pg_dtm is very
much WIP and has easily a year's worth of work before anybody would think
of wanting to deploy it.  So end users don't need this patch in 9.6, and
developers working on pg_dtm shouldn't really have much of a problem
applying the patch locally --- how likely is it that they'd be using a
perfectly stock build of the database apart from this patch?

But my real takeaway from that thread is that there's no great reason
to believe that this API definition *is* stable.  The single existing
use-case is very far from completion, and it's hardly unlikely that
what it needs will change.


I also took a very quick look at the patch itself:

1. No documentation.  For something that purports to be an API
specification, really the documentation should have been written *first*.

2. As noted in the cited thread, it's not clear that
Get/SetTransactionStatus are a useful cutpoint; they don't provide any
real atomicity guarantees.

3. Uh, how can you hook GetNewTransactionId but not ReadNewTransactionId?

4. There seems to be an intention to encapsulate snapshots, but surely
wrapping hooks around GetSnapshotData and XidInMVCCSnapshot is not nearly
enough for that.  Look at all the knowledge snapmgr.c has about snapshot
representation, for example.  And is a function like GetOldestXmin even
meaningful with a different notion of what snapshots are?  (For that
matter, is TransactionId == uint32 still tenable for any other notion
of snapshots?)

5. BTW, why would you hook at XidInMVCCSnapshot rather than making use of
the existing capability to have a custom SnapshotSatisfiesFunc snapshot
checker function?


IMO this is not committable as-is, and I don't think that it's something
that will become committable during this 'fest.  I think we'd be well
advised to boot it to the 2016-09 CF and focus our efforts on other stuff
that has a better chance of getting finished this month.

regards, tom lane




Re: [HACKERS] [PATH] Correct negative/zero year in to_date/to_timestamp

2016-03-11 Thread Vitaly Burovoy
On 3/11/16, Tom Lane  wrote:
> [ catches up with thread... ]
>
> Yes.  It would be more reasonable IMO for to_date to throw an error
> because this is bad input.  On the other hand, to_date mostly doesn't
> throw an error no matter how bad the input is.  I think that may have
> been intentional, although its habit of producing garbage output
> instead (and not, say, NULL) is certainly not very happy-making.
>
> It's a bit schizophrenic for this patch to be both adding ereport's
> for year zero (thereby breaking the no-failure-on-bad-input policy)
> *and* trying to produce sane output for arguably-insane input.
>
> I don't really see an argument why '0001-00-00' should be accepted
> but '0000-01-01' should throw an error, but that would be the result
> if we take this patch.

Well, in the case of a zero year it could return the first year instead of
raising an exception, the same way "MM" and "DD" do it...

> And I quite agree with Robert that it's insane
> to consider '-2-06-01' as satisfying the format 'YYYY-MM-DD'.  The
> fact that it even appears to do something related to a BC year is
> an implementation artifact, and not a very nice one.
>
> I would be in favor of a ground-up rewrite of to_date and friends, with
> some better-stated principles (in particular, a rationale why they even
> exist when date_in and friends usually do it better)

I think they exist because date_in can't convert something like
"IYYY-IDDD" (I wonder whether date_in could do so), or parse
dates/timestamps independently of the "DateStyle" parameter.
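
For example (output from memory, so treat it as approximate):

postgres=# SELECT to_date('2016-001', 'IYYY-IDDD');
  to_date
------------
 2016-01-04
(1 row)

i.e. the first day of ISO year 2016, which date_in has no way to spell.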

> and crisper error
> detection.  But I'm not seeing the argument that hacking at the margins
> like this moves us forward on either point; what it does do is create
> another backward-compatibility hazard for any such rewrite.
>
> In short, I vote with Robert to reject this patch.

Accepted. Let's agree it is a case of "garbage in, garbage out" and "an
implementation artifact".

> BTW, the context for the original report wasn't clear,

The context was to make "extract" and "to_date"/"to_timestamp" be
consistently reversible for "year"/"YYYY" for negative values (since
both of them support them).

> but I wonder how
> much of the actual problem could be addressed by teaching make_date()
> and friends to accept negative year values as meaning BC.
>
>   regards, tom lane

Thanks to Thomas, Robert and Tom very much for an interesting (but short)
discussion.
-- 
Best regards,
Vitaly Burovoy




[HACKERS] Saving SRF context

2016-03-11 Thread Salvador Fandiño

Hi,

I have implemented a SRF[*] that returns rows one by one (using
ExprMultipleResult). But now I need to save some context information
between calls pertaining to the same result set, and I am unable to find
a proper place for it.


I have seen that fcinfo->flinfo->fn_extra is available, but if I
understand the mechanics of FmgrInfo correctly, that structure is not
unique per call frame. It could be shared between several fcinfo
structures and would be clobbered on recursion (I plan to use SPI from my
function).


So, is that right? Should I build a stack there? Is there any other
place to store context information relative to a SRF call frame?
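
For reference, the closest thing I have found is the stock FuncCallContext
pattern from funcapi.h, which keeps per-result-set state in user_fctx
(though the wrapper itself also hangs off fn_extra, hence my doubt). A
minimal, self-contained sketch with a hypothetical function, not my
plswipl code:

#include "postgres.h"
#include "fmgr.h"
#include "funcapi.h"

PG_FUNCTION_INFO_V1(three_ints);

/* Returns the integers 1..3, one row per call. */
Datum
three_ints(PG_FUNCTION_ARGS)
{
	FuncCallContext *funcctx;
	int		   *counter;

	if (SRF_IS_FIRSTCALL())
	{
		funcctx = SRF_FIRSTCALL_INIT();
		/* per-result-set state must live in multi_call_memory_ctx */
		funcctx->user_fctx =
			MemoryContextAllocZero(funcctx->multi_call_memory_ctx,
								   sizeof(int));
	}

	funcctx = SRF_PERCALL_SETUP();
	counter = (int *) funcctx->user_fctx;

	if (*counter < 3)
	{
		(*counter)++;
		SRF_RETURN_NEXT(funcctx, Int32GetDatum(*counter));
	}

	SRF_RETURN_DONE(funcctx);
}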


Regards


* Code is here: https://github.com/salva/plswipl, I am trying to provide 
a procedural language using SWI-Prolog.





Re: [HACKERS] Change error code for hstore syntax error

2016-03-11 Thread Tom Lane
Sherrylyn Branchaw  writes:
> The hstore module uses elog() to default to ERRCODE_INTERNAL_ERROR
> (SQLSTATE XX000) when the error message reads "Syntax error near '%c' at
> position %d".

Yeah, that is entirely bogus.  No user-facing error report should ever
return ERRCODE_INTERNAL_ERROR.

> I propose to switch to ereport() to return ERRCODE_SYNTAX_ERROR (SQLSTATE
> 42601), on the grounds that it's more transparent.

Hm, class 42 is generally meant for SQL-level syntax errors.  Are these
errors not coming from subroutines of hstore_in()?  I think our usual
convention is to use ERRCODE_INVALID_TEXT_REPRESENTATION for complaints
that a data value does not meet its type's conventions.  In any case
it seems closer to class 22 than 42.

While you're at it, please make the error message texts conform to
our style guidelines:
http://www.postgresql.org/docs/devel/static/error-style-guide.html

These seem to have multiple problems, starting with incorrect
capitalization and extending to failure to quote the whole string
being complained of.
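
For illustration only, not a prescription of the final wording, a report
per those guidelines with the class-22 code might look about like this:

ereport(ERROR,
        (errcode(ERRCODE_INVALID_TEXT_REPRESENTATION),
         errmsg("syntax error in hstore, near \"%c\" at position %d",
                *(state->ptr), (int32) (state->ptr - state->begin))));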

In short, though, +1 for improving this stuff.

regards, tom lane




[HACKERS] Change error code for hstore syntax error

2016-03-11 Thread Sherrylyn Branchaw
The hstore module uses elog() to default to ERRCODE_INTERNAL_ERROR
(SQLSTATE XX000) when the error message reads "Syntax error near '%c' at
position %d".

I propose to switch to ereport() to return ERRCODE_SYNTAX_ERROR (SQLSTATE
42601), on the grounds that it's more transparent. It took me longer to
figure out what error code was being returned than to write both the patch
and my originally intended function using the patch.

I also propose to raise the same error code for "Unexpected end of string"
in hstore, since it serves the same purpose, at least in my mind:
differentiating between valid and invalid hstore syntax.

Any objections to my submitting the attached patch to the September
commitfest?
diff --git a/contrib/hstore/hstore_io.c b/contrib/hstore/hstore_io.c
index 0c1d99a..0cbc865 100644
--- a/contrib/hstore/hstore_io.c
+++ b/contrib/hstore/hstore_io.c
@@ -81,7 +81,10 @@ get_val(HSParser *state, bool ignoreeq, bool *escaped)
 		}
 		else if (*(state->ptr) == '=' && !ignoreeq)
 		{
-			elog(ERROR, "Syntax error near '%c' at position %d", *(state->ptr), (int32) (state->ptr - state->begin));
+			ereport(ERROR,
+					(errcode(ERRCODE_SYNTAX_ERROR),
+					 (errmsg("Syntax error near '%c' at position %d",
+							 *(state->ptr), (int32) (state->ptr - state->begin)))));
 		}
 		else if (*(state->ptr) == '\\')
 		{
@@ -138,7 +141,9 @@ get_val(HSParser *state, bool ignoreeq, bool *escaped)
 		}
 		else if (*(state->ptr) == '\0')
 		{
-			elog(ERROR, "Unexpected end of string");
+			ereport(ERROR,
+					(errcode(ERRCODE_SYNTAX_ERROR),
+					 (errmsg("Unexpected end of string"))));
 		}
 		else
 		{
@@ -150,7 +155,9 @@ get_val(HSParser *state, bool ignoreeq, bool *escaped)
 	else if (st == GV_WAITESCIN)
 	{
 		if (*(state->ptr) == '\0')
-			elog(ERROR, "Unexpected end of string");
+			ereport(ERROR,
+					(errcode(ERRCODE_SYNTAX_ERROR),
+					 (errmsg("Unexpected end of string"))));
 		RESIZEPRSBUF;
 		*(state->cur) = *(state->ptr);
 		state->cur++;
@@ -159,7 +166,9 @@ get_val(HSParser *state, bool ignoreeq, bool *escaped)
 	else if (st == GV_WAITESCESCIN)
 	{
 		if (*(state->ptr) == '\0')
-			elog(ERROR, "Unexpected end of string");
+			ereport(ERROR,
+					(errcode(ERRCODE_SYNTAX_ERROR),
+					 (errmsg("Unexpected end of string"))));
 		RESIZEPRSBUF;
 		*(state->cur) = *(state->ptr);
 		state->cur++;
@@ -216,11 +225,16 @@ parse_hstore(HSParser *state)
 			}
 			else if (*(state->ptr) == '\0')
 			{
-				elog(ERROR, "Unexpected end of string");
+				ereport(ERROR,
+						(errcode(ERRCODE_SYNTAX_ERROR),
+						 (errmsg("Unexpected end of string"))));
 			}
 			else if (!isspace((unsigned char) *(state->ptr)))
 			{
-				elog(ERROR, "Syntax error near '%c' at position %d", *(state->ptr), (int32) (state->ptr - state->begin));
+				ereport(ERROR,
+						(errcode(ERRCODE_SYNTAX_ERROR),
+						 (errmsg("Syntax error near '%c' at position %d",
+								 *(state->ptr), (int32) (state->ptr - state->begin)))));
 			}
 		}
 		else if (st == WGT)
@@ -231,17 +245,24 @@ parse_hstore(HSParser *state)
 			}
 			else if (*(state->ptr) == '\0')
 			{
-				elog(ERROR, "Unexpected end of string");
+				ereport(ERROR,
+						(errcode(ERRCODE_SYNTAX_ERROR),
+						 (errmsg("Unexpected end of string"))));

Re: [HACKERS] [PATH] Correct negative/zero year in to_date/to_timestamp

2016-03-11 Thread Tom Lane
Robert Haas  writes:
> On Sun, Feb 28, 2016 at 9:38 PM, Vitaly Burovoy
>  wrote:
>>> However, I'm not sure we ought to tinker with the behavior in this
>>> area.  If YYYY-MM-DD is going to accept things that are not of the
>>> format YYYY-MM-DD, and I'd argue that -1-06-01 is not in that format,

>> It is not about format, it is about values.

> I disagree.  In a format like "-1-06-01", you want the first minus to
> indicate negation and the other two to be a separator.  That's not
> very far away from wanting the database to read your mind.

[ catches up with thread... ]

Yes.  It would be more reasonable IMO for to_date to throw an error
because this is bad input.  On the other hand, to_date mostly doesn't
throw an error no matter how bad the input is.  I think that may have
been intentional, although its habit of producing garbage output
instead (and not, say, NULL) is certainly not very happy-making.

It's a bit schizophrenic for this patch to be both adding ereport's
for year zero (thereby breaking the no-failure-on-bad-input policy)
*and* trying to produce sane output for arguably-insane input.

I don't really see an argument why '0001-00-00' should be accepted
but '0000-01-01' should throw an error, but that would be the result
if we take this patch.  And I quite agree with Robert that it's insane
to consider '-2-06-01' as satisfying the format 'YYYY-MM-DD'.  The
fact that it even appears to do something related to a BC year is
an implementation artifact, and not a very nice one.

I would be in favor of a ground-up rewrite of to_date and friends, with
some better-stated principles (in particular, a rationale why they even
exist when date_in and friends usually do it better) and crisper error
detection.  But I'm not seeing the argument that hacking at the margins
like this moves us forward on either point; what it does do is create
another backward-compatibility hazard for any such rewrite.

In short, I vote with Robert to reject this patch.

BTW, the context for the original report wasn't clear, but I wonder how
much of the actual problem could be addressed by teaching make_date()
and friends to accept negative year values as meaning BC.
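
Hypothetically, something like:

SELECT make_date(-44, 3, 15);   -- would yield '0044-03-15 BC'

though to be clear, nothing does that today.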

regards, tom lane




[HACKERS] Background Processes and reporting

2016-03-11 Thread Andres Freund
Hi,

We now have "Provide much better wait information in pg_stat_activity"
and "Add a generic command progress reporting facility" making it easier
to provide insight into the system.


While working on the writeback control / checkpoint sorting patch I had
the following statement in BufferSync()'s main loop:

fprintf(stderr, "\33[2K\rto_scan: %d, scanned: %d, %%processed: %.2f, %%writeouts: %.2f",
        num_to_scan, num_processed,
        (((double) num_processed) / num_to_scan) * 100,
        ((double) num_written / num_processed) * 100);

which basically printed the progress of a checkpoint, and some
additional detail to stderr. Quite helpful to see whether progress is
"unsteady".

Obviously that's not something that could be committed.

So I'm wondering how we can make it possible to use the aforementioned
"progress reporting facility" to monitor checkpoint progress. To which
Robert replied on IM:
"it wouldn't quite help with that because the checkpointer doesn't show
up as a regular backend"


It seems rather worthwhile to think about how we can expand the coverage
of progress tracking to other types of background processes.


Similarly for the wait event stuff - checkpointer, wal writer,
background writer are in many cases processes that very often are
blocked on locks, IO and such.  Thus restricting the facility to
database connected processes seems like a loss.
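
Just to make it concrete: with the committed facility, backend code
publishes counters via pgstat_progress_update_param(), so hypothetically
(the parameter slots here are made up) BufferSync() could push the same
numbers the fprintf above prints:

pgstat_progress_update_param(0, num_to_scan);
pgstat_progress_update_param(1, num_processed);
pgstat_progress_update_param(2, num_written);

But as said, that doesn't work today because the checkpointer isn't a
regular backend.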


Greetings,

Andres Freund




Re: [HACKERS] insufficient qualification of some objects in dump files

2016-03-11 Thread David Steele
Hi Peter,

On 2/26/16 1:30 AM, Tom Lane wrote:
> Michael Paquier  writes:
>> On Fri, Feb 26, 2016 at 9:47 AM, Peter Eisentraut  wrote:
>>> Tom thought this might require an archive version dump, but I'm not
>>> sure.  The tags are more of an informational string for human
>>> consumption, not strictly part of the archive format.
> 
>> Hm, the TOC entry, with its tag changed, is part of the dump, and this
>> is written in the archive, but the shape of TocEntry does not change
>> so this is really debatable.
> 
> I had in mind that we would add a separate field for tag's schema name to
> TocEntry, which surely would require an archive format number bump.
> As the patch is presented, I agree with Peter that it does not really
> need a format number bump.  The question that has to be answered is
> whether this solution is good enough?  You could not trust it for
> automated processing of tags --- it's easy to think of cases in which the
> schema/object name separation would be ambiguous.  So is the tag really
> "strictly for human consumption"?  I'm not sure about that.

It looks like there is still some discussion to be had here about
whether a "human readable" solution is enough.

Until that's resolved I've marked this patch "Waiting on Author".

-- 
-David
da...@pgmasters.net





Re: [HACKERS] eXtensible Transaction Manager API (v2)

2016-03-11 Thread David Steele
On 3/11/16 2:00 PM, Oleg Bartunov wrote:
> On Fri, Mar 11, 2016 at 7:11 PM, David Steele wrote:
>> I'm concerned about the lack of response or reviewers for this patch.
>> It may be because everyone believes they had their say on the original
>> thread, or because it seems like a big change to go into the last CF, or
>> for other reasons altogether.
> 
> 
> We'll prepare an easy setup to play with our solutions, so any developer
> can see how it works.  We hope to post something about it this weekend.

OK, then for now I'm marking this "waiting for author."  You can switch
it back to "needs review" once you have posted additional material.

-- 
-David
da...@pgmasters.net





Re: [HACKERS] eXtensible Transaction Manager API (v2)

2016-03-11 Thread Oleg Bartunov
On Fri, Mar 11, 2016 at 7:11 PM, David Steele  wrote:

> On 2/10/16 12:50 PM, Konstantin Knizhnik wrote:
>
> > The Postgres Professional cluster team wants to propose a new version of
> > the eXtensible Transaction Manager API.
> > Previous discussion concerning this patch can be found here:
> >
> >
> http://www.postgresql.org/message-id/f2766b97-555d-424f-b29f-e0ca0f6d1...@postgrespro.ru
>
> I see a lot of discussion on this thread but little in the way of
> consensus.
>
> > The API patch itself is small enough, but we think that it will be
> > strange to provide just API without examples of its usage.
>
> It's not all that small, though it does apply cleanly even after a few
> months.  At least that indicates there is not a lot of churn in this area.
>
> I'm concerned about the lack of response or reviewers for this patch.
> It may be because everyone believes they had their say on the original
> thread, or because it seems like a big change to go into the last CF, or
> for other reasons altogether.
>

We'll prepare an easy setup to play with our solutions, so any developer
can see how it works.  We hope to post something about it this weekend.



>
> I think you should try to make it clear why this patch would be a win
> for 9.6.
>

It looks like the discussion shifted to a different thread; we'll answer here.



>
> Is anyone willing to volunteer a review or make an argument for the
> importance of this patch?
>
> --
> -David
> da...@pgmasters.net
>
>


Re: [HACKERS] proposal: make NOTIFY list de-duplication optional

2016-03-11 Thread David Steele
Hi Filip,

On 2/20/16 8:00 AM, Filip Rembiałkowski wrote:
> On Fri, Feb 19, 2016 at 10:09 PM, Catalin Iacob wrote:
>> On 2/9/16, Tom Lane wrote:
>>> FWIW, I think it would be a good thing if the NOTIFY statement syntax were
>>> not remarkably different from the syntax used in the pg_notify() function
>>> call.  To do otherwise would certainly be confusing.  So on the whole
>>> I'd go with the "NOTIFY channel [ , payload [ , mode ] ]" option.
>>
>> Filip, do you agree with Tom's proposal? Do you plan to rework the
>> patch on these lines? If you are I'll try to review it, if not I could
>> give it a shot as I'm interested in having this in 9.6.
>
> I see that Tom's remarks give more flexibility, and your refinement
> makes sense.

It looks like we are waiting on a new patch from you before this can be
reviewed.  Are you close to having that done?

Meanwhile, I have marked it "Waiting on Author".

-- 
-David
da...@pgmasters.net





Re: [HACKERS] eXtensible Transaction Manager API (v2)

2016-03-11 Thread David Steele
On 3/11/16 1:30 PM, Robert Haas wrote:

> There's been a lot of discussion on another thread about this patch.
> The subject is "The plan for FDW-based sharding", but the thread kind
> of got partially hijacked by this issue.  The net-net of that is that
> I don't think we have a clear enough idea about where we're going with
> global transaction management to make it a good idea to adopt an API
> like this.  For example, if we later decide we want to put the
> functionality in core, will we keep the hooks around for the sake of
> alternative non-core implementations?  I just don't believe this
> technology is nearly mature enough to commit to at this point.

Ah yes, I forgot about the related discussion on that thread.  Pasting
here for reference:

http://www.postgresql.org/message-id/20160223164335.ga11...@momjian.us

> Konstantin does not agree with my assessment, perhaps unsurprisingly.

I'm certainly no stranger to feeling strongly about a patch!

-- 
-David
da...@pgmasters.net





Re: [HACKERS] eXtensible Transaction Manager API (v2)

2016-03-11 Thread Robert Haas
On Fri, Mar 11, 2016 at 1:11 PM, David Steele  wrote:
> On 2/10/16 12:50 PM, Konstantin Knizhnik wrote:
>> The Postgres Professional cluster team wants to propose a new version of
>> the eXtensible Transaction Manager API.
>> Previous discussion concerning this patch can be found here:
>>
>> http://www.postgresql.org/message-id/f2766b97-555d-424f-b29f-e0ca0f6d1...@postgrespro.ru
>
> I see a lot of discussion on this thread but little in the way of consensus.
>
>> The API patch itself is small enough, but we think that it will be
>> strange to provide just API without examples of its usage.
>
> It's not all that small, though it does apply cleanly even after a few
> months.  At least that indicates there is not a lot of churn in this area.
>
> I'm concerned about the lack of response or reviewers for this patch.
> It may be because everyone believes they had their say on the original
> thread, or because it seems like a big change to go into the last CF, or
> for other reasons altogether.
>
> I think you should try to make it clear why this patch would be a win
> for 9.6.
>
> Is anyone willing to volunteer a review or make an argument for the
> importance of this patch?

There's been a lot of discussion on another thread about this patch.
The subject is "The plan for FDW-based sharding", but the thread kind
of got partially hijacked by this issue.  The net-net of that is that
I don't think we have a clear enough idea about where we're going with
global transaction management to make it a good idea to adopt an API
like this.  For example, if we later decide we want to put the
functionality in core, will we keep the hooks around for the sake of
alternative non-core implementations?  I just don't believe this
technology is nearly mature enough to commit to at this point.

Konstantin does not agree with my assessment, perhaps unsurprisingly.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company




Re: [HACKERS] remove wal_level archive

2016-03-11 Thread David Steele
On 2/8/16 2:34 PM, Alvaro Herrera wrote:
> Peter Eisentraut wrote:
>> On 1/26/16 10:56 AM, Simon Riggs wrote:
>>> Removing one of "archive" or "hot standby" will just cause confusion and
>>> breakage, so neither is a good choice for removal.
>>>
>>> What we should do is 
>>> 1. Map "archive" and "hot_standby" to one level with a new name that
>>> indicates that it can be used for both/either backup or replication.
>>>   (My suggested name for the new level is "replica"...)
>>> 2. Deprecate "archive" and "hot_standby" so that those will be removed
>>> in a later release.
>>
>> Updated patch to reflect these suggestions.
> 
> I wonder if the "keep one / keep both" argument is running in circles as
> new reviewers arrive at the thread.  Perhaps somebody could read the
> whole thread(s) and figure out a way to find consensus so that we move
> forward on this.

There was a lot of argument upstream about whether to keep 'hot_standby'
or 'archive' but after the proposal to change it to 'replica' came up
everybody seemed to fall in line with that.

+1 from me for using 'replica' as the WAL level to replace 'hot_standby'
and 'archive'.

+1 from me for removing the 'hot_standby' and 'archive' options entirely
in 9.6 rather than deprecating.
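
For illustration, that would leave a single spelling in postgresql.conf
(hypothetical until the rename is actually committed):

wal_level = replica		# replaces both 'archive' and 'hot_standby'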

Unless anyone has objections I would like to mark this 'ready for
committer'.

-- 
-David
da...@pgmasters.net





Re: [HACKERS] WIP: Access method extendability

2016-03-11 Thread Oleg Bartunov
On Wed, Mar 9, 2016 at 8:31 PM, Alexander Korotkov <a.korot...@postgrespro.ru> wrote:

> Hi!
>
> On Wed, Mar 9, 2016 at 3:27 PM, Alvaro Herrera wrote:
>
>> Hi.  As I just said to Tomas Vondra: since your patch creates a new
>> object type, please make sure to add a case to cover it in the
>> object_address.sql test.  That verifies some things such as
>> pg_identify_object and related.
>>
>
> Good catch, thanks! Tests were added.
> I also introduced numbering into patch names to make evident the order to
> their application.
>
>
Nice to see progress! I hope to see Alexander's work in 9.6.

Teodor and I will show at PGCon a new GIN AM as an extension, optimized for
full text search (millisecond FTS), which we gave up on pushing into core.


> --
> Alexander Korotkov
> Postgres Professional: http://www.postgrespro.com
> The Russian Postgres Company
>


Re: [HACKERS] Inconsistent error handling in START_REPLICATION command

2016-03-11 Thread David Steele
On 3/11/16 1:11 PM, Tom Lane wrote:
> Robert Haas  writes:
>> On Fri, Mar 11, 2016 at 11:36 AM, David Steele  wrote:
>>> It looks like a decision needs to be made here whether to apply this patch,
>>> send it back to the author, or reject it so I'm marking it "Ready for
>>> Committer".
>>>
>>> Robert, since you were participating in this conversation can you have a
>>> look?
> 
>> Who, me?  *looks around*
>>
>> OK, I had a look.  I don't think this is a bug.
> 
> I agree, and Craig's complaint that this change reduces flexibility
> for plugins seems to tilt the balance against it.  I see that Alex
> himself accepted that argument in his last message in the thread
> ().
>
> So let's mark it rejected and move on.

+1 and done.

-- 
-David
da...@pgmasters.net





Re: [HACKERS] Inconsistent error handling in START_REPLICATION command

2016-03-11 Thread Tom Lane
Robert Haas  writes:
> On Fri, Mar 11, 2016 at 11:36 AM, David Steele  wrote:
>> It looks like a decision needs to be made here whether to apply this patch,
>> send it back to the author, or reject it so I'm marking it "Ready for
>> Committer".
>> 
>> Robert, since you were participating in this conversation can you have a
>> look?

> Who, me?  *looks around*
> 
> OK, I had a look.  I don't think this is a bug.

I agree, and Craig's complaint that this change reduces flexibility
for plugins seems to tilt the balance against it.  I see that Alex
himself accepted that argument in his last message in the thread
().
So let's mark it rejected and move on.

> unexpected PQresultStatus: 8

It is a bit annoying that psql doesn't print "COPY_BOTH" here.
But fixing that is not this patch, either.

regards, tom lane




Re: [HACKERS] eXtensible Transaction Manager API (v2)

2016-03-11 Thread David Steele
On 2/10/16 12:50 PM, Konstantin Knizhnik wrote:

> The Postgres Professional cluster team wants to propose a new version of
> the eXtensible Transaction Manager API.
> Previous discussion concerning this patch can be found here:
> 
> http://www.postgresql.org/message-id/f2766b97-555d-424f-b29f-e0ca0f6d1...@postgrespro.ru

I see a lot of discussion on this thread but little in the way of consensus.

> The API patch itself is small enough, but we think that it will be
> strange to provide just API without examples of its usage.

It's not all that small, though it does apply cleanly even after a few
months.  At least that indicates there is not a lot of churn in this area.

I'm concerned about the lack of response or reviewers for this patch.
It may be because everyone believes they had their say on the original
thread, or because it seems like a big change to go into the last CF, or
for other reasons altogether.

I think you should try to make it clear why this patch would be a win
for 9.6.

Is anyone willing to volunteer a review or make an argument for the
importance of this patch?

-- 
-David
da...@pgmasters.net





Re: [HACKERS] Freeze avoidance of very large table.

2016-03-11 Thread Joshua D. Drake

On 03/11/2016 09:48 AM, Masahiko Sawada wrote:
> Thank you so much!
> What I wanted to deal with in this thread is almost done. I'm going to
> test the feature more for the 9.6 release.

Nicely done!





--
Command Prompt, Inc.  http://the.postgres.company/
+1-503-667-4564
PostgreSQL Centered full stack support, consulting and development.
Everyone appreciates your honesty, until you are honest with them.




Re: [HACKERS] GinPageIs* don't actually return a boolean

2016-03-11 Thread Yury Zhuravlev

Andres Freund wrote:
> I plan to commit something like this, unless there's very loud protest
> from Peter's side.

I agree. Peter's proposal can be considered in a separate thread.

Thanks.
--

Yury Zhuravlev
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company




Re: [HACKERS] Freeze avoidance of very large table.

2016-03-11 Thread Masahiko Sawada
On Sat, Mar 12, 2016 at 2:37 AM, Robert Haas  wrote:
> On Thu, Mar 10, 2016 at 10:47 PM, Masahiko Sawada  
> wrote:
>>> Thanks.  I adopted some of your suggestions, rejected others, fixed a
>>> few minor things that I missed previously, and committed this.  If you
>>> think any of the changes that I rejected still have merit, please
>>> resubmit those changes as separate patches.
>>
>> Thank you for your effort to this feature and committing it.
>> I guess that I couldn't do good work to this feature at final stage,
>> but I really appreciate all your advice and suggestion.
>
> Don't feel bad, you put a lot of work into this, and if you were getting
> a little tired towards the end, that's very understandable.  This
> extremely important feature was largely driven by you, and that's a
> big accomplishment.
>
>> I got your point.
>> Attached latest patch can skip to write the last part of last old page
>> if it's empty.
>> Please review it.
>
> Committed.
>
> Which I think just about brings us to the end of this epic journey,
> except for any cleanup of what's already been committed that needs to
> be done.  Thanks so much for your hard work!
>

Thank you so much!
What I wanted to deal with in this thread is almost done. I'm going to
test the feature more for the 9.6 release.

Regards,

--
Masahiko Sawada




Re: [HACKERS] Freeze avoidance of very large table.

2016-03-11 Thread Robert Haas
On Thu, Mar 10, 2016 at 10:47 PM, Masahiko Sawada  wrote:
>> Thanks.  I adopted some of your suggestions, rejected others, fixed a
>> few minor things that I missed previously, and committed this.  If you
>> think any of the changes that I rejected still have merit, please
>> resubmit those changes as separate patches.
>
> Thank you for your effort to this feature and committing it.
> I guess that I couldn't do good work to this feature at final stage,
> but I really appreciate all your advice and suggestion.

Don't feel bad, you put a lot of work into this, and if you were getting
a little tired towards the end, that's very understandable.  This
extremely important feature was largely driven by you, and that's a
big accomplishment.

> I got your point.
> Attached latest patch can skip to write the last part of last old page
> if it's empty.
> Please review it.

Committed.

Which I think just about brings us to the end of this epic journey,
except for any cleanup of what's already been committed that needs to
be done.  Thanks so much for your hard work!

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company




Re: [HACKERS] Combining Aggregates

2016-03-11 Thread David Steele
On 1/20/16 11:42 PM, David Rowley wrote:
> On 21 January 2016 at 08:06, Robert Haas wrote:
>> On Wed, Jan 20, 2016 at 7:38 AM, David Rowley wrote:
>>> Agreed. So I've attached a version of the patch which does not have any
>>> of the serialise/deserialise stuff in it.
>>
>> I re-reviewed this and have committed most of it with only minor
>> kibitzing.  A few notes:
>
> I've attached the re-based remainder, which includes the serial/deserial
> stuff again.
>
> I'll submit this part to the March 'fest, where hopefully we'll also have
> something to utilise it.

As far as I can tell this is the most recent version of the patch but it
doesn't apply on master.  I'm guessing that's mostly because of Tom's
UPP patch.

This is a long and confusing thread and I should know since I just read
through the entire thing.  I'm setting this back to "waiting for author"
and I'd like to see a rebased patch and a summary of what's in the patch
before setting it back to "needs review".

-- 
-David
da...@pgmasters.net





Re: [HACKERS] Tsvector editing functions

2016-03-11 Thread Teodor Sigaev

I saw errors on Windows; here is the fix:

Thank you, pushed

--
Teodor Sigaev   E-mail: teo...@sigaev.ru
   WWW: http://www.sigaev.ru/




Re: [HACKERS] [PATH] Correct negative/zero year in to_date/to_timestamp

2016-03-11 Thread Vitaly Burovoy
On 3/11/16, Robert Haas  wrote:
> On Sun, Feb 28, 2016 at 9:38 PM, Vitaly Burovoy
>  wrote:
>>> However, I'm not sure we ought to tinker with the behavior in this
>>> area.  If YYYY-MM-DD is going to accept things that are not of the
>>> format YYYY-MM-DD, and I'd argue that -1-06-01 is not in that format,
>>
>> It is not about format, it is about values.
>
> I disagree.  In a format like "-1-06-01", you want the first minus to
> indicate negation and the other two to be a separator.  That's not
> very far away from wanting the database to read your mind.

It is not my wish; the database already does it:
postgres=# SELECT to_date('-1-06-01', 'YYYY');
    to_date
---------------
 0002-01-01 BC
(1 row)

>> Because it is a little inconvenient. If one value ("-2345") is passed,
>> another one ("2346 BC") is got. In the other case a programmer must
>> check for negative value, and if so change a sign and add "BC" to the
>> format. Moreover the programmer must keep in mind that it is not
>> enough to have the usual date format "DD/MM/YYYY", because sometimes
>> there can be a "BC" part.
>
> Yeah, well, that's life.  You can write an alternative function to
> construct dates that works the way you like, and that may well be a
> good idea.  But I think *this* change is not a good idea, and
> accordingly I vote we reject this patch.

My wish is to make the behavior consistent.
Since there are two reverse functions ("extract" and "to_date"
["to_timestamp" in fact is the same]), I expect that what is described as
"year" ("year"-"YYYY") means the same thing in both of them, the same
with the pairs "isoyear"-"IYYY", "dow"-"DDD", "isodow"-"IDDD", etc.

Now "year" is _not_ the same as "YYYY" (though it can be so according to
the documentation: there is no mention of any ISO standard), whereas
"isoyear" _is_ the same:
postgres=# SELECT y, to_date(y, 'YYYY')YYYY,to_date(y, 'IYYY')IYYY
postgres-# FROM(VALUES('-1-06-01'))t(y);
    y     |     yyyy      |     iyyy
----------+---------------+---------------
 -1-06-01 | 0002-01-01 BC | 0002-01-01 BC
(1 row)

and
postgres=# SELECT y, date_part('year', y)YYYY,date_part('isoyear', y)IYYY
postgres-# FROM(VALUES('0002-06-01 BC'::date))t(y);
       y       | yyyy | iyyy
---------------+------+------
 0002-06-01 BC |   -2 |   -1
(1 row)

P.S.: the proposed patch changes IYYY as well, but that is easy to fix
and I'm ready to do it once a consensus is found.
-- 
Best regards,
Vitaly Burovoy




Re: [HACKERS] Tsvector editing functions

2016-03-11 Thread Stas Kelvich
> 
> On 11 Mar 2016, at 16:13, Stas Kelvich  wrote:
> 
> 
>> On 10 Mar 2016, at 20:29, Teodor Sigaev  wrote:
>> 
>> I would like to suggest renaming both functions to array_to_tsvector and
>> tsvector_to_array to have consistent names. Later we could add
>> to_tsvector([regconfig, ], text[]) with morphological processing.
>> 
>> Thoughts?
>> 

Hi, thanks for commit.

I saw errors on Windows; here is the fix:



tsvector.fix.patch

---
Stas Kelvich
Postgres Professional: http://www.postgrespro.com
Russian Postgres Company




Re: [HACKERS] auto_explain sample rate

2016-03-11 Thread Robert Haas
On Fri, Mar 11, 2016 at 11:33 AM, Tomas Vondra
 wrote:
> A bit late, but I think we should rename the GUC variable to
> "sampling_rate" (instead of sample_ratio) as that's what pgbench uses
> for the same thing. That'd be more consistent.

I like that idea.  It seems like slightly better terminology.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company




Re: [HACKERS] Inconsistent error handling in START_REPLICATION command

2016-03-11 Thread Robert Haas
On Fri, Mar 11, 2016 at 11:36 AM, David Steele  wrote:
> It looks like a decision needs to be made here whether to apply this patch,
> send it back to the author, or reject it so I'm marking it "Ready for
> Committer".
>
> Robert, since you were participating in this conversation can you have a
> look?

Who, me?  *looks around*

OK, I had a look.  I don't think this is a bug.  The original report
is complaining about this:

=
psycopg2test=# START_REPLICATION SLOT "test1" LOGICAL 0/0 ("invalid" 'value');
unexpected PQresultStatus: 8

The last command results in the following output sent to the server log:

ERROR:  option "invalid" = "value" is unknown
CONTEXT:  slot "test1", output plugin "test_decoding", in the startup callback
=

But that's just because psql knows utterly zero about copy-both mode.
So the problem here isn't that the server is doing anything it
shouldn't be doing; the server code is totally legitimate.  It's just
that the client is stupid.

Now, we could commit this patch anyway.  It won't hurt anything.  But
it won't help very much, either.  The fundamental issue is that
issuing a START_REPLICATION SLOT command in psql can't be expected to
work.  If you do that, it's either going to fail with a server error,
or it's going to fail with "unexpected PQresultStatus: 8".  Committing
this patch would cause this particular example to fail in the first
way rather than the second one, and I'm not direly opposed to that if
somebody really feels strongly about it.  But it seems like the wrong
thing to focus on.  I think the real question is: why are you
insisting on issuing a command in psql that can NEVER succeed there?
If you want that to work, you're going to need to add copy-both
support to psql, which might be a fun project but has nothing to do
with this patch.

So I am inclined to think it's fine to reject this, unless somebody
wants to argue that there's some advantage to changing it than I am
missing.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company




Re: [HACKERS] Proposal: RETURNING primary_key()

2016-03-11 Thread Igal @ Lucee.org

On 3/11/2016 12:40 AM, Craig Ringer wrote:

> That's why (sorry, Igal) I'd like to see some more tests for cases
> other than identity columns. How is GENERATED ALWAYS handled, if
> supported? What about if it's on a UNIQUE column? How about a PRIMARY
> KEY whose value is assigned by a DEFAULT or by a trigger?

I was using Oracle 11g XE, where GENERATED ALWAYS was not available.
This is the code I used for Oracle:

  CREATE TABLE jdbc (j_name VARCHAR2(64) NOT NULL, j_id NUMBER(10) NOT NULL);

  CREATE SEQUENCE jdbc_seq;

  CREATE OR REPLACE TRIGGER jdbc_seq_trigger
  BEFORE INSERT ON jdbc
  FOR EACH ROW
  WHEN (new.j_id IS NULL)
  BEGIN
    SELECT jdbc_seq.NEXTVAL
    INTO   :new.j_id
    FROM   dual;
  END;
  /

For DB2 the type is indeed GENERATED ALWAYS AS IDENTITY:

  j_id INT GENERATED ALWAYS AS IDENTITY

Originally the name was ID, but when both DB2 and MS/jTDS returned a
column named "ID" I realized that it might come from the column name, so
I modified the column name.  DB2 was indeed returning the column name,
while MS/jTDS returns a column named "ID" regardless of the actual
column name.

> Based on the rather funky behaviour Igal found I suspect the answer
> will be "nothing much" for all of those, i.e. it just doesn't work
> with other drivers/vendors. But I'd like to know.

I agree, but I can test it if you give me the SQL commands.  I do want
to remove all of that horrible software from my workstation as soon as
possible, but it can wait if more testing is required.

>> 2) Same for multicolumn keys:  Pg just returns (col1, col2) ==
>> (42, 146). Then client would be able to locate the row via "where
>> col1=42 and col2=146"
>
> Yeah, I was wondering about composite PKs.  I think Igal focused only
> on generated synthetic keys, which are after all the overwhelmingly
> common case when getting generated keys.

If you give me the code that you want to test I will test it.

>> 3) If multiple unique keys are present, it is fine if Pg returns one
>> or the other depending on the phase of the moon. Yet a more
>> compact key would be preferable to save on bandwidth.
>
> I disagree there. Behaviour must be well-defined and predictable unless
> it's really unavoidable.

I agree with Craig.

>> I think naming the resulting column(s) like "generated_key" /
>> "generated_keys" does not make much sense. Especially, for
>> multi-column keys.
>
> Yeah. At least in PgJDBC where it's a separate resultset (IIRC), so
> you have metadata that means you don't have to guess column names etc.

I'm not sure how multi-column keys work.  In both MySQL and SQL Server,
for example, you can not have more than one SEQUENCE column, so perhaps
that's their "solution".


Igal



Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Tomas Vondra
Hi,

On Fri, 2016-03-11 at 17:24 +0100, Michael Paquier wrote:
> On Fri, Mar 11, 2016 at 5:19 PM, Anastasia Lubennikova wrote:
> > 
> > BTW, if you know a good way to corrupt index (and do it
> > reproducible) I'd be
> > very glad to see it.
> You can use for example dd in non-truncate mode to corrupt on-disk
> page data, say that for example:
> dd if=/dev/random bs=8192 count=1 \
> seek=$BLOCK_ID of=base/$DBOID/$RELFILENODE \
> conv=notrunc

I guess /dev/random is not very compatible with the "reproducible"
requirement. I mean, it will reproducibly break the page, but pretty
much completely, which is mostly what checksums are for.

I think the main goal of the amcheck module is to identify issues that
are much more subtle - perhaps a coding error or an unhandled corner
case which results in a page with a valid structure and checksum, but
broken contents. Or perhaps issues coming from outside of PostgreSQL,
like a subtle change in the locales.

Simulating those issues requires a tool that allows minor tweaks to the
page, dd is a bit too dull for that.
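
To make the damage at least deterministic and small, a single-byte
variant of the same dd recipe could be used instead, e.g. (the in-page
offset of 128 is arbitrary):

dd if=/dev/zero bs=1 count=1 \
   seek=$(( BLOCK_ID * 8192 + 128 )) of=base/$DBOID/$RELFILENODE \
   conv=notrunc

Still, simulating a plausible coding error usually needs something
smarter than either.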

regards

-- 
Tomas Vondra  http://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services





Re: [HACKERS] Speedup twophase transactions

2016-03-11 Thread Jesper Pedersen

On 01/26/2016 07:43 AM, Stas Kelvich wrote:

Thanks for reviews and commit!

   As Simon and Andres already mentioned in this thread replay of twophase 
transaction is significantly slower than the same operations in normal mode.
Major reason is that each state file is fsynced during replay and while it is 
not a problem for recovery, it is a problem for replication. Under high 2pc 
update load lag between master and async replica is constantly increasing (see 
graph below).

   One way to improve things is to move fsyncs to restartpoints, but as we saw 
previously it is a half-measure and just frequent calls to fopen can cause 
bottleneck.

   Other option is to use the same scenario for replay that was used already 
for non-recovery mode: read state files to memory during replay of prepare, and 
if checkpoint/restartpoint occurs between prepare and commit move data to 
files. On commit we can read xlog or files. So here is the patch that 
implements this scenario for replay.

   Patch is quite straightforward. During replay of prepare records 
RecoverPreparedFromXLOG() is called to create memory state in GXACT, PROC, 
PGPROC; on commit XlogRedoFinishPrepared() is called to clean up that state. 
Also there are several functions (PrescanPreparedTransactions, 
StandbyTransactionIdIsPrepared) that were assuming that during replay all 
prepared xacts have files in pg_twophase, so I have extended them to check 
GXACT too.
   Side effect of that behaviour is that we can see prepared xacts in 
pg_prepared_xacts view on slave.

While this patch touches quite sensible part of postgres replay and there is 
some rarely used code paths, I wrote shell script to setup master/slave 
replication and test different failure scenarios that can happened with 
instances. Attaching this file to show test scenarios that I have tested and 
more importantly to show what I didn’t tested. Particularly I failed to 
reproduce the situation where StandbyTransactionIdIsPrepared() is called; maybe
somebody can suggest a way to force its usage. Also I’m not too sure about the
necessity of calling cache invalidation callbacks during 
XlogRedoFinishPrepared(), I’ve marked this place in patch with 2REVIEWER 
comment.

Tests shows that this patch increases speed of 2pc replay to the level when 
replica can keep pace with master.

Graph: replica lag under a pgbench run for a 200 seconds with 2pc update transactions (80 
connections, one update per 2pc tx, two servers with 12 cores each, 10GbE interconnect) 
on current master and with suggested patch. Replica lag measured with "select 
sent_location-replay_location as delay from pg_stat_replication;" each second.



Some comments:

* The patch needs a rebase against the latest TwoPhaseFileHeader change
* Rework the check.sh script into a TAP test case (src/test/recovery), 
as suggested by Alvaro and Michael down thread

* Add documentation for RecoverPreparedFromXLOG

+* that xlog record. We need just to clen up memmory state.

'clean' + 'memory'

+* This is usually called after end-of-recovery checkpoint, so all 2pc
+* files moved xlog to files. But if we restart slave when master is
+* switched off this function will be called before checkpoint ans we 
need
+* to check PGXACT array as it can contain prepared transactions that
+* didn't created any state files yet.

=>

"We need to check the PGXACT array for prepared transactions that 
doesn't have any state file in case of a slave restart with the master 
being off."


+* prepare xlog resords in shared memory in the same way as it 
happens

'records'

+* We need such behaviour because speed of 2PC replay on 
replica should
+* be at least not slower than 2PC tx speed on master.

=>

"We need this behaviour because the speed of the 2PC replay on the 
replica should be at least the same as the 2PC transaction speed of the 
master."


I'll leave the 2REVIEWER section to Simon.

Best regards,
 Jesper





Re: [HACKERS] Inconsistent error handling in START_REPLICATION command

2016-03-11 Thread David Steele
On 1/21/16 9:53 AM, Shulgin, Oleksandr wrote:
> On Thu, Jan 21, 2016 at 3:25 PM, Robert Haas wrote:
>
>
> So it's true that the client can't unilaterally terminate COPY BOTH
> mode; it can only send CopyDone.  But an error on the server side
> should do so.
>
>
> Hm, you're right.  Even though the server sends COPY_BOTH message
> before the plugin startup, an attempt by the client to actually read
> the data messages results in ErrorResponse handled on the client, then
> the client can re-submit the corrected START_REPLICATION command and
> continue without the need to reconnect.  This cannot be actually
> tested in psql, but I can verify the behavior with my python scripts.
>
> Then I don't have a preference for the early error reporting in this
> case.  If the current behavior potentially allows for more flexible
> error reporting, I'm for keeping it.

It looks like a decision needs to be made here whether to apply this
patch, send it back to the author, or reject it so I'm marking it "Ready
for Committer".

Robert, since you were participating in this conversation can you have a
look?

-- 
-David
da...@pgmasters.net





Re: [HACKERS] auto_explain sample rate

2016-03-11 Thread Tomas Vondra
Hi,

On Fri, 2016-03-11 at 15:11 +0100, Magnus Hagander wrote:
> Applied with a minor word-fix in the documentation and removal of
> some unrelated whitespace changes.

A bit late, but I think we should rename the GUC variable to
"sampling_rate" (instead of sample_ratio) as that's what pgbench uses
for the same thing. That'd be more consistent.

regards

--
Tomas Vondra  http://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services





Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Michael Paquier
On Fri, Mar 11, 2016 at 5:19 PM, Anastasia Lubennikova wrote:
> BTW, if you know a good way to corrupt an index (and do it reproducibly)
> I'd be very glad to see it.

You can use for example dd in non-truncate mode to corrupt on-disk
page data, say that for example:
dd if=/dev/random bs=8192 count=1 \
seek=$BLOCK_ID of=base/$DBOID/$RELFILENODE \
conv=notrunc
-- 
Michael




Re: [HACKERS] amcheck (B-Tree integrity checking tool)

2016-03-11 Thread Anastasia Lubennikova

01.03.2016 03:09, Peter Geoghegan:

I was assigned an "action point" during the FOSDEM developer meeting:
"Post new version of btree consistency checker patch". I attach a new
WIP version of my consistency checker tool, amcheck. This patch is
proposed for 9.6, as an extension in contrib -- maybe we can still get
it in. This is the first time I've added any version of this to a
commitfest, although I've posted a couple of rough versions of this in
the past couple of years. The attached version has received a major
overhaul, and is primarily aimed at finding corruption in production
systems, although I think it will still have significant value for
debugging too. Maybe it can help with some of the B-Tree patches in
the final commitfest, for example. I also have some hope that it will
become a learning tool for people interested in how B-Tree indexes
work.

To recap, the extension adds some SQL-callable functions that verify
certain invariant conditions hold within some particular B-Tree index.
These are the conditions that index scans rely on always being true.
The tool's scope may eventually cover other AMs, including heapam, but
nbtree seems like the best place to start.

Note that no function currently checks that the index is consistent
with the heap, which would be very useful (that's probably how I'd
eventually target the heapam, actually).


Hi,
thank you for the contrib module. The patch itself looks great.
It was really useful to have such a tool while working on b-tree patches.


Invariants


nbtree invariants that the tool verifies with just an AccessShareLock
on the relation are:

* That all items are in the correct, opclass order on each page.

* That the page "high key", if any, actually bounds the items on the page.

* That the last item on a page is less than or equal to the first item
on the next page (the page to its right). The idea here is that the
key space spans multiple pages, not just one page, so it make sense to
check the last item where we can.

With an ExclusiveLock + ShareLock, some addition invariants are verified:

* That child pages actually have their parent's downlink as a lower bound.

* Sane right links and left links at each level.

Obviously, this tool is all about distrusting the structure of a
B-Tree index. That being the case, it's not always totally clear where
to draw the line. I think I have the balance about right, though.

Interface
===

There are only 2 SQL callable functions in the extension, which are
very similar:

bt_index_check(index regclass)

bt_index_parent_check(index regclass)

The latter is more thorough than the former -- it performs all checks,
including those checks that I mentioned required an ExclusiveLock. So,
bt_index_check() only requires an AccessShareLock.
bt_index_parent_check() requires an ExclusiveLock on the index
relation, and a ShareLock on its heap relation, almost like REINDEX.
bt_index_parent_check() performs verification that is a superset of
the verification performed by bt_index_check() -- mostly, the extra
verification/work is that it verifies downlinks against child pages.

Both functions raise an error in the event of observing that an
invariant in a B-Tree was violated, such as items being out of order
on a page. I've written extensive documentation, which goes into
practical aspects of using amcheck effectively. It doesn't go into
significant detail about every invariant that is checked, but gives a
good idea of roughly what checks are performed.

I could almost justify only having one function with an argument about
the downlink/child verification, but that would be a significant
footgun given the varying locking requirements that such a function
would have.
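
So a basic session is just (lock levels as described above, "idx" being
whatever index you want to check):

CREATE EXTENSION amcheck;
SELECT bt_index_check('idx'::regclass);        -- AccessShareLock only
SELECT bt_index_parent_check('idx'::regclass); -- ExclusiveLock + ShareLock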


I've done some testing, and I can say that both announced functions work
well.
BTW, if you know a good way to corrupt an index (and do it reproducibly)
I'd be very glad to see it.


I created an index, then opened the file and changed the order of elements.
And bt_index_check() found the error successfully.

/* Ensure that the data is changed. */
SELECT * FROM bt_page_items('idx', 1);

 itemoffset | ctid  | itemlen | nulls | vars | data
+---+-+---+--+-
  1 | (0,3) |  16 | f | f| 03 00 00 00 00 00 00 00
  2 | (0,2) |  16 | f | f| 02 00 00 00 00 00 00 00
  3 | (0,1) |  16 | f | f| 01 00 00 00 00 00 00 00
(3 rows)


/* Great, amcheck found an error. */
select bt_index_check('idx');
ERROR:  page order invariant violated for index "idx"
DETAIL:  Lower index tid=(1,1) (points to heap tid=(0,3)) higher index 
tid=(1,2) (points to heap tid=(0,2)) page lsn=0/17800D0.



Locking
==

We never rely on something like holding on to a buffer pin as an
interlock for correctness (the vacuum interlock thing isn't generally
necessary, because we don't look at the heap at all). We simply pin +
BT_READ lock a buffer, copy it into local memory allocated by

Re: [HACKERS] Proposal: BSD Authentication support

2016-03-11 Thread David Steele
On 1/14/16 11:22 PM, Robert Haas wrote:
> On Tue, Jan 12, 2016 at 2:27 AM, Marisa Emerson  wrote:
>> I've attached the latest version of this patch. I've fixed up an issue with
>> the configuration scripts that I missed.
> Looks reasonable on a quick read-through.  Can anyone with access to a
> BSD system review and test?

Is anyone with access to/experience with BSD able to review and test
this patch?  Seems like it would make a great addition to 9.6.

Thanks,

-- 
-David
da...@pgmasters.net






Re: [HACKERS] Patch to implement pg_current_logfile() function

2016-03-11 Thread Gilles Darold
On 11/03/2016 15:22, Tom Lane wrote:
> Gilles Darold  writes:
>> Le 11/03/2016 10:49, Shulgin, Oleksandr a écrit :
>>> Would it make sense to have it as a symlink instead?
>> The only cons I see is that it can be more "difficult" with some
>> language to gather the real path, but do we really need it? There is
>> also little time where the symlink doesn't exist, this is when it needs
>> to be removed before being recreated to point to the new log file.
> Yeah, a symlink is impossible to update atomically (on most platforms
> anyway).  Plain file containing the pathname seems better.
>
> Another point is that we might not necessarily want *only* the pathname in
> there.  postmaster.pid has accreted more stuff over time, and this file
> might too.  I can see wanting the syslogger PID in there, for example,
> so that onlookers can detect a totally stale file.  (Not proposing this
> right now, just pointing out that it's a conceivable future feature.)
>
>   regards, tom lane

Here is the patch rewritten to use the alternate file
$PGDATA/pg_log_filename to store the current log filename used by the
syslogger. All examples used in the first mail of this thread work
exactly the same way. If there are no other remarks, I will add the
patch to the next commitfest.


Here are some additional examples with this feature.

To obtain the filling percentage of the log file when log_rotation_size
is used:

postgres=# SELECT pg_current_logfile(), (select setting::int*1000 from
pg_settings where name='log_rotation_size'), a.size size,
((a.size*100)/(select setting::int*1000 from pg_settings where
name='log_rotation_size')) percent_used FROM
pg_stat_file(pg_current_logfile())
a(size,access,modification,change,creation,isdir);

-[ RECORD 1 ]------+----------------------------------------
pg_current_logfile | pg_log/postgresql-2016-03-11_160817.log
log_rotation_size  | 10240000
size               | 1246000
percent_used       | 12

This can help to know if the file is near to be rotated. Or if you use
time based rotation:

postgres=# select pg_current_logfile(), (select setting::int*60 from
pg_settings where name='log_rotation_age')
log_rotation_age,a.access,a.modification, (((extract(epoch from
a.modification) - extract(epoch from a.access)) * 100) / (select
setting::int*60 from pg_settings where name='log_rotation_age'))
percent_used FROM pg_stat_file(pg_current_logfile())
a(size,access,modification,change,creation,isdir);
-[ RECORD 1 ]--+
pg_current_logfile | pg_log/postgresql-2016-03-11_162143.log
log_rotation_age   | 3600
access | 2016-03-11 16:21:43+01
modification   | 2016-03-11 16:33:12+01
percent_used   | 19.13889

Best regards,

-- 
Gilles Darold
Consultant PostgreSQL
http://dalibo.com - http://dalibo.org

diff --git a/doc/src/sgml/func.sgml b/doc/src/sgml/func.sgml
index 4b5ee81..e6a18fc 100644
--- a/doc/src/sgml/func.sgml
+++ b/doc/src/sgml/func.sgml
@@ -15026,6 +15026,11 @@ SELECT * FROM pg_ls_dir('.') WITH ORDINALITY AS t(ls,n);
channel names that the session is currently listening on
   
 
+  pg_current_logfile()
+   text
+   current log file used by the logging collector
+  
+
   
pg_notification_queue_usage()
double
@@ -15264,6 +15269,16 @@ SET search_path TO schema , schema, ..

 

+pg_current_logfile
+   
+
+   
+pg_current_logfile returns the name of the current log
+file used by the logging collector, as a text. Log collection
+must be active.
+   
+
+   
 pg_postmaster_start_time

 
diff --git a/src/backend/postmaster/syslogger.c b/src/backend/postmaster/syslogger.c
index e7e488a..4769dc5 100644
--- a/src/backend/postmaster/syslogger.c
+++ b/src/backend/postmaster/syslogger.c
@@ -145,6 +145,7 @@ static char *logfile_getname(pg_time_t timestamp, const char *suffix);
 static void set_next_rotation_time(void);
 static void sigHupHandler(SIGNAL_ARGS);
 static void sigUsr1Handler(SIGNAL_ARGS);
+static void store_current_log_filename(char *filename);
 
 
 /*
@@ -571,6 +572,9 @@ SysLogger_Start(void)
 
 	syslogFile = logfile_open(filename, "a", false);
 
+	/* store information about current log filename used by log collection */
+	store_current_log_filename(filename);
+
 	pfree(filename);
 
 #ifdef EXEC_BACKEND
@@ -1209,6 +1213,9 @@ logfile_rotate(bool time_based_rotation, int size_rotation_for)
 		fclose(syslogFile);
 		syslogFile = fh;
 
+		/* store information about current log filename used by log collection */
+		store_current_log_filename(filename);
+
 		/* instead of pfree'ing filename, remember it for next time */
 		if (last_file_name != NULL)
 			pfree(last_file_name);
@@ -1253,6 +1260,9 @@ logfile_rotate(bool time_based_rotation, int size_rotation_for)
 		fclose(csvlogFile);
 		csvlogFile = fh;
 
+		/* store information about current log filename used by log collection */
+		
