Re: [HACKERS] Proposal: Commit timestamp

Theo Schlossnagle Sat, 03 Feb 2007 13:58:56 -0800


On Feb 3, 2007, at 4:38 PM, Jan Wieck wrote:

On 2/3/2007 4:05 PM, Theo Schlossnagle wrote:
On Feb 3, 2007, at 3:52 PM, Jan Wieck wrote:
On 2/1/2007 11:23 PM, Jim Nasby wrote:
On Jan 25, 2007, at 6:16 PM, Jan Wieck wrote:
If a per database configurable tslog_priority is given, thetimestamp will be truncated to milliseconds and the incrementlogic is done on milliseconds. The priority is added to thetimestamp. This guarantees that no two timestamps for commitswill ever be exactly identical, even across different servers.
Wouldn't it be better to just store that informationseparately, rather than mucking with the timestamp?Though, there's anothe issue here... I don't think NTP is goodfor any better than a few milliseconds, even on a local network.How exact does the conflict resolution need to be, anyway?Would it really be a problem if transaction B committed 0.1seconds after transaction A yet the cluster thought it was theother way around?
Since the timestamp is basically a Lamport counter which is justbumped be the clock as well, it doesn't need to be too precise.
Unless I'm missing something, you are _treating_ the counter as aLamport timestamp, when in fact it is not and thus does notprovide semantics of a Lamport timestamp. As such, anyalgorithms that use lamport timestamps as a basis or assumptionfor the proof of their correctness will not translate (provably)to this system.
How are your counter semantically equivalent to Lamport timestamps?
Yes, you must be missing something.
The last used timestamp is remembered. When a remote transaction isreplicated, the remembered timestamp is set to max(remembered,remote). For a local transaction, the remembered timestamp is setto max(remembered+1ms, systemclock) and that value is used as thetransaction commit timestamp.

A Lamport clock, IIRC, require a cluster wide tick. This seems basedonly on activity and is thus an observational tick only which meansvarious nodes can have various perspectives at different times.

Given that time skew is prevalent, why is the system clock involvedat all?

As is usual distributed systems problems, they are very hard toexplain casually and also hard to review from a theoretical anglewithout a proof. Are you basing this off a paper? If so which one?If not, have you written a rigorous proof of correctness for thisapproach?


// Theo Schlossnagle
// CTO -- http://www.omniti.com/~jesus/
// OmniTI Computer Consulting, Inc. -- http://www.omniti.com/



---------------------------(end of broadcast)---------------------------
TIP 7: You can help support the PostgreSQL project by donating at

               http://www.postgresql.org/about/donate

Re: [HACKERS] Proposal: Commit timestamp

Reply via email to