Re: [Standards] Proposed XMPP Extension: Roster Versioning

Richard Dobson Wed, 05 Mar 2008 05:25:13 -0800

You can't use timestamps - they're not strictly increasing, forvarious reasons.

Why does it need to be strictly increasing? As already explained theversion identifiers should IMO be opaque and just be a serverimplementation issue, I still can't see any reason why this needs to beset in stone the protocol as a MUST, should only be a RECOMMENDED sothat server implementors have an idea of where to start from as a way toimplement this.

Firstly, two roster changes could happen at precisely the same moment.To be fair, by introducing cluster node identifiers, and having astrict strong ordering of them, you could avoid this.

It could do yes, why is it a problem if they happen at the same time andare marked with the same timestamp?, it will just result in them bothbeing pushed, how is that an issue?.

Secondly, the clock on a computer can, and surprisingly often does, gobackwards. That's a much harder problem to solve.

Maybe so, do you have any more information on how prevalent this is? Howlong it lasts for etc?

Thirdly, in a clustering situation, you'd have to ensure that the timeon each cluster node was perfectly synchronized.

No you wouldn't necessarily, not if the timestamping was happening atthe central data storage layer (i.e. the database server), and againthis is just an implementation issue and is easily overcome, notsomething that means its impossible.

So the closest you can do would be a modified timestamp that hadadditional logic during generation to ensure it never went backwards,in which case you don't need the cluster identifier anymore, andthat's effectively the same as having a strictly increasing integersequence anyway, so it's easier to just do that. But even if you didwant to use timestamps, just representing them as an integer is prettytrivial. Look at the definition of "modtime" in ACAP (RFC 2244), whichdefines a strictly increasing modified timestamp represented using digits.

Yes I know I could represent them as integers, but id rather not if Idon't have do, id prefer to have the flexibility to compress and shortenthem to reduce bandwidth consumption as much as possible.

It's useful for clients to be able to determine the ordering locally,on occasion. If we removed this, we'd also have to ensure that rosterpushes were sent to the client in-order, which currently we don'tmandate. (Making this a SHOULD is sane, but in the cluster case, it'squite hard).

Well im pretty sure XMPP dictates in order processing of stanzas sosurely the roster updates should thus be in order? Also you haven'treally answered my concerns about allowing clients to determine meaningfrom the version identifier introducing the possibility of bugs andinteroperability problems which IMO is a far more serious issue, and onethat doesn't exist if the client just treats them as opaque strings.

Also even if they were out of order (which I think would be unlikelybecause it would be only likely to happen when several roster updateswere happening at the same time) it doesn't really cause much of anissue as far as I can see, its just that you might have one or tworoster pushes that you have already cached pushed to you again, hardlythe end of the world, and should be something the clients should be ableto cope with, as what would happen if a servers database server crashedand needed to be restored from a backup and the most recent rosterupdates that have already been pushed arn't there, or the server nowthinks it hasn't pushed the changes yet and ends up re-pushing changes,it shouldn't make any difference to the client, some method tore-synchronize needs to be in place to handle this, I think to solvethis issue if the version id (be that timestamp or incrementing id) theclient specifies is further on than any of the ones the server has youwould need to re-push the entire list the server has invalidating theclient list somehow (to ensure new now non-existent contacts that werecreated in between the db backup and the crash do not hang around).

Plus, nobody can get it wrong.
How exactly are they going to get it wrong if its an identifier thatonly the server is interpreting the meaning of?
It's the server I'm worried about. :-)

OK but that doesn't really answer my question.

You just use a 128-bit unsigned integer. There is no upper limit here- in particular, there is no upper limit specified anywhere in thisdocument - XSD merely states that a xs:nonNegativeInteger is asequence of digits, and has "countably infinite" cardinality.
If you really and truly believe that practical limits of 64-bitunsigned integers can cause problems in the real world, I honestlydon't know what to say except show you the figures - you could havethousands of updates every millisecond, and still last over half amillion years - 574,542 roughly, assuming a fixed year length of365.25 days.
I'm all for designing for the future, but you have to draw the linesomewhere, and besides, I figure we'll be on something bigger than64-bit well before then - a jump to 128-bit gains us 10^25 years ofbreathing space, and I'd like to imagine we can think up a solutionwithin that time, assuming that's prior to the heat death of the universe.

Sure you can keep increasing the bit size of your integer in yourimplementation, but the spec still needs to dictate what happens onceyou reach overflow if its going to define that you have to implement itthat way as well as how long the integer should be, although I stillfail to see why strictly increasing numbers should be a MUST at theprotocol level, IMO this is an internal server implementation issue andnot a protocol one, im all for recommending that as a way for the serverto implement it, but still don't think it should be MUST only RECOMMENDED.


Richard

Re: [Standards] Proposed XMPP Extension: Roster Versioning

Reply via email to