Re: [Social] OMB and XMPP

Dave Cridland Fri, 30 Apr 2010 12:31:06 -0700

On Fri Apr 30 18:28:35 2010, Bob Wyman wrote:

Dave Cridland <[email protected]> wrote:
> you rely on the clients maintaining state.
It would be nice if we could, in fact, rely on clients to maintain state and to do so without error (This is hard to do in XMPP since there is no guarantee of message delivery...), however, while this relying on state-full clients is an attractive idea, it is likely that in many cases, we simply can't generally make this assumption. Other than the limitations of the XMPP protocol itself (i.e. no delivery guarantee), we also need to recognize that we're seeing significant growth in the use of clients that don't have a great deal of long-term storage capacity -- i.e. mobile phones, tablets, etc. Thus, in any case, "last mile" delivery of messages will need to support delivery of all data that might be needed by the client in order to do whatever it is that it does with new messages. (Yes, we can reduce the clients to simply be display devices for state maintained on intermediary servers, but this is not, I think, ideal.)

OK, I have to point out that until I replaced it, the average smartphone actually had more storage than my only *somewhat* aged and cheap laptop - I bought it for IETF-63 in Paris, back in 2005. My n800 - a 2007 tablet device - easily ranged up to 16G storage when it was launched. Really, I don't buy the argument that these things have significantly limited storage for these purposes.

Moreover, there's no need to rely on error-free, complete, storage, as long as clients are able to recover from failure, and if such protocols are well-designed, they'll form a generalized capability for clients to acquire sync with new feeds efficiently. This is a very well-understood problem, after all - Mark Crispin was synchronizing message feeds without transferring all the data some thirty years ago, on diskless devices and over 2600 baud.
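
To make the recovery argument concrete, here is a minimal sketch of a client that syncs by remembering only the last item id it has seen, and that recovers from total state loss by asking for a bounded recent window. Everything in it (the Feed and Client classes, since(), the window parameter) is hypothetical and invented for illustration; it is not an OMB, XMPP or IMAP API.

```python
class Feed:
    """Hypothetical server-side feed; items get strictly increasing ids."""

    def __init__(self):
        self._items = []  # list of (item_id, payload), oldest first

    def publish(self, payload):
        item_id = len(self._items) + 1
        self._items.append((item_id, payload))
        return item_id

    def since(self, last_id, limit=None):
        """Return items newer than last_id, optionally only the newest `limit`."""
        newer = [item for item in self._items if item[0] > last_id]
        return newer[-limit:] if limit else newer


class Client:
    """Hypothetical client that keeps a local store plus a sync marker."""

    def __init__(self):
        self.last_id = 0
        self.store = {}

    def sync(self, feed, window=50):
        if not self.store:
            # First run, or local state was lost: recover with a bounded
            # window of recent items instead of the whole history.
            items = feed.since(0, limit=window)
        else:
            # Normal case: transfer only what we have not seen yet.
            items = feed.since(self.last_id)
        for item_id, payload in items:
            self.store[item_id] = payload
            self.last_id = item_id  # ids arrive in increasing order
        return len(items)
```

The point of the sketch is that the client's store does not have to be complete or error-free: a client with an empty store simply resynchronises from the server.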

Finally, XMPP reliability (or otherwise) is also a well-understood and examined problem, which is why we're seeing the beginnings of XEP-0198 deployment, which does indeed provide reliability and stanza-level retransmission. Yet even without this, people have been content to use XMPP for really quite serious and critical communications anyway, so I'm deeply unconvinced that this is a practical problem for the vast majority at this point in time.
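
As a rough illustration of the stanza-level bookkeeping that XEP-0198 relies on: the peer reports a count "h" of stanzas it has handled, and the sender keeps everything beyond that count so it can be retransmitted when the stream is resumed. The sketch below models only that counter logic; the OutboundStream class and its method names are hypothetical, and no real XMPP library is used.

```python
from collections import deque


class OutboundStream:
    """Sender-side queue of stanzas not yet acknowledged by the peer."""

    def __init__(self):
        self.acked = 0          # last 'h' value the peer has reported
        self.unacked = deque()  # stanzas sent but not yet confirmed

    def send(self, stanza):
        # A real implementation would also write the stanza to the socket.
        self.unacked.append(stanza)

    def on_ack(self, h):
        """Handle an ack carrying the peer's handled-stanza count h."""
        newly_acked = (h - self.acked) % 2**32  # the counter wraps at 2^32
        if newly_acked > len(self.unacked):
            raise ValueError("peer acknowledged more stanzas than were sent")
        for _ in range(newly_acked):
            self.unacked.popleft()
        self.acked = h

    def on_resume(self, h):
        """On stream resumption, return the stanzas that must be resent."""
        self.on_ack(h)
        return list(self.unacked)
```

If three stanzas are sent and the connection drops after the peer has handled two, on_resume(2) returns just the third.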

And all this really doesn't help explain why you want to deliver data that's not only redundant, but utterly ignored by the client. We're not talking about small amounts, here - we're talking about messages taking up 4k, and that becomes really significant over the mobile devices you're talking about, since it's dramatically over the MTU for each and every message. I have a strong suspicion - albeit not one backed up by data - that this will increase battery usage on mobile devices.
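
A back-of-the-envelope calculation shows what "over the MTU" means in packets. The figures are assumptions, not measurements: "4k" is read as 4096 bytes, the MTU is taken as 1500 bytes (mobile links may well be smaller), and 40 bytes are allowed for IPv4 and TCP headers without options. TLS and stream compression are ignored.

```python
import math


def segments_needed(payload_bytes, mtu=1500, header_bytes=40):
    """Number of TCP segments needed to carry payload_bytes.

    header_bytes assumes 20 bytes of IPv4 header plus 20 bytes of TCP
    header, with no options; both defaults are illustrative assumptions.
    """
    max_segment = mtu - header_bytes
    return math.ceil(payload_bytes / max_segment)


print(segments_needed(4096))           # 3 segments at a 1500-byte MTU
print(segments_needed(4096, mtu=576))  # 8 segments at a 576-byte MTU
print(segments_needed(1000))           # 1 segment for a compact message
```

Under these assumptions a 4k message costs three packets where a compact one would cost a single packet.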

A reliance on clients' maintaining state would also seem to assume that a reasonably high percentage of the traffic shares message-independent "static" information with messages received earlier and thus that cache-hit rates are reasonably high. Client maintenance of state is most useful when all messages have the same originator. It is least useful when every message has a unique sender.

Today, most applications relevant to this discussion only support "topic-based" publish/subscribe. Thus, they implement what we tend to call "follow" -- messages will be received from some whitelisted set of publishers. However, in the future, I'm fairly confident that we'll see an increase in the number of systems that support "content-based" publish and subscribe. Thus, we'll see messages being delivered because of their content, not simply because of their author. This sort of thing will be very much like the "Track" function that originally influenced, in part, Twitter's adoption of "Atom over XMPP". In the "Track" use case, (when you might subscribe to all messages containing the keyword "XMPP") you'll often get messages from senders that you've never seen before or will never see again. Thus, you'll often find that cache hit rates are lower than you'd like even though you may dedicate a great deal of resource to maintaining that cache.

All of which is true, yet the Atom feed doesn't contain the avatar image, which is likely to be the only thing the device will care about - this is only contained by reference.

Instead, these currently need to be fetched by HTTP. This is quite sensible - persistent and one-time URLs allow this data to be fetched once only, and the usage of a distinct domain allows for a more efficient distribution architecture.

So the kind of fetching on demand and caching is actually going to be happening anyway, for one of two pieces of data the client's likely to be displaying to the user - the other being the message itself, of course.
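
The fetch-on-demand pattern for avatars amounts to something like the sketch below: fetch a URL the first time it is referenced, then serve it locally. The AvatarCache class is hypothetical, and the fetch function is injected so the example runs without a network; a real client would plug in an HTTP GET and would probably bound the cache size.

```python
class AvatarCache:
    """Fetch each avatar URL at most once, then answer from memory."""

    def __init__(self, fetch):
        self._fetch = fetch  # callable taking a URL and returning bytes
        self._cache = {}
        self.hits = 0
        self.misses = 0

    def get(self, url):
        if url in self._cache:
            self.hits += 1
        else:
            self.misses += 1
            self._cache[url] = self._fetch(url)
        return self._cache[url]


def fake_fetch(url):
    # Stand-in for an HTTP GET; returns placeholder bytes.
    return ("image for " + url).encode("utf-8")


cache = AvatarCache(fake_fetch)
# Four messages from two authors; the avatar URLs are made-up examples.
for author in ["alice", "bob", "alice", "alice"]:
    cache.get("http://avatars.example/" + author + ".png")
print(cache.hits, cache.misses)  # 2 2
```

Only the first message from each author costs a fetch, so a run of messages from the same few participants is served mostly from the cache.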

That aside, in a subject, or content-based system, you're still likely to end up seeing a lot of self-similarity between the authorships of messages, if only because people often do go through bursts of talking about a particular topic, and moreover talk in relatively compact sets of participants. We humans call this a conversation.

So, we see that, at least, limitations in the XMPP protocol, resource limitations on the clients, and a move towards cache-inefficient content-based routing all tend to argue against an assumption that we can rely on clients to maintain state...

You might, but "we" don't see that at all. Maybe it's because I'm more focussed on practice than theory.

You've failed to show that the reliability issue in XMPP is significant, or insurmountable.

You've utterly failed to convince me that a mobile device is so puny as you claim.

And while I agree that content-based routing will be *less* cache-efficient, it is not clear at all that the same strategies will simply be less efficient, as opposed to your implication that it'll be a net loss.


Dave.
--
Dave Cridland - mailto:[email protected] - xmpp:[email protected]
 - acap://acap.dave.cridland.net/byowner/user/dwd/bookmarks/
 - http://dave.cridland.net/
Infotrope Polymer - ACAP, IMAP, ESMTP, and Lemonade
