Re: [Standards] presence muc element

Dave Cridland Fri, 25 Jun 2010 02:02:55 -0700

On Thu Jun 24 21:52:22 2010, Matthew Wild wrote:

On 24 June 2010 21:33, Justin Karneges
<[email protected]> wrote:
> It's a common problem to join a muc that already thinks you arejoined, and> then the presence you send is interpretted as a mere statuschange rather> than a full join. Then you don't get the room roster, history,etc. Kev> informs me that the <x xmlns="http://jabber.org/protocol/muc";>element> (hereby referred to as "the muc element") is supposed to solvethis problem.> You include it only on join stanzas, but not on status changestanzas. This> way, if a muc sees the element but thought you were alreadyjoined, it can do
> a proper rejoin.
>
Yes, Prosody has had this code since the early days, however we
currently have it commented out due to Google Talk's issues. Gajim
also included the element on nick changes, but we ensured this was
fixed, and added a workaround for it.

But there's little way we can work around Google's oddity (well
technically there is, but none I'd be happy with releasing).

There is, in fact, a workaround in M-Link, too, in as much as it'spossible to strip out the XEP-0045 control element on inboundpresence from a domain before the processing code ever sees it. I'dbe loathe to put that into production.

But don't be coy about this - this is an interop bug, not a mereoddity. While I don't see anything in the spec suggesting thatdirected presence should be repeated, I admit there's nothing in thespec about it not being repeated either, so we either have a bug inthe spec (if Google insist the spec allows them to do this) or a bugin GTalk (if they admit they shouldn't). Either way, it needsresolution.

> However, this seems to break with servers that replay directedpresence.> Allegedly gtalk does this. Every 5 minutes, the client's serverreplays the> directed presence to the muc, which includes the muc element,causing the> user to constantly rejoin the muc (at least, for those mucs thatrespect the
> muc element properly).
>
> Some solutions:
>  1) Servers shouldn't replay directed presence.
I don't see that randomly re-sending join requests shouldn't resultinmultiple joins to a room. Broadcasted presence is a different case,it
is more of a "state" than an instruction.
> 2) If presence is replayed, replay only the elements safe toreplay.
>

That requires the server to know which elements those are - MUC
obviously isn't, but what's to say more won't come along? (e.g.
temporary presence subscriptions).

There's actually a (3), which is "If you replay directed presence,then add a delay".

We already add a delay element to other cases of repeated presence,after all, such as in response to probes.


In this particular case, this'd leave the following cases:

User            Not Joined              Joined
Delay           Join                            Ignore
No Delay                Join                            Rejoin

Of course, I'm not convinced that this solves nickname changes, whichis another long-standing Google interop bug.

There's another issue caused by this bug - for every repeatedpresence, that causes a presence element to be resent by the MUC toevery occupant, unless we also start to filter presence stanzas.

(Stanzas per sec from this is G*(G+N)/300, where G is Google usersand N is non-Google users, kids. In jdev, as I write, that's G==3 andN==25, hence averaging 0.3 stanzas a second - nothing we can'thandle, but add in another 6 Googlers and it's already reached 1/sec.)

Agreed, but what's done is done, and without using presence for MUCwe
wouldn't have the unavailable on disconnect.

Right - MUC is a presence-based system, so needs to operate overpresence.

The only other solution would be to make distinct the history,current occupants, and current subject retrieval, which is also anoption. Ideally, you'd also need to include nick changes here whichstarts to radically impinge on the design.

> Kev additionally informs me that M-Link's muc service may be theonly one that
> performs rejoins properly when receiving the muc element.

It wasn't the first, but it likely is the only one at the moment. I
didn't consider it acceptable to release logic that is broken withone
of the largest XMPP deployments on the internet, so as I said, we
removed it from Prosody with a view to re-adding it if/when Google
finally cleaned up their act.

It's not clear to me why Google does this, I have attempted tocontact them to resolve the interop issue, and I've yet to have aresponse, but I'd hope that they're keen to sort this out - it'sentirely possible I'm using the wrong contact details or something,so if Google people are reading this, please, I'm keen to get asolution.

But aside from being rather spectacularly wasteful of bandwidth, aclient should be able to tell that at least the history is just that- due again to the delay. In addition, it's only the last presencethat Google replays, as far as I understand things, so clients canworkaround this bug by sending an "update" presence after joining.They can even workaround the nickname changing bug, as Kev pointedout to me, by sending unavailable to the old nickname aftersuccessfully changing nickname. So if client developers find theirusers are suffering, there are workarounds they could deploy, whichare much easier to do there.

The alternative is that either servers ignore Google, have specialworkaround for Google, or else avoid improvements across the boarddue to Google.

Personally, I'm willing to dig my heels in a bit on this. I can'tthink of any cases where repeating directed presence is a good idea,and in the absence of the specification mentioning this, I don'tbelieve that implementations should be doing so. Unless I see areason to change my mind, and/or unless customers start to complain,I see no reason to change our behaviour.

> If indeed few mucs
> are supporting this, then maybe we have an opportunity to amendthis problem> in XEP-0045. That is, change the XEP to make it clear that themuc element> does not cause rejoins, and possibly look for a different rejoinsolution
> that does not break the presence model.
>

That would be fine, we can update our code easily, and would be glad
to see it back in action. However the issue can be solved more
generally by implementing XEP-0198 for s2s, and logic to make
unavailable any remote users when it's detected that their serverhas
crashed. This is my current goal.

Yes, also if we ensure servers respond correctly to probes whendirected presence is involved we can probe in various cases - thatsaid, I know M-Link doesn't respond correctly in this case, althoughwe're working on that. (I'm curious as to whether other servers do,as well - this is a bis thing we've not caught up with yet, AFAIK).

As an aside, here, it may be required that clients send unavailableto their old nickname after a nick change, as suggested above as aworkaround to Google, since the server has to track the directedpresence in order to send unavailable and respond to pings - if theclient never sends the unavailable to match the directed presence,then various state mismatches could occur.


Dave.
--
Dave Cridland - mailto:[email protected] - xmpp:[email protected]
 - acap://acap.dave.cridland.net/byowner/user/dwd/bookmarks/
 - http://dave.cridland.net/
Infotrope Polymer - ACAP, IMAP, ESMTP, and Lemonade

Re: [Standards] presence muc element

Reply via email to