Re: QMFv2 object update bug [WAS Re: Questions from a novice]

Fraser Adams Sat, 27 Apr 2013 02:47:17 -0700

On 26/04/13 17:18, Ken Giusti wrote:

Hey Bill,



I first started to implement the additional objectUpdate callback as originally 
proposed.  Very easy to do.  But that additional api required other tools to be 
updated - and additional documentation changes, etc.

Hmm I remain a little baffled here. So as I understood it prior to anyfixing going on here there existed a bug that actually prevented QMF2updates actually hitting callbacks on the asynchronous python API,that's good IMHO because most clients of that API almost certainly wereexpecting a single callback call for a given object.

I'm kind of unclear about "other tools to be updated" 'cause I'd haveassumed given the previous QMF2 bug that no existing tools were actuallyreceiving v2 updates so wouldn't need "fixed". Sure they will needmodified to handle V2 if they want to consume both, but I'd argue thatit might be better to modify tools that need to handle both than to riskbreaking changes which will blat an indeterminate number of third partytools. It's not ideal for sure, but a non-breaking API extensionrequiring an addition to clients to use it feels better and TBH it'sactually clearer IMHO because there's some conscious choice being made.


After implementing this, I discovered a problem with the fix: tools that need 
to monitor both QMFv1 and QMFv2 agents starting reporting incorrect results 
(qpid-tool, to be specific).  These tools need to support both the old 
callbacks (for QMFv1 updates) and the new one (for QMFv2 updates), in order to 
see both QMFv1 and QMFv2 agents.

To be honest as I suggest above IMHO I actually think it's a good thingfor tools that wish to support both to be explicit. I think from thevarious sections of this thread it's pretty clear that trying to crowbareverything together is a *bad idea*. Sure QMF1 and QMF2 are essentiallyrepresenting equivalent information but there are sufficient differencesand subtleties to flash a great big red light. The reason it's good tohave a separate objectUpdate is because the QMF2 API *is* different atthat point because the protocol is sending what amounts to incompatibledata.

At that point there are two choices 1) an additional non-breaking APIcallback or 2) a shim in the client side that makes things behaveidentically.

The latter may be a bad choice if both v1 and v2 updates arrive becauseit'd need to apply a filter for the case of v1 plus v2 but it'd probablybe a good choice if it could select either v1 or v2 because thenexisting clients wouldn't need to be modified at all.


And that's when I realized that this fix was just a hack to work around the 
actual problem.  The real issue at hand is that a QMF agent must not transmit 
both QMFv1 and QMFv2 updates for the same object.

That's really what's broken here, and the C++ broker is doing precisely that.

Whoa there :-) I think that the problem is that this is *exactly* whatit should be doing :-)

OK what I mean is how does it know? Bear in mind that the updatescausing the fun here are *unsolicited* data pushes! These updates arebeing pushed to a topic, so unless any client is subscribing toqmf.default.topic/data.ind.# or whatever it isn't going to be getting v2updates, similarly if it's not subscribing to the equivalent onqpid.management/ it's not going to be getting v1 updates. OTOH if it'ssubscribing to both that's exactly what it's getting.

The key thing is there's a fairly large level of decoupling going onhere, so I don't believe that there's an especially easy way for thebroker ManagementAgent to work out who's asking for what and thus avoidtransmitting both v1 and v2.

Funnily enough in the v2 protocol and API there's actually no concept ofan *unsolicited* data push, so I'd argue that what the broker iscurrently doing for the v2 updates is an "undocumented feature" :-),what V2 actually talks about is the concept of "query subscriptions".With query subscriptions the asynchronous pushes are actually*solicited* by the client actually requesting some specified subset ofthe data, so the Management Agent does actually have a mechanism totrack what data any given subscribers are actually interested in.


That's clearly a moot point, but it would help with this pickle.

I was musing over this problem and I believe that there is an option,albeit a non-trivial one :-(

So the *real* issue is that it's difficult to manage the carnage causedwhen both v1 and v2 updates happen. You've previously suggesteddisabling v1 by default, I've sort of got sympathy with that :-) Howeverthere are two (I suspect significant) use cases that will be impacted bythat choice:1) Any asynchronous console that relies on the existing v1 callbackswill stop working - I suspect that in practice that'll mean every singledarned one of them except (at a push) qpid-tool. There may be a lot ofthird party tools using this API so it may be an issue. At the veryleast there needs to be some real good communication to the user list.2) If you disable v1 then any third party Agents will break too. Againperhaps they need to change to v2 and this is a catalyst, but it justmay cause non-trivial business impact on users.

Ultimately I think that it's very worth pushing out a survey on the userlist to gauge the potential impact.

The non-trivial mitigation that I was musing over was whether there wasa possibility of constructing a v1->v2 bridge Agent? What I mean is thatit might be possible to get the broker ManagementAgent to only send v2updates if there's a mechanism for the broker to intercept v1 updatesfrom third party Agents and map those into v2 too, thus making everydata push consistently V2? I *think* this could work because I believethat v1 requires broker involvement in order to work (I think that wasone of the reasons for moving to v2).

If it were possible to get to a clear point of only sending v2 updatesthen it should be possible to have the asynchronous Console API shimmedin such a way that it behaves *exactly* the same way with v2 updates asit did with v1 updates so it's possible to mitigate the impact on thirdparty console applications (yay!).

It has to be said It's be much better if users didn't have to rewriteapplications because of objectUpdate versus propUpdate/statUpdate butclearly a shim would be required to make v2 updates behave correctly andtransparently for propUpdate/statUpdate callbacks.

The result of the survey question "how many people have custom v1Agents" would help determine whether it's worth investing in a v1->v2bridge Agent.

A bridge ought to be possible, that's actually exactly what I wrote forthe Java Broker in the Qmf2ManagementAgent to get that to talk QMF2(albeit I was mapping from its internal model and not QMF1, but theprincipal is the same - just don't ask me to do it in C++ though :-))


Why is that a problem?  Because a console receiving these two updates cannot 
tell that they are for the same object.  To the console application, it appears 
as two separate objects.

As I mention above I don't believe that it's any easier for the brokerto do a "diff" given the unsolicited nature of these updates :-)

I'm proposing that we should solve this problem by disabling QMFv1 updates 
coming from the C++ broker.  This should be the default for the next release 
(0.26).  It should still be possible to turn them on manually if desired, but 
the C++ broker should only transmit QMFv2 type updates going forward.

As I say above I've got a fair amount of sympathy with this, andparochially if it means that by doing this you can make statUpdate andpropUpdate for v2 behave *exactly* the same as for v1 so clients don'tneed to be rewritten for v2 then it'd be better for me 'cause I onlyever use v2 aside from this.

However as I said above the only reason that this Console API eversupported both was to cater for the case where a user needed to handleboth v1 and v2 updates and that use case is actually third party v1 Agents.

I think it's important to at least check for how much impact this isgoing to cause users.


We can minimize the impact to console applications by having the objectProps 
and objectStats callbacks be invoked when a QMFv2 object arrives instead of 
introducing a new callback. [Bill - I'll fix the console to not call 
objectProps when only stats are present].  I think this should eliminate the 
need to re-code console applications.

So I don't think it's about "minimising the impact" I think that the APIneeds to behave *exactly* the same for this to be any use. Basically anyasynchronous client hit by only v2 updates should behave identically tohow they behaved prior to this update or it's still introducing aregression into an indeterminate number of applications. I've no ideahow hard it'd be to achieve this nirvana though.


This will cause problems for folks that use an old python console against the 
next release of the broker.  These folks will have to upgrade the console as 
well.

Does anyone have a better alternative?

Well I'm not necessarily saying that my suggestions above are *better*:-) It's difficult for sure. I guess that my biggest concern isintroducing breaking changes against an unknown number of third partyConsoles and Agents.

One other thing makes me nervous. I guess that there's the scenariowhere users are working with a mixed economy of brokers at differentversions or who (like Bill) may be unable to alter the broker config toonly push out a particular version.

All brokers from at least 0.8 can push out v2 so it *might* be safe tohave the console only subscribe to v2 updates (providing a switch tospecify v1). If you can make propUpdate and statUpdate behaveconsistently then that only leaves the case where there are v1 Agents tosupport, which is where the bridge Agent may come in.


How much do you regret starting on this fix now Ken? :-D

It's worth pointing out that this is I suspect only the tip of theiceberg for the sort of problems that might start to be encountered whenmigration to AMQP 1.0 Management starts to gain traction. My stronghunch is that bridge Agents/ManagementNodes are going to become prettyimportant friends when it comes to trying to manage any transition. It'ssomething that's going to bite sooner or later and the C++, Java &Python communities are going to have to be pretty joined up. I quitelike QMF, but clearly some mistakes were made along the way given thelack of support in the Java community and frankly the fragmentation ofAPIs and lack of take up on the "official" QMF2 API. We *really* need totry harder next time round and act as an exemplar in the AMQP community.


Frase













---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: QMFv2 object update bug [WAS Re: Questions from a novice]

Reply via email to