Re: CAS/CasView design, another summary

Thilo Goetz Sun, 07 Jan 2007 04:37:07 -0800

Marshall Schor wrote:

Thilo Goetz wrote:
As I see it, we're not going to reach consensus on this issue. I guess this is at least in part due to the fact that we disagree on the basic premises underlying this redesign. I am -1 to the current proposal, and I'll give my reasons below. However, I think we've already discussed most of what I have to say, and if everybody else thinks the current proposal is a good idea, I will not stand in its way. Anyway, here goes.
Thanks for trying to clearly restate your thinking :-)
<snip>
The switch from single-artifact CASes to multi-Sofa CASes and views was a fundamental change in the basic UIMA architecture. We are not doing our users a favor by hiding this change from them.
I think it's useful to recognize there are sofa-aware components, and sofa-unaware components. Most of the components so far are sofa-unaware ones - although this may change somewhat over time. I think it is useful to make writing sofa-unaware components "easy" in the sense of not forcing these to deal with multiple sofas/views concepts. This is what is driving some of the design thinking, I believe, not just trying to be "backward compatible". I also think this is useful for UIMA adoption - in helping new users climb the learning curve - initially they would be able to ignore multi-sofa/multi-views.

But that's not what the proposal suggests. In the current proposal, the view methods on the CAS are deprecated.

For sofa-aware components, I agree it is not useful to hide this.
By sacrificing a clean design to backward compatibility, we may keep some existing users happy, but we're not going to gain any new ones. If even UIMA developers find it that hard to get their heads wrapped around the concepts and APIs, how much harder is it going to be for new users?
Is the main sacrifice you see to "clean design" the inclusion of forwarding methods in the CasView to avoid users having to pay attention to casView.getCas(), and recognizing this distinction? If not, I may have missed it...
If so, I've had more than one user tell me they don't like to have to remember how we organize our objects at this level, to follow chains to get to methods they want. They like the forwarding methods. Since these APIs are mainly for the users, I would say we should be willing to sacrifice some "cleanliness" for this.
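
To make the trade-off concrete, here is a minimal sketch of what a forwarding method looks like next to the explicit chain through getCas(). The types SketchCas, SketchCasView and the method getTypeSystemName() are hypothetical and chosen only for illustration; they are not the actual UIMA interfaces or the proposal's method set.

```java
// Hypothetical types for illustration only; not the real UIMA CAS or CasView.
interface SketchCas {
    String getTypeSystemName();
}

interface SketchCasView {
    // The "chain" entry point: callers go through the owning CAS.
    SketchCas getCas();

    // Forwarding method: lets callers write view.getTypeSystemName()
    // instead of view.getCas().getTypeSystemName().
    default String getTypeSystemName() {
        return getCas().getTypeSystemName();
    }
}

final class SimpleSketchCas implements SketchCas {
    @Override
    public String getTypeSystemName() {
        return "demo-type-system";
    }
}

final class SimpleSketchView implements SketchCasView {
    private final SketchCas cas;

    SimpleSketchView(SketchCas cas) {
        this.cas = cas;
    }

    @Override
    public SketchCas getCas() {
        return cas;
    }
}
```

Both call styles return the same value here; the disagreement in this thread is only about whether the shorter form is worth the extra API surface on the view.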

I've never heard this before. Let those users come forward and discuss that on the mailing list. I could simply claim that for each user who wants forwarding functions there are two who don't, but who have never said so because there is no need.

I think the difficulty UIMA developers find is not necessarily an indication of the difficulty users would have. UIMA developers are trying to keep multiple use cases of multiple kinds of users (e.g. sofa-aware, sofa-unaware) in mind simultaneously, and come up with a design which satisfies all of these somewhat conflicting goals, simultaneously. (That's why we're having all these headaches :-) )

I don't think so. I've helped many a new user through their initial and growing UIMA pains (and so have you, I know). There's certainly a lot of confusion with the CAS/JCas interfaces we already have, and the current proposal doesn't make things easier (it eliminates virtually no APIs, but makes some deprecated and adds a whole bunch of new ones, without adding any functionality).

I question the need for backward compatibility for Sofa-unaware annotators. Those days are over. This basic tenet robs us of the ability to clean up the CAS APIs. For example, when I look at the CAS APIs from a world where views are real, I naturally expect CAS.getIndexRepository() to return all indexes in the CAS to me, not just the ones for the default view.
In the case where the world is one where "views are real" - that naturally feels like the case of a sofa-aware component.
- For sofa-aware components, I would expect this call to be invalid (as a User), because Index-sets "belong" to Views, and the CAS isn't a view.
- For sofa-unaware components, I would expect this to work as before - this is more like the case where "views aren't real".
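
The two readings of an index lookup on the CAS itself can be sketched with a toy model. Everything below is an assumption made for illustration: MultiViewCas, its "_default" view name, and the methods defaultViewIndex() and allIndexes() are hypothetical and do not correspond to the real CAS.getIndexRepository() or to anything in the proposal.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical model, not the UIMA API: a CAS that keeps one index per view.
final class MultiViewCas {
    static final String DEFAULT_VIEW = "_default"; // assumed name, illustration only

    private final Map<String, List<String>> indexByView = new LinkedHashMap<>();

    void addToIndex(String viewName, String annotation) {
        indexByView.computeIfAbsent(viewName, k -> new ArrayList<>()).add(annotation);
    }

    // Sofa-unaware reading: a lookup on the CAS sees only the default view's index.
    List<String> defaultViewIndex() {
        return new ArrayList<>(indexByView.getOrDefault(DEFAULT_VIEW, new ArrayList<>()));
    }

    // "Views are real" reading: a lookup on the CAS sees the indexes of every view.
    List<String> allIndexes() {
        List<String> all = new ArrayList<>();
        for (List<String> index : indexByView.values()) {
            all.addAll(index);
        }
        return all;
    }
}
```

A third option, the one suggested for sofa-aware components, would be for the CAS-level call to fail outright and force callers to ask a specific view. A single method name cannot carry all three meanings without surprising someone, which is the crux of this part of the thread.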

This contradicts the "everything is accessible from the CAS without views" axiom.

The CommonCas interface adds to the confusion, because it isn't (a common CAS API). It follows the methodology that everything that can be abstracted, is abstracted. However, that's not how people think. We like to think in API groups and what things logically belong together, not what can and can not be grouped because of method return types. So all it does is add to the confusion because you always have to look in two places for APIs.
Well, we can rename the CommonCas interface, if we can come up with a better name. But I find that abstraction is very useful for ongoing code maintenance, and understanding what is intended to be the same and/or different among things. It often shows up design oversights - something is done in one case and not "thought of" by the developer in another case.

It certainly shows that something is wrong with our design, I agree. I just disagree on the way of fixing it. Let's come up with a definition of what makes a CAS a CAS. Then have an interface that defines that. Then we can implement it a couple or three or four times.

From a documentation point of view, I hope to have it both ways - by describing the JCas and Cas APIs as including the CommonCas API (which of course it does, as a super-interface). So the users won't need to pay attention to this detail of abstraction; they can ignore the CommonCas API as a separate entity. The current IDEs like Eclipse support this, too (e.g., autocompletion shows the whole set).

The JCas is a wrapper around the plain old CAS. If we position it that way, we don't need a common ancestor. Why doesn't the JCas just extend the CAS interface, if it inherits that many functions?
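
As a rough sketch of that last question, a wrapper interface can extend the plain interface directly and delegate, so that no third "common" super-interface is required. The names PlainCas, WrapperJCas, BasicCas, BasicJCas and the single method getDocumentText() are hypothetical stand-ins for illustration; the real CAS and JCas interfaces are much larger and may have return-type differences that this toy example does not capture.

```java
// Hypothetical stand-ins for illustration; not the real UIMA CAS or JCas interfaces.
interface PlainCas {
    String getDocumentText();
}

// The wrapper interface extends the plain one, so it inherits the shared
// methods without a separate common ancestor.
interface WrapperJCas extends PlainCas {
    PlainCas getCas();
}

final class BasicCas implements PlainCas {
    private final String text;

    BasicCas(String text) {
        this.text = text;
    }

    @Override
    public String getDocumentText() {
        return text;
    }
}

final class BasicJCas implements WrapperJCas {
    private final PlainCas wrapped;

    BasicJCas(PlainCas wrapped) {
        this.wrapped = wrapped;
    }

    @Override
    public PlainCas getCas() {
        return wrapped;
    }

    // Inherited method is implemented by delegating to the wrapped CAS.
    @Override
    public String getDocumentText() {
        return wrapped.getDocumentText();
    }
}
```

With this shape, any code written against PlainCas also accepts a WrapperJCas. Whether that works for the real interfaces depends on whether every inherited method can keep the same signature, which is the concern the CommonCas abstraction was introduced to address.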
