Probably going to disagree here... but that's ok!

I think IndexReader and IndexWriter would have been perfect interfaces - as long as the concepts were kept very abstract.

putDocument(), getDocument(), findDocument(), etc.

and supported those semantics.

That is what I find key to hiding the implementation. If you start with very abstract concepts, the implementation does not creep into the API.
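As a rough illustration (invented names and types, not Lucene's actual API), the kind of contract I mean would look something like:

    import java.io.IOException;
    import java.util.List;
    import java.util.Map;

    // Deliberately abstract: nothing here says how documents are stored,
    // merged, or locked - that is the implementation's business.
    public interface DocumentIndex {
        // A "document" is just named fields here; Lucene's real Document class is richer.
        void putDocument(String id, Map<String, String> fields) throws IOException;

        Map<String, String> getDocument(String id) throws IOException;

        // The query is deliberately opaque; how matching works is an implementation detail.
        List<String> findDocuments(String query) throws IOException;
    }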

When "things change" even with abstract classes, you still have a problem of sorts. Take locking for instance. If you never had locking in a system, and add it later, even though you could add methods like 'lockIndex()' and 'unlockIndex()' other clients that worked well before if they were not updated to call these methods properly would either fail, or lock the index from everyone else while they were open. Probably not good in either case. So adding concepts usually requires code changes anyway.

In the example you cite, there are two possibilities: 1) a MapContext can ALWAYS be created from a key, value, and reporter - in which case your API change is for programmer convenience only and can easily be performed in other ways; or 2) it cannot, in which case your code internally has two paths anyway, with branching logic based on the type of key.
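To make possibility (1) concrete - with made-up placeholder types, not Hadoop's real Mapper/MapContext API - a small static bridge covers both call shapes without touching the interface:

    // None of these types are Hadoop's real API; this is just the shape of the argument.
    final class MapContext<K, V> {
        final K key;
        final V value;
        final Object reporter;

        MapContext(K key, V value, Object reporter) {
            this.key = key;
            this.value = value;
            this.reporter = reporter;
        }
    }

    interface OldMapper<K, V> { void map(K key, V value, Object reporter); }
    interface NewMapper<K, V> { void map(MapContext<K, V> context); }

    final class MapperBridge {
        // Old-style call sites can drive a new-style mapper...
        static <K, V> void call(NewMapper<K, V> m, K key, V value, Object reporter) {
            m.map(new MapContext<K, V>(key, value, reporter));
        }

        // ...and new-style call sites can drive an old-style mapper.
        static <K, V> void call(OldMapper<K, V> m, MapContext<K, V> ctx) {
            m.map(ctx.key, ctx.value, ctx.reporter);
        }
    }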

I think your statement about 'static problem domains' is a bit incorrect. It depends on what you define as the problem. Going back to the TableModel interface, its problem domain could be stated as 'an interface to tabular data for display and editing', and it makes no assumptions about how the data is stored, what the data is, etc.

Applied to Lucene, IndexWriter could be defined as 'place a document in an index, suitable for later retrieval'.

Yes, you need to know a bit more about the available operations, but if they are kept abstract enough, the interfaces follow quite easily, AND they don't paint you into a corner.

On Mar 19, 2008, at 1:01 PM, Doug Cutting wrote:

robert engels wrote:
The problem with abstract classes is that any methods you provide "know" something of the implementation, unless the methods are implemented solely by calling other abstract methods (which is rarely the case if the abstract class contains ANY private members).

Yes, abstract classes should generally avoid private fields that don't have both setters and getters.
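For illustration (an invented example, not from Lucene), such a base class keeps its concrete behavior expressed purely in terms of abstract accessors:

    public abstract class AbstractCounter {
        protected abstract long getCount();
        protected abstract void setCount(long count);

        // Knows nothing about how a subclass stores the count;
        // the base class holds no private state of its own.
        public void increment() {
            setCount(getCount() + 1);
        }
    }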

This is possible because the interfaces were designed very well. You MUST completely understand the problem domain in abstract terms in order to define proper interfaces.

That works for static problem domains. If Lucene is resolved to only make bugfix releases, and not to substantially evolve its feature set, then this might be appropriate.

IndexReader and IndexWriter should have been interfaces. If they were, lots of the code would not have been structured as it was, and many problems people had in producing "other" implementations could have been avoided.

The problem is not that they were not interfaces, but that they were not originally intended to be abstract and replaceable. The original design was that indexing would be the primary implementation that Lucene provided, and that things around indexing would be extensible, but that indexing itself would not be. Extensibility was retrofitted onto an existing design, and it still shows some.

If IndexReader and IndexWriter were originally written to be extensible it would have been foolish to implement them as interfaces given the amount that these have evolved. Each release would have broken every application.

As for future expansion, it is improbable in most cases that adding new abstract methods will work - if that is the case, they can easily be added to a static utility class. If the API is really changing/adding, it is easy to create 'interfaceV2 extends interfaceV1'. If the code worked before, and you want to support backwards code compatibility between versions, this is a foolproof way to accomplish it.
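A sketch of that versioning idea, with invented names:

    interface IndexV1 {
        void putDocument(String id, String text);
    }

    interface IndexV2 extends IndexV1 {
        void putDocument(String id, String text, float boost);
    }

    final class IndexHelper {
        // Old implementations keep compiling against IndexV1; callers that want
        // the new capability probe for V2 at runtime.
        static void put(IndexV1 index, String id, String text, float boost) {
            if (index instanceof IndexV2) {
                ((IndexV2) index).putDocument(id, text, boost);
            } else {
                index.putDocument(id, text); // boost not supported by V1 implementations
            }
        }
    }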

This is not foolproof. Not all extension is the addition of new methods. In Hadoop, for example, we wish to move from Mapper#map(key, value, reporter) to Mapper#map(MapContext), where MapContext has getKey(), getValue(), getReporter() and other methods. If Mapper were an abstract class, back-compatibility would be easy, since we could provide a default implementation of Mapper#map(MapContext) that calls Mapper#map(key, value, reporter). With interfaces things are much more complicated, since, for back-compatibility, we must support both versions of the interface for a time, dynamically determining which version of the interface the application has specified and calling it accordingly. This is ugly code that we could have avoided if we'd stuck to abstract classes. And the impact is not only where the Mapper is run, but also where it is specified (JobConf). So instead of localizing the change to Mapper.java, we have to add lots of runtime support and public API methods in other classes. Yuck.
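In rough code (simplified placeholder types, not Hadoop's actual signatures), the abstract-class route is just:

    abstract class MapContext<K, V> {
        abstract K getKey();
        abstract V getValue();
        abstract Object getReporter();
    }

    abstract class Mapper<K, V> {
        // Old contract: existing applications override this.
        abstract void map(K key, V value, Object reporter);

        // New contract: the default implementation forwards to the old method, so
        // existing Mapper subclasses keep working when the framework calls map(context).
        void map(MapContext<K, V> context) {
            map(context.getKey(), context.getValue(), context.getReporter());
        }
    }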

On the other hand, Hadoop's FileSystem is an abstract class. It has evolved considerably and applications have been able to upgrade without pain. Lucene's Directory has also evolved profitably without breaking external Directory implementations.

Interfaces look elegant, but looks can deceive.

Doug
