robert engels wrote:
> The problem with abstract classes is that any methods you provide "know" something of the implementation, unless the methods are implemented solely by calling other abstract methods (which is rarely the case if the abstract class contains ANY private members).

Yes, abstract classes should generally avoid private fields that don't have both setters and getters.
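To make that concrete, here is a minimal Java sketch (the class and method names are hypothetical, not from Lucene): a concrete method that touches a private field ties every subclass to the base class's storage choice, while a base written purely against its own abstract methods "knows" nothing of the implementation.

```java
// Leaky: close() depends on a private field, so subclasses cannot
// change how the reference count is stored without re-implementing close().
abstract class LeakyReader {
    private int refCount = 1;           // private member baked into the base
    public final void close() {
        if (--refCount == 0) {
            doClose();
        }
    }
    protected abstract void doClose();
}

// Clean: every concrete method is written solely in terms of the
// abstract methods, so the subclass owns all of the state.
abstract class CleanReader {
    protected abstract int decRefCount();   // subclass decides the representation
    protected abstract void doClose();
    public final void close() {
        if (decRefCount() == 0) {
            doClose();
        }
    }
}
```

In the second form the base class imposes only a protocol, not a representation, which is the property the quoted text is asking for.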

> This is possible because the interfaces were designed very well. You MUST completely understand the problem domain in abstract terms in order to define proper interfaces.

That works for static problem domains. If Lucene resolves to make only bugfix releases, and not to substantially evolve its feature set, then this might be appropriate.

> IndexReader and IndexWriter should have been interfaces. If they were, lots of the code would not have been structured as it was, and many problems people had in producing "other" implementations could have been avoided.

The problem is not that they were not interfaces, but that they were not originally intended to be abstract and replaceable. The original design was that indexing would be the primary implementation that Lucene provided: things around indexing would be extensible, but indexing itself would not be. Extensibility was retrofitted onto an existing design, and it still shows in places.

If IndexReader and IndexWriter had originally been written to be extensible, it would still have been foolish to implement them as interfaces, given how much they have evolved. Each release would have broken every application.

> As for future expansion, it is improbable in most cases that adding new abstract methods will work; if that is the case, they can easily be added to a static utility class. If the API is really changing/adding, it is easy to create 'interfaceV2 extends interfaceV1'. If the code worked before, and you want to support backwards code compatibility between versions, this is a foolproof way to accomplish it.
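For concreteness, the 'interfaceV2 extends interfaceV1' pattern described above might be sketched like this (the names are hypothetical, not from any released API):

```java
// V1 is frozen once released; V2 adds methods without touching it.
interface Searcher {
    int docCount();
}

interface SearcherV2 extends Searcher {
    int deletedDocCount();
}

class Client {
    // Every caller must probe at runtime for the newer interface.
    static int liveDocs(Searcher s) {
        if (s instanceof SearcherV2) {
            SearcherV2 v2 = (SearcherV2) s;
            return v2.docCount() - v2.deletedDocCount();
        }
        return s.docCount();  // old implementations still work
    }
}
```

Note that the instanceof probe has to appear at every point where the new capability is used, which is the cost the approach carries.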

This is not foolproof. Not all extension is the addition of new methods. In Hadoop, for example, we wish to move from Mapper#map(key, value, reporter) to Mapper#map(MapContext), where MapContext has getKey(), getValue(), getReporter() and other methods. If Mapper were an abstract class, back-compatibility would be easy, since we could provide a default implementation of Mapper#map(MapContext) that calls Mapper#map(key, value, reporter). With interfaces things are much more complicated: for back-compatibility we must support both versions of the interface for a time, dynamically determining which version of the interface the application has implemented and calling it accordingly. This is ugly code that we could have avoided had we stuck to abstract classes. And the impact is felt not only where the Mapper is run, but also where it is specified (JobConf). So instead of localizing the change to Mapper.java, we have to add lots of runtime support and public API methods in other classes. Yuck.
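A minimal sketch of the abstract-class version of this migration (types are simplified to String and the reporter argument is omitted for brevity; this is not the actual Hadoop API):

```java
// Context object that bundles the arguments of the old signature.
class MapContext {
    private final String key, value;
    MapContext(String key, String value) { this.key = key; this.value = value; }
    String getKey()   { return key; }
    String getValue() { return value; }
}

abstract class Mapper {
    // Old API: existing applications override this.
    public void map(String key, String value) { }

    // New API: the default implementation forwards to the old method,
    // so old subclasses keep working unchanged; new subclasses simply
    // override this method instead.
    public void map(MapContext context) {
        map(context.getKey(), context.getValue());
    }
}
```

The framework can now call only map(MapContext) everywhere, and the change stays localized to Mapper itself.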

On the other hand, Hadoop's FileSystem is an abstract class. It has evolved considerably and applications have been able to upgrade without pain. Lucene's Directory has also evolved profitably without breaking external Directory implementations.

Interfaces look elegant, but looks can deceive.

Doug
