Top posting because this is a response to the thread as a whole.
It appears that this thread has identified some different reasons for
"needing" to break compatibility:
1) A current behavior is now deemed bad or wrong. Examples: the silent
truncation of large documents or an analyzer that works incorrectly.
2) Performance tuning such as seen in Token, allowing reuse.
3) Support of a new language feature, e.g. generics, that makes the
code "better".
4) A new feature requires a change to the existing API.
Perhaps there were others? Maybe specifics are in Jira.
It seems to me that the Lucene developers have done an excellent job
of figuring out how to maintain compatibility. This is a testament to
how well grounded the design of the API actually is, from early on and
even now. And changes seem to be well thought out, well designed and
carefully implemented.
I think that when it really gets down to it, the Lucene API will stay
very stable because of this.
On a side note, the cLucene project seems to be languishing (still
trying to get to 2.0) and any stability of the API is a good thing for
it. And perhaps for the other "ports" as well.
Again many thanks for all your hard work,
DM Smith, a thankful "parasite" :)
On Jan 23, 2008, at 5:16 PM, Michael McCandless wrote:
Chris Hostetter wrote:
: I do like the idea of a static/system property to match legacy
: behavior. For example, the bugs around how StandardTokenizer
: mislabels tokens (eg LUCENE-1100), this would be the perfect
: solution. Clearly those are silly bugs that should be fixed,
: quickly, with this back-compatible mode to keep the bug in place.
:
: We might want to, instead, have ctors for many classes take a
: required arg which states the version of Lucene you are using? So
: if you are writing a new app you would pass in the current version.
: Then, on dropping in a future Lucene JAR, we could use that arg to
: enforce the right backwards compatibility. This would save users
: from having to realize they are hitting one of these situations and
: then know to go set the right static/property to retain the buggy
: behavior.
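To make the quoted idea concrete, here is a hypothetical sketch of a
version-argument constructor (class, method, and version numbers are
all invented for illustration; this is not Lucene's actual API): each
object records the release its caller was written against, so a
future JAR can choose to emulate that release's behavior.

```java
// Hypothetical sketch of the version-argument idea (invented names;
// not Lucene's actual API): the caller states which release's
// behavior it was written against.
class VersionedTokenizer {
    private final int major;
    private final int minor;

    VersionedTokenizer(int major, int minor) {
        this.major = major;
        this.minor = minor;
    }

    /** True when the caller asked for pre-2.4 (buggy) tokenization. */
    boolean useLegacyTokenization() {
        return major < 2 || (major == 2 && minor < 4);
    }
}
```

An app written against 2.3 would construct `new VersionedTokenizer(2, 3)`
and keep the old behavior even after dropping in a newer JAR.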
I'm not sure that this would be better though ... when I write my
code, I pass "2.3" to all these constructors (or factory methods) and
then later I want to upgrade to 2.4 to get all the new performance
goodness ... I shouldn't have to change all those constructor calls
to get all the 2.4 goodness, I should be able to leave my code as is
-- but if I do that, then I might not get all the 2.4 goodness (like
improved tokenization, or more precise segment merging) because some
of that goodness violates previous assumptions that some code might
have had ... my code doesn't have those assumptions, I know nothing
about them, I'll take whatever behavior the Lucene developers
recommend (unless I see evidence that it breaks something, in which
case I'll happily set a system property or something that the release
notes say will force the old behavior).
The basic principle being: by default, give users the behavior that
is generally viewed as "correct" -- but give them the option to force
"uncorrect" legacy behavior.
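That escape-hatch principle could look something like the sketch
below (the property name and token labels are assumptions for
illustration, not a real Lucene property): corrected behavior by
default, with a documented system property that forces the old buggy
behavior for the rare user who depends on it.

```java
// Sketch of the system-property escape hatch. The property name
// "lucene.legacyTokenTypes" is invented for this example.
class TokenLabeler {
    // Boolean.getBoolean reads a JVM system property; false unless
    // the user explicitly sets -Dlucene.legacyTokenTypes=true.
    private static final boolean LEGACY =
        Boolean.getBoolean("lucene.legacyTokenTypes");

    /** Labels a host-like token; the old behavior mislabeled it. */
    String labelFor(String token) {
        if (LEGACY && token.contains(".")) {
            return "<HOST>";     // old, buggy label kept for back-compat
        }
        return "<ALPHANUM>";     // corrected default
    }
}
```

By default every upgrader gets the fix; only someone who reads the
release notes and sets the property retains the bug.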
OK, I agree: the vast majority of users upgrading would in fact want
all of the changes in the new release. And then the rare user who
is affected by that bug fix to StandardTokenizer would have to set
the compatibility mode. So it makes sense for you to get all
changes on upgrading (and NOT specify the legacy version in all
ctors).
: Also, backporting is extremely costly over time. I'd much rather
: keep compatibility for longer on our forward releases, than spend
: our scarce resources moving changes back.
+1
: So to summarize ... I think we should have (keep) a high tolerance
: for cruft to maintain API compatibility. I think our current
: approach (try hard to keep compatibility during "minor" releases,
: then deprecate, then remove APIs on a major release; do major
: releases only when truly required) is a good one.
I'm with you for the most part, it's just the definition of "when
truly required" that tends to hang people up ... there's a
chicken-vs-egg problem of deciding whether the code should drive what
the next release number is: "I've added a bitch'n feature but it
requires adding a method to an interface, therefore the next release
must be called 4.0" ... vs the mindset that "we just had a 3.0
release, it's too soon for another major release, the next release
should be called 3.1, so we need to hold off on committing
non-backwards-compatible changes for a while."
I'm in the first camp: version numbers should be descriptive,
information-carrying labels for releases -- but the version number of
a release should be dictated by the code contained in that release.
(If that means the next version after 3.0.0 is 4.0.0, then so be it.)
Well, I am wary of doing major releases too often. Though I do
agree that the version number should be a "fast match" for reading
through CHANGES.txt.
Say we do this, and zoom forward 2 years when we're up to 6.0, then
poor users stuck on 1.9 will dread upgrading, but probably shouldn't.
One of the amazing things about Lucene, to me, is how many really
major changes we have been able to make while not in fact breaking
backwards compatibility (too much). Being very careful not to make
things public, intentionally not committing to details like exactly
when a flush or commit or merge actually happens, marking new APIs
as experimental and freely subject to change, and using abstract
classes rather than interfaces are all wonderful tools that Lucene
employs (and should continue to employ) to enable sizable changes in
the future while keeping backwards compatibility.
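The abstract-class point can be sketched briefly (loosely modeled on
Lucene-style APIs but with invented code, not actual Lucene source):
a method with a default body can be added to an abstract class in a
later release without breaking existing subclasses, whereas adding a
method to an interface breaks every implementor (at least without
default methods, which Java did not have at the time).

```java
// Evolution-friendly abstract class: a later release added a method
// with a default body, so existing subclasses still compile.
abstract class Collector {
    abstract void collect(int doc);

    // Added in a later release. Subclasses written against the old
    // release inherit this default instead of failing to compile, as
    // they would if Collector were an interface gaining a new method.
    boolean acceptsDocsOutOfOrder() {
        return false;
    }
}

// A subclass written against the original release: it only knew
// about collect(), and needs no changes after the upgrade.
class CountingCollector extends Collector {
    int count;

    @Override
    void collect(int doc) {
        count++;
    }
}
```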
Allowing for future backwards compatibility is one of the most
important things we all do when we make changes to Lucene!
Mike
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------