Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

Grant Ingersoll Thu, 04 Dec 2008 15:23:39 -0800


On Dec 4, 2008, at 2:21 PM, Jason Rutherglen wrote:

To put things in perspective, I believe Microsoft (who couldpotentially place a lot of resources towards Lucene) now uses Lucenethrough Powerset? and I don't think those folks are contributingback. I know of several other companies who do the same, and manypotential contributions that are not submitted because people andtheir companies do not see the benefit of going through the hoopsrequired to get patches committed. A relatively simple patch suchas 1473 Serialization represents this well.

What do you suggest? We didn't force anyone to use Lucene. Heck,most of our users don't even ever participate on the mailing list.

We do provide a very clear, transparent path for making contributionsand becoming a committer. I don't know what else we can do, but we'retotally open to suggestions on how to improve it.

FWIW, just b/c you think 1473 is trivial doesn't make it so. You havea single use case and that's all you care about. The community hasdozens, if not hundreds of use cases, and your "trivial" patch may notbe so trivial in that regards. How would you feel if we "broke"something that you have relied on for years in the name of us movingfaster? I am willing to bet the large number of people here in Luceneappreciate our deliberations for the most part. As for my opinion on1473, I personally think there are better ways of achieving what youare trying to do, as Robert and others have suggested and I don'tthink it is worth it to maintain serialization across versions as itis a too large of a burden, IMO. But, heh, make an argument(preferably w/o the accusations) and convince me otherwise.

For example if a company is developing custom search algorithms,Lucene supports TF/IDF but not much else. Custom search algorithmsrequire rewriting lots of Lucene code. Companies who write newsearch algorithms do not necessarily want to rewrite Lucene as wellto make it pluggable for new scoring as it is out of scope, theywill simply branch the code. It does not help that the core APIsunderneath IndexReader are protected and package protected whichassumes a user that is not advanced. It is repeated in the mailinglists that new features will threaten the existing user base whichis based on opinion rather than fact. More advanced users arecurrently hindered by the conservatism of the project and sonaturally have stopped trying to submit changes that alter the corenon-public code.

So, your mad at us for others not contributing back their forks? Eventhe ones we don't know about? Simply put, I'm sorry we can't pleaseyou. If you go read the archives, you will see plenty of times wheneven us committers have been frustrated from time to time by theprocess (just look at the JDK 1.5 debate, or the Interface/Abstractdebate) but in the end, I feel Lucene is stronger for it. Communityover code, it's the Apache Way. You are free to disagree. In fact,you have several options available to you to show that disagreement:1. You can work to become a committer and change it from within. Thebar really isn't that high, 3 to 4 non-trivial patches and awillingness to work with others in a mostly pleasant way. 2. You canmake us aware of the patches and be persistent about seeing it throughand we'll try to get to it. Just look at CHANGES.txt and JIRA and youwill see that this happens all the time and from a wide variety ofcontributors (including both you and John). 3. You can fork the codeand go do your thing and build your own community, etc.

Personally, I hope you choose 1 or 2, as we're all stronger togetherthan we are apart.

The rancor is from users would benefit from a faster pace and theability to be more creative inside the core Lucene system. As theinternals change frequently and unnannounced the process ofdeveloping core patches is difficult and frustrating.

I'm sorry that we can't work at a faster pace. Suggestions on how todeal with the number of patches we have and still maintain quality andhow to move forward w/o breaking old patches are much appreciated.

As for the internals changing, you have just hit the nail on the headas to why it is so important to maintain back-compat.

I simply don't get the unannounced part. What isn't announced? Geez,I've been a committer for a few years now, and I have yet to seeanother open source project that is as public as Lucene, for better orworse. Look at the archives, we regularly even put our warts out forpublic consumption in an effort to improve ourselves.

Rather than continue hijacking this thread, why don't we either let itdie and focus on serialization, or we go over to java-dev and you andJohn and the rest of us can create a concrete list of suggestions thatwe think could make Lucene better and we can all discuss them in apositive manner and see how we can go about addressing them. I'd bemore than happy to discuss there if you want.


Cheers,
Grant

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

Reply via email to