Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

robert engels Wed, 03 Dec 2008 21:50:21 -0800

My two cents...

I think the committers do a great job of managing the product. Ifeel the single biggest failure when it comes to producing qualitysoftware is lack of vision, and/or enforcement of this vision.

If every "wisher" or "submitter" had their code committed - even ifit is "good code" - the product would quickly become unwieldy tomaintain and/or learn (for new users), lessening its usefulness toeveryone.

The only problem I have with Lucene's current focus is that I feelthe Lucene folks should work on standardizing the API, focusing oninterfaces and/or abstract classes with proper protected level access.

By doing this, people are much freer to develop their ownenhancements, and can quickly apply them to later Lucene releasesjust by applying a patch (at worst), or just a link (at best !).Similar to how the JDK works. We have rarely if ever needed to changeour code between JDK releases.

I realize this is a dream right now, because of the bad shape (sorry)of the structure of much of Lucene, but if the committers spent moretime on issues like this, I think they would hear far less complaintsfrom the community.

As an example of the above - being able to access the underlyingreaders in a multi-reader (I know there is a current bug for this).There is no harm to Lucene folks to expose this, and it is veryhelpful in many cases. If some developer uses this information in thewrong way, that is their fault, not Lucene's.... Making somethingprotected is very different than making it public.


Robert Engels

On Dec 3, 2008, at 11:36 PM, John Wang wrote:

Grant:

        I am sorry that I disagree with some points:
1) "I think it's a sign that Lucene is pretty stable." - Whilelucene is a great project, especially with 2.x releases, greatimprovements are made, but do we really have a clear picture on howlucene is being used and deployed. While lucene works great runningas a vanilla search library, when pushed to limits, one needs to"hack" into lucene to make certain things work. If 90% of the userbase use it to build small indexes and using the vanilla api, andthe other 10% is really stressing both on the scalability and apiside and are running into issues, would you still say: "runningwell for 90% of the users, therefore it is stable or extensible"? Ithink it is unfair to the project itself to be measured by thevanilla use-case. I have done couple of large deployments, e.g. >30million documents indexed and searched in realtime., and I reallyhad to do some tweaking.
2) "You want stuff committed, keep it up to date, make itmanageable to review, document it, respond to questions/concernswith answers as best you can. " - To some degree I would hope itdepends on what the issue is, e.g. enforcing such process on a one-line null check seems to be an overkill. I agree with the processitself, what would make it better is some transparency on howpatches/issues are evaluated to be committed. At least seemed fromthe outside, it is purely being decided on by the committers, andsince my understanding is that an open source project belongs tothe public, the public user base should have some say.
3) which brings me to this point: "I personally, would love to workon Lucene all day every day as I have a lot of things I'd love toengage the community on, but the fact is I'm not paid to do that,so I give what I can when I can. I know most of the othercommitters are that way too." - Is this really true? Isn't a largepart of the committer base also a part of the for-profit,consulting business, e.g. Lucid? Would groups/companies that payfor consulting service get their patches/requirements committedwith higher priority? If so, seems to me to be a conflict ofinterest there.
4) "Lather, rinse, repeat. Next thing you know, you'll be on thereceiving end as a committer." - While I agree that being acommitter is a great honor and many committers are awesome, butassuming everyone would want to be a committer is a littlepresumptuous.
In conclusion, I hope I didn't unleash any wrath from thecommitters for expressing candor.
-John
On Wed, Dec 3, 2008 at 2:52 PM, Grant Ingersoll<[EMAIL PROTECTED]> wrote:
On Dec 3, 2008, at 2:27 PM, Jason Rutherglen (JIRA) wrote:



Hoss wrote: "sort of mythical "Lucene powerhouse"
Lucene seems to run itself quite differently than other open sourceJava projects. Perhaps it would be good to spell out the reasonsfor the reluctance to move ahead with features that developers workon, that work, but do not go in. The developer contributions seemto be quite low right now, especially compared to neighbor projectssuch as Hadoop. Is this because fewer people are using Lucene? Oris it due to the reluctance to work with the developer community?Unfortunately the perception in the eyes of some people who work onsearch related projects it is the latter.
Or, could it be that Hadoop is relatively new and in vogue at themoment, very malleable and buggy(?) and has a HUGE corporatesponsor who dedicates lots of resources to it on a full time basis,whilst Lucene has been around in the ASF for 7+ years (and 12+years total) and has a really large install base and thus must movemore deliberately and basically has 1 person who gets to work on itfull time while the rest of us pretty much volunteer? That's notan excuse, it's just the way it is. I personally, would love towork on Lucene all day every day as I have a lot of things I'd loveto engage the community on, but the fact is I'm not paid to dothat, so I give what I can when I can. I know most of the othercommitters are that way too.
Thus, I don't think any one of us has a reluctance to move aheadwith features or bug fixes. Looking at CHANGES.txt, I see a lotof contributors. Looking at java-dev and JIRA, I see lots ofengagement with the community. Is it near the historical high fortraffic, no it's not, but that isn't necessarily a bad thing. Ithink it's a sign that Lucene is pretty stable.
What we do have a reluctance for are patches that don't have tests(i.e. this one), patches that massively change Lucene APIs in non-trivial ways or break back compatibility or are not kept up todate. Are we perfect? Of course not. I, personally, would lovefor there to be a way that helps us process a larger volume ofpatches (note, I didn't say commit a larger volume). Hadoop'sautomated patch tester would be a huge start in that, but at theend of the day, Lucene still works the way all ASF projects do: viameritocracy and volunteerism. You want stuff committed, keep itup to date, make it manageable to review, document it, respond toquestions/concerns with answers as best you can. To that end, areal simple question can go a long way and getting somethingcommitted, and it simply is: "Hey Lucener's, what else can I doto help you review and commit LUCENE-XXXX?" Lather, rinse,repeat. Next thing you know, you'll be on the receiving end as acommitter.
-Grant



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

Reply via email to