Re: [jira] Created: (LUCENE-1172) Small speedups to DocumentsWriter

2008-02-09 Thread Paul Elschot
Op Saturday 09 February 2008 02:00:02 schreef robert engels: > Curious... on things like this, is it really worth adding (and > maintaining) Lucene's own sort, just to achieve a 1.5 % performance > increase. It is almost doubtful that you can even measure an > improvement at that level, given

Re: [jira] Created: (LUCENE-1172) Small speedups to DocumentsWriter

2008-02-09 Thread Michael Busch
robert engels wrote: > Curious... on things like this, is it really worth adding (and > maintaining) Lucene's own sort, just to achieve a 1.5 % performance > increase. It is almost doubtful that you can even measure an improvement > at that level, given all of the variables you can't control. > I

[jira] Commented: (LUCENE-1169) Search with Filter does not work!

2008-02-09 Thread Eks Dev (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567306#action_12567306 ] Eks Dev commented on LUCENE-1169: - Thank for explaining it! So we have now classes implem

Re: [jira] Created: (LUCENE-1172) Small speedups to DocumentsWriter

2008-02-09 Thread Grant Ingersoll
I also agree w/ Robert and Michael, here. While DocsWriter is really effective, it is very complicated to follow and it makes debugging and maintenance much harder. -Grant On Feb 9, 2008, at 5:03 AM, Michael Busch wrote: robert engels wrote: Curious... on things like this, is it really w

Re: [jira] Created: (LUCENE-1172) Small speedups to DocumentsWriter

2008-02-09 Thread Yonik Seeley
On Feb 8, 2008 8:00 PM, robert engels <[EMAIL PROTECTED]> wrote: > Curious... on things like this, is it really worth adding (and > maintaining) Lucene's own sort, Unfortunately, Java's sort on Object[] is a mergeSort, and they allocate an axillary array to support that. Mike's latest tests show a

[jira] Updated: (LUCENE-325) [PATCH] new method expungeDeleted() added to IndexWriter

2008-02-09 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-325: -- Attachment: LUCENE-325.patch Attached patch. All tests pass. I plan to commit in a da

Re: [jira] Created: (LUCENE-1172) Small speedups to DocumentsWriter

2008-02-09 Thread Michael McCandless
I agree, there comes a point where the cost of added complexity is not worth the gains, on balance. Making that tradeoff is not easy. I don't think the patch in LUCENE-1172 crosses that line: a 1.6% (4.1% on small docs) top line gain is still a sizable gain. The profiler points to many other sm

[jira] Commented: (LUCENE-1172) Small speedups to DocumentsWriter

2008-02-09 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567326#action_12567326 ] Michael McCandless commented on LUCENE-1172: The above numbers were with the f

[jira] Commented: (LUCENE-1169) Search with Filter does not work!

2008-02-09 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567351#action_12567351 ] Paul Elschot commented on LUCENE-1169: -- Some of the bugs caused by this skipTo() beha

Re: Lucene-based Distributed Index Leveraging Hadoop

2008-02-09 Thread Srikant Jakilinki
Hi Ning, In continuation with our offline conversation, here is a public expression of interest in your work and a description of our work. Sorry for the length in advance and I hope that the folk will be able to collaborate and/or share experiences and/or give us some pointers... 1) We are