RE: [VOTE] Release Apache Lucene Java 3.0.0 (take #2)

2009-11-23 Thread Uwe Schindler
Hi all, Hoss reported a bug about two fields missing in the equals/hashCode of BooleanQuery (which has existed since 1.9, https://issues.apache.org/jira/browse/LUCENE-2092). Should I respin 3.0 because of this or just release it? Speak out loud if you want a respin (else vote)! We will apply the bug
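
Editor's illustration of the kind of bug described above, as a minimal sketch and not Lucene's actual code. SketchQuery is a hypothetical stand-in for BooleanQuery; coordDisabled follows the issue title, while minMatch is an assumed second field, since the message does not name it. The point is only that every state-bearing field has to take part in both equals and hashCode, otherwise two queries that differ in such a field compare as equal (for example when used as keys in a map).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Objects;

/** Hypothetical stand-in for a query class; this is NOT Lucene's BooleanQuery. */
final class SketchQuery {
    private final List<String> clauses = new ArrayList<>();
    private final boolean coordDisabled; // named after the flag in the issue title
    private final int minMatch;          // assumed second field, for illustration only

    SketchQuery(boolean coordDisabled, int minMatch) {
        this.coordDisabled = coordDisabled;
        this.minMatch = minMatch;
    }

    /** Adds a clause and returns this query so calls can be chained. */
    SketchQuery add(String clause) {
        clauses.add(clause);
        return this;
    }

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (!(o instanceof SketchQuery)) return false;
        SketchQuery other = (SketchQuery) o;
        // Compare every field that changes the query's behaviour,
        // not just the clause list.
        return coordDisabled == other.coordDisabled
            && minMatch == other.minMatch
            && clauses.equals(other.clauses);
    }

    @Override
    public int hashCode() {
        // Hash the same set of fields that equals compares.
        return Objects.hash(clauses, coordDisabled, minMatch);
    }
}
```

With both fields included, two queries with identical clauses but different flag values are no longer treated as the same key.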

[jira] Issue Comment Edited: (LUCENE-2086) When resolving deletes, IW should resolve in term sort order

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781806#action_12781806 ] Uwe Schindler edited comment on LUCENE-2086 at 11/24/09 7:16 AM: ---

[jira] Updated: (LUCENE-2086) When resolving deletes, IW should resolve in term sort order

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-2086: -- Fix Version/s: 3.0.1 2.9.2 > When resolving deletes, IW should resolve in t

[jira] Commented: (LUCENE-2086) When resolving deletes, IW should resolve in term sort order

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781806#action_12781806 ] Uwe Schindler commented on LUCENE-2086: --- Just commit it. It is so simple, so if I re

[jira] Commented: (LUCENE-2092) BooleanQuery.hashCode and equals ignore isCoordDisabled

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781804#action_12781804 ] Uwe Schindler commented on LUCENE-2092: --- bq. Well, if it's been there since 1.9 and

[jira] Updated: (LUCENE-2092) BooleanQuery.hashCode and equals ignore isCoordDisabled

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-2092: -- Fix Version/s: 3.1 3.0.1 2.9.2 > BooleanQuery.hashCode a

[jira] Updated: (LUCENE-2047) IndexWriter should immediately resolve deleted docs to docID in near-real-time mode

2009-11-23 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-2047: - Attachment: LUCENE-2047.patch TestIndexWriter passes, mostly due to removing assertions

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781746#action_12781746 ] Robert Muir commented on LUCENE-1606: - Yonik, maybe we can use this trick? UTF-8 in U

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-23 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781729#action_12781729 ] Yonik Seeley commented on LUCENE-1606: -- geeze... maybe we should have just stuck with

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781721#action_12781721 ] Robert Muir commented on LUCENE-1606: - I spent a while with this, thinking I would be

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781720#action_12781720 ] Robert Muir commented on LUCENE-1458: - Mike, if it means anything, I prefer the new be

[jira] Commented: (LUCENE-2090) convert automaton to char[] based processing and TermRef / TermsEnum api

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781718#action_12781718 ] Robert Muir commented on LUCENE-2090: - right, we could use constant suffix to stay wit

Re: [jira] Commented: (LUCENE-2092) BooleanQuery.hashCode and equals ignore isCoordDisabled

2009-11-23 Thread DM Smith
Since this is a bug fix, please mark it for 2.9.2 if there ever is one. On Nov 23, 2009, at 7:08 PM, Michael McCandless (JIRA) wrote: > >[ > https://issues.apache.org/jira/browse/LUCENE-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781706#acti

[jira] Updated: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1458: Attachment: LUCENE-1458_sortorder_bwcompat.patch this one is more serious. the change to byte[] ch

[jira] Commented: (LUCENE-2092) BooleanQuery.hashCode and equals ignore isCoordDisabled

2009-11-23 Thread Erick Erickson (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781716#action_12781716 ] Erick Erickson commented on LUCENE-2092: Well, if it's been there since 1.9 and th

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781713#action_12781713 ] Michael McCandless commented on LUCENE-1458: Well, for starters can't we just

[jira] Commented: (LUCENE-2090) convert automaton to char[] based processing and TermRef / TermsEnum api

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781710#action_12781710 ] Michael McCandless commented on LUCENE-2090: That sounds compelling -- you'd s

[jira] Commented: (LUCENE-2092) BooleanQuery.hashCode and equals ignore isCoordDisabled

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781706#action_12781706 ] Michael McCandless commented on LUCENE-2092: Fixed on trunk... I'll backport t

[jira] Commented: (LUCENE-2090) convert automaton to char[] based processing and TermRef / TermsEnum api

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781694#action_12781694 ] Robert Muir commented on LUCENE-2090: - Hi Mike, I think an easier win is to perhaps ad

[jira] Updated: (LUCENE-2092) BooleanQuery.hashCode and equals ignore isCoordDisabled

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2092: --- Attachment: LUCENE-2092.patch Attached patch... the fix caused TestWildcard.testEmpt

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781689#action_12781689 ] Robert Muir commented on LUCENE-1458: - bq. Maybe open a new issue for this? Or, don't

[jira] Commented: (LUCENE-2086) When resolving deletes, IW should resolve in term sort order

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781687#action_12781687 ] Michael McCandless commented on LUCENE-2086: bq. No problem, you can commit to

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781683#action_12781683 ] Robert Muir commented on LUCENE-1458: - bq. In general, I think things like unpaired su

[jira] Assigned: (LUCENE-2092) BooleanQuery.hashCode and equals ignore isCoordDisabled

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned LUCENE-2092: -- Assignee: Michael McCandless > BooleanQuery.hashCode and equals ignore isCoord

[jira] Commented: (LUCENE-2086) When resolving deletes, IW should resolve in term sort order

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781680#action_12781680 ] Michael McCandless commented on LUCENE-2086: Yeah there's an exception to ever

[jira] Commented: (LUCENE-2092) BooleanQuery.hashCode and equals ignore isCoordDisabled

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781679#action_12781679 ] Michael McCandless commented on LUCENE-2092: This is quite nasty. Uwe, should

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781677#action_12781677 ] Michael McCandless commented on LUCENE-1458: {quote} the patch and test are in

[jira] Commented: (LUCENE-2086) When resolving deletes, IW should resolve in term sort order

2009-11-23 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781666#action_12781666 ] Mark Miller commented on LUCENE-2086: - No objection, but it's an awkward precedent - yo

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781667#action_12781667 ] Uwe Schindler commented on LUCENE-2075: --- bq. Perhaps it's just the ratio of seeks to

[jira] Commented: (LUCENE-2086) When resolving deletes, IW should resolve in term sort order

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781663#action_12781663 ] Uwe Schindler commented on LUCENE-2086: --- No problem, you can commit to 3.0 branch as

[jira] Commented: (LUCENE-1781) Large distances in Spatial go beyond Prime MEridian

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781659#action_12781659 ] Michael McCandless commented on LUCENE-1781: No, this is fixed in 3.0 (to be r

[jira] Commented: (LUCENE-1781) Large distances in Spatial go beyond Prime MEridian

2009-11-23 Thread Bill Bell (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781655#action_12781655 ] Bill Bell commented on LUCENE-1781: --- Has this been applied to 2.9.1 ? Or do I need to a

[jira] Commented: (LUCENE-2086) When resolving deletes, IW should resolve in term sort order

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781654#action_12781654 ] Michael McCandless commented on LUCENE-2086: Hmm these really are sizable gain

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781638#action_12781638 ] Michael McCandless commented on LUCENE-1458: Also, on the flex branch I believ

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781635#action_12781635 ] Michael McCandless commented on LUCENE-1458: LUCENE-510 (fixed in 2.4 release)

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781636#action_12781636 ] Yonik Seeley commented on LUCENE-1458: -- In general, I think things like unpaired surr

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781633#action_12781633 ] Yonik Seeley commented on LUCENE-2075: -- bq. For NRQ the overall improvement has no ne

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781628#action_12781628 ] Michael McCandless commented on LUCENE-2075: bq. You see the effect of the cac

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781627#action_12781627 ] Michael McCandless commented on LUCENE-2075: bq. I am quite sure that also Rob

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781624#action_12781624 ] Robert Muir commented on LUCENE-1458: - Also, I am curious in general if we support any

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781621#action_12781621 ] Uwe Schindler commented on LUCENE-2075: --- I changed my benchmark to better show the s

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781617#action_12781617 ] Robert Muir commented on LUCENE-1458: - Mike, what to do about MultiTermQueries now? th

[jira] Commented: (LUCENE-2086) When resolving deletes, IW should resolve in term sort order

2009-11-23 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781615#action_12781615 ] Tim Smith commented on LUCENE-2086: --- Got some performance numbers: Description of test

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781603#action_12781603 ] Robert Muir commented on LUCENE-1458: - the patch and test are in revision 883485. I ad

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781594#action_12781594 ] Uwe Schindler commented on LUCENE-2075: --- bq. I wonder if your test is getting any ca

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781589#action_12781589 ] Robert Muir commented on LUCENE-1458: - bq. OK I think this is a good approach, in the

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781585#action_12781585 ] Robert Muir commented on LUCENE-1606: - I think I have a workaround for this enum that

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781582#action_12781582 ] Michael McCandless commented on LUCENE-1458: bq. if the term ends with a lead

[jira] Created: (LUCENE-2092) BooleanQuery.hashCode and equals ignore isCoordDisabled

2009-11-23 Thread Hoss Man (JIRA)
BooleanQuery.hashCode and equals ignore isCoordDisabled --- Key: LUCENE-2092 URL: https://issues.apache.org/jira/browse/LUCENE-2092 Project: Lucene - Java Issue Type: Bug Componen

[jira] Updated: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1458: Attachment: LUCENE-1458_termenum_bwcompat.patch attached is a patch that provides a workaround for

[jira] Commented: (LUCENE-1877) Use NativeFSLockFactory as default for new API (direct ctors & FSDir.open)

2009-11-23 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781531#action_12781531 ] Marvin Humphrey commented on LUCENE-1877: - >> take it somewhere other than this cl

Socket and file locks

2009-11-23 Thread Marvin Humphrey
On Sun, Nov 22, 2009 at 10:36:57AM +, Thomas Mueller (JIRA) wrote: > Thomas Mueller commented on LUCENE-1877: > > > > take it somewhere other than this closed issue. > > Yes, where? The java-dev list. > > shouldn't active code like that live in the

[jira] Updated: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2075: --- Attachment: LUCENE-2075.patch New patch, folding in Yonik's suggestions, adding a un

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781505#action_12781505 ] Michael McCandless commented on LUCENE-2075: bq. Because of random ranges on t

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781460#action_12781460 ] Uwe Schindler commented on LUCENE-2075: --- Because of random ranges on the whole range

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781454#action_12781454 ] Michael McCandless commented on LUCENE-2075: Uwe, why do you see so much varia

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781452#action_12781452 ] Michael McCandless commented on LUCENE-2075: {quote} What about replacing the

[jira] Issue Comment Edited: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781431#action_12781431 ] Robert Muir edited comment on LUCENE-1606 at 11/23/09 3:38 PM: -

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781431#action_12781431 ] Robert Muir commented on LUCENE-1606: - Mike, just one comment here. I am definitely w

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781420#action_12781420 ] Robert Muir commented on LUCENE-1458: - {quote} I realize a java String can easily cont

[jira] Issue Comment Edited: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781410#action_12781410 ] Uwe Schindler edited comment on LUCENE-2075 at 11/23/09 2:13 PM: ---

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781410#action_12781410 ] Uwe Schindler commented on LUCENE-2075: --- I tested with a 5 million doc index containing

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781407#action_12781407 ] Yonik Seeley commented on LUCENE-2075: -- What about replacing the expensive division w

[jira] Updated: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2075: --- Attachment: LUCENE-2075.patch Thanks Uwe! I attached another one: made DBLRU final,

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781401#action_12781401 ] Michael McCandless commented on LUCENE-1458: bq. perhaps it would help convinc

[jira] Created: (LUCENE-2091) Add BM25 Scoring to Lucene

2009-11-23 Thread Yuval Feinstein (JIRA)
Add BM25 Scoring to Lucene -- Key: LUCENE-2091 URL: https://issues.apache.org/jira/browse/LUCENE-2091 Project: Lucene - Java Issue Type: New Feature Components: contrib/* Reporter: Yuval Feinstei

[jira] Updated: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-2075: -- Attachment: LUCENE-2075.patch I updated the patch to add overrides. I also had to add one Supr

Re: How to submit BM25 scoring library to Lucene

2009-11-23 Thread Michael McCandless
Sounds great! Yes, please open a Jira issue, create a patch, and post it. Mike On Mon, Nov 23, 2009 at 5:58 AM, Yuval Feinstein wrote: > Hi. > I would like to submit a BM25 scoring library to Lucene under contrib. > The library is a modification of Joaquin Perez Iglesias' work described at > t

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781391#action_12781391 ] Michael McCandless commented on LUCENE-2075: bq. To your patch: Looks good, I

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781386#action_12781386 ] Robert Muir commented on LUCENE-2075: - Uwe, thanks. so both the enums behave in a very

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781385#action_12781385 ] Uwe Schindler commented on LUCENE-2075: --- bq. think numeric range might do "out of or

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781384#action_12781384 ] Uwe Schindler commented on LUCENE-2075: --- Have you tried aut with NRQ, too? If not I

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781383#action_12781383 ] Robert Muir commented on LUCENE-2075: - bq. Robert maybe you can try this patch plus au

RE: How to submit BM25 scoring library to Lucene

2009-11-23 Thread Yuval Feinstein
Yes, indeed, I want to contribute this to the Lucene trunk. -- Yuval From: Avi Rosenschein [mailto:arosensch...@gmail.com] Sent: Monday, November 23, 2009 2:11 PM To: java-dev@lucene.apache.org Subject: Re: How to submit BM25 scoring library to Lucene Nice. I believe you want to contribute this t

Re: How to submit BM25 scoring library to Lucene

2009-11-23 Thread Avi Rosenschein
Nice. I believe you want to contribute this to the trunk (which is 3.1), not 2.9.1. -- Avi On Mon, Nov 23, 2009 at 5:58 AM, Yuval Feinstein wrote: > Hi. > I would like to submit a BM25 scoring library to Lucene under contrib. > The library is a modification of Joaquin Perez Iglesias' work descr

Re: Bug in Token.copyTo?

2009-11-23 Thread Shai Erera
Oops, I should have read the documentation better :) On Mon, Nov 23, 2009 at 1:22 PM, Uwe Schindler wrote: > That is correct behaviour, because the target of copyTo must support all > interfaces implemented (see docs). So you cannot copy a Token to a > TermAttribute (because it would be a loss

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781372#action_12781372 ] Michael McCandless commented on LUCENE-2075: I ended up subclassing TermInfo (

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781371#action_12781371 ] Robert Muir commented on LUCENE-1606: - Michael, the problem is this code (automaton it

[jira] Updated: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2075: --- Attachment: LUCENE-2075.patch Attached patch; all tests pass: * Switches the term

[jira] Assigned: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned LUCENE-2075: -- Assignee: Michael McCandless > Share the Term -> TermInfo cache across threads

[jira] Updated: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1458: Attachment: UnicodeTestCase.patch same test, coded in a slightly different way, to show how this c

RE: Bug in Token.copyTo?

2009-11-23 Thread Uwe Schindler
That is correct behaviour, because the target of copyTo must support all interfaces implemented (see docs). So you cannot copy a Token to a TermAttribute (because it would be a loss in information), but the other way round is possible. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http:
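
Editor's illustration of the contract described above, as a minimal sketch under stated assumptions and not Lucene's implementation. TermAttr, OffsetAttr, TermOnly and FullToken are hypothetical names that only mirror the roles of TermAttribute and Token in the message. The source casts the target to every interface it implements itself, so copying from the richer class into the narrower one fails, while the reverse direction works.

```java
/** Hypothetical attribute interfaces; the names only echo those in the thread. */
interface TermAttr {
    void setTerm(String term);
    String term();
}

interface OffsetAttr {
    void setOffset(int start, int end);
    int startOffset();
    int endOffset();
}

/** Carries only a term, like a plain term attribute. */
class TermOnly implements TermAttr {
    private String term = "";

    public void setTerm(String term) { this.term = term; }
    public String term() { return term; }

    /** The target only needs to support TermAttr. */
    void copyTo(Object target) {
        ((TermAttr) target).setTerm(term);
    }
}

/** Carries a term and offsets, like a full token. */
class FullToken implements TermAttr, OffsetAttr {
    private String term = "";
    private int start;
    private int end;

    public void setTerm(String term) { this.term = term; }
    public String term() { return term; }
    public void setOffset(int start, int end) { this.start = start; this.end = end; }
    public int startOffset() { return start; }
    public int endOffset() { return end; }

    /** The target must support every interface this class implements. */
    void copyTo(Object target) {
        // Cast first, so a narrower target fails before anything is written.
        TermAttr termTarget = (TermAttr) target;
        OffsetAttr offsetTarget = (OffsetAttr) target; // ClassCastException for TermOnly
        termTarget.setTerm(term);
        offsetTarget.setOffset(start, end);
    }
}
```

In this sketch, new FullToken().copyTo(new TermOnly()) throws a ClassCastException because the offsets would have nowhere to go, whereas new TermOnly().copyTo(new FullToken()) succeeds.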

Bug in Token.copyTo?

2009-11-23 Thread Shai Erera
Hi I think that Token.copyTo is buggy. It checks if the AttributeImpl is instanceof Token, and if so reinits. However if it's not, it casts AttributeImpl to *ALL* of TermAttribute, OffsetAttribute, PositionIncrementAttribute etc. and sets them w/ the proper values. However, one can pass an Attribu

[jira] Commented: (LUCENE-2090) convert automaton to char[] based processing and TermRef / TermsEnum api

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781365#action_12781365 ] Robert Muir commented on LUCENE-2090: - Michael, I think I would have to profile things

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781364#action_12781364 ] Robert Muir commented on LUCENE-1458: - Michael, it is a valid unicode String though, t

How to submit BM25 scoring library to Lucene

2009-11-23 Thread Yuval Feinstein
Hi. I would like to submit a BM25 scoring library to Lucene under contrib. The library is a modification of Joaquin Perez Iglesias' work described at the link below: http://nlp.uned.es/~jperezi/Lucene-BM25/ a. Do I need a Jira issue for this? b. I have explain() methods inside some Scorers. I know

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781353#action_12781353 ] Michael McCandless commented on LUCENE-1458: bq. how do i seek to U+D866 in th

[jira] Commented: (LUCENE-2090) convert automaton to char[] based processing and TermRef / TermsEnum api

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781344#action_12781344 ] Michael McCandless commented on LUCENE-2090: bq. would 'incremental, one char

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781341#action_12781341 ] Michael McCandless commented on LUCENE-1606: bq. Mike, here is an update to yo

RE: KEYS file

2009-11-23 Thread Uwe Schindler
Hi Grant, > > We created new keys during the key-signing at ApacheCon and lots of > > committers upgraded to 4096. Mine is new and 4096 bit and also > > simonw and > > rmuir got new ones (now appearing in KEYS file). > > > > Grant *replaced* his key in the KEYS file, but if Grant signed an > > ol