Re: Lots of results

2009-12-05 Thread DM Smith
On Dec 5, 2009, at 5:22 PM, Grant Ingersoll wrote: > At ScaleCamp yesterday in the UK, I was listening to a talk on Xapian and the > speaker said one of the optimizations they do when retrieving a large result > set is that instead of managing a Priority Queue, they just allocate a large > arr

Build failed in Hudson: Lucene-trunk #1019

2009-12-05 Thread Apache Hudson Server
See Changes: [uschindler] Use better format for md5sum/sha1 sum on package build (binary files should have * before file name). The format attribute does that automatically. [rmuir] fix enwiki (and a few others) task, DocMak

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786566#action_12786566 ] Robert Muir commented on LUCENE-1606: - btw one thing we could do is put this nextValid

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1606: Attachment: LUCENE-1606.patch here is an update to the last one, using UnicodeUtil constants, etc.

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1606: Attachment: LUCENE-1606.patch in this patch, i take some commented out code in UnicodeUtil (valid

[jira] Issue Comment Edited: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786557#action_12786557 ] Robert Muir edited comment on LUCENE-1606 at 12/6/09 2:05 AM: --

[jira] Issue Comment Edited: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786557#action_12786557 ] Robert Muir edited comment on LUCENE-1606 at 12/6/09 2:01 AM: --

[jira] Issue Comment Edited: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786557#action_12786557 ] Robert Muir edited comment on LUCENE-1606 at 12/6/09 1:59 AM: --

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786557#action_12786557 ] Robert Muir commented on LUCENE-1606: - here is an explanation of the cleanupPosition,

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786555#action_12786555 ] Robert Muir commented on LUCENE-1606: - yes, Mark you have it right. This is not an iss

[jira] Issue Comment Edited: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786551#action_12786551 ] Mark Miller edited comment on LUCENE-1606 at 12/6/09 1:00 AM: --

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786551#action_12786551 ] Mark Miller commented on LUCENE-1606: - Sorry - haven't been paying a lot of attention

[jira] Commented: (LUCENE-2117) Fix SnowballAnalyzer casing behavior for Turkish Language

2009-12-05 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786549#action_12786549 ] Simon Willnauer commented on LUCENE-2117: - Robert, Patch looks good and all tests

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786542#action_12786542 ] Robert Muir commented on LUCENE-1606: - bq. Also going over the code, but thats going t

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786541#action_12786541 ] Mark Miller commented on LUCENE-1606: - bq. Yeah I tried to do some of this in a very q

Re: Lots of results

2009-12-05 Thread Paul Elschot
Could one get the best of both worlds by not heapifying the PQ until it is full? Regards, Paul Elschot Op zondag 06 december 2009 00:01:49 schreef Grant Ingersoll: > > On Dec 5, 2009, at 10:47 PM, Earwin Burrfoot wrote: > > > If someone needs all results, they know it beforehand. Why can't they

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786536#action_12786536 ] Robert Muir commented on LUCENE-1606: - Mark oh ok, well thanks for spending so much ti

Re: [jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir
Hi Ghazal, I think if you are looking to help with Lucene in general, the HowToContribute link is the best place to start: http://wiki.apache.org/lucene-java/HowToContribute We are not working with the source code from the zip file, but instead the latest unreleased code in the subversion reposito

[jira] Issue Comment Edited: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786529#action_12786529 ] Mark Miller edited comment on LUCENE-1606 at 12/5/09 11:18 PM: -

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786533#action_12786533 ] Mark Miller commented on LUCENE-1606: - bq. And definitely correctness Right - tha

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786531#action_12786531 ] Robert Muir commented on LUCENE-1606: - bq. Right, but I'm not really testing for benef

[jira] Issue Comment Edited: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786529#action_12786529 ] Mark Miller edited comment on LUCENE-1606 at 12/5/09 11:06 PM: -

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786529#action_12786529 ] Mark Miller commented on LUCENE-1606: - bq. I think thats pretty small Okay, fair eno

Re: [jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Ghazal Gharooni
Hello, Thank you all for your description. Actually, this is my first experiences in an open source community. I downloaded the source code (lucene-3.0.0.zip) and would like to work on part of the code in order to learn new skills from group and have a positive contribution. To be honest, I really

Re: Lots of results

2009-12-05 Thread Grant Ingersoll
On Dec 5, 2009, at 10:47 PM, Earwin Burrfoot wrote: > If someone needs all results, they know it beforehand. Why can't they > write this collector themselves? It's trivial, just like you said. I'm not following your comment. Of course they can write it. But that's true for all the implementat

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786528#action_12786528 ] Robert Muir commented on LUCENE-1606: - bq. I'm not sure at the moment - but its wikipe

Re: Lots of results

2009-12-05 Thread Earwin Burrfoot
If someone needs all results, they know it beforehand. Why can't they write this collector themselves? It's trivial, just like you said. On Sun, Dec 6, 2009 at 01:22, Grant Ingersoll wrote: > At ScaleCamp yesterday in the UK, I was listening to a talk on Xapian and the > speaker said one of the

RE: Push fast-vector-highlighter mvn artifacts for 3.0 and 2.9

2009-12-05 Thread Uwe Schindler
I rebuilt the maven-dir for 2.9.1 and 3.0.0, merged them (3.0.0 is top-level version) and extracted only fast-vector-highlighter: http://people.apache.org/~uschindler/staging-area/ I will copy this dir to the maven folder on people.a.o, when I got votes (how many)? At least someone should check t

[jira] Updated: (LUCENE-2108) SpellChecker file descriptor leak - no way to close the IndexSearcher used by SpellChecker internally

2009-12-05 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-2108: Attachment: LUCENE-2108_test_java14.patch Mike, I changed the testcase to be 1.4 compatibl

Lots of results

2009-12-05 Thread Grant Ingersoll
At ScaleCamp yesterday in the UK, I was listening to a talk on Xapian and the speaker said one of the optimizations they do when retrieving a large result set is that instead of managing a Priority Queue, they just allocate a large array to hold all of the results and then sort afterward. Seem

Re: [jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir
Hi Ghazal, I am sorry this one is a bit confusing. I think it is because a lot of people are working on it (which is great) and a lot of ideas going back and forth, causing lots of files to be uploaded, etc. Can you tell us more about your interest in working with NFA/DFA in Lucene? I am very cur

Re: [jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Mark Miller
Could you be more specific :) This patch is part of an issue to add an AutomatonQuery class to Lucene that allows for a fast RegexpQuery and replaces our WildcardQuery impl. Its being developed in two flavors - one for the current trunk version of Lucene, and a slightly altered version for our "fl

Re: [jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Simon Willnauer
On Sat, Dec 5, 2009 at 10:58 PM, Ghazal Gharooni wrote: > Hello, > > I am new in the community and I've completely been confused. Please anybody > help me out to know which part of codes you are working with. How should I > participate in work? Thank you! Hi Ghazal, what exact information do you

[jira] Commented: (LUCENE-2117) Fix SnowballAnalyzer casing behavior for Turkish Language

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786518#action_12786518 ] Robert Muir commented on LUCENE-2117: - bq. I'm still thinking about moving snowball in

Re: [jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Ghazal Gharooni
Hello, I am new in the community and I've completely been confused. Please anybody help me out to know which part of codes you are working with. How should I participate in work? Thank you! On Sat, Dec 5, 2009 at 1:02 PM, Uwe Schindler (JIRA) wrote: > > [ > https://issues.apache.org/jira/

[jira] Updated: (LUCENE-2117) Fix SnowballAnalyzer casing behavior for Turkish Language

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2117: Attachment: LUCENE-2117.patch this patch includes update to pom.xml.template > Fix SnowballAnalyz

[jira] Commented: (LUCENE-2117) Fix SnowballAnalyzer casing behavior for Turkish Language

2009-12-05 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786515#action_12786515 ] Simon Willnauer commented on LUCENE-2117: - Robert, the patch looks almost good. Yo

[jira] Commented: (LUCENE-2100) Make contrib analyzers final

2009-12-05 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786514#action_12786514 ] Simon Willnauer commented on LUCENE-2100: - bq. There are not many analyzers in cor

[jira] Commented: (LUCENE-2100) Make contrib analyzers final

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786512#action_12786512 ] Robert Muir commented on LUCENE-2100: - Hi Simon, this sounds good to me if we clean up

RE: Push fast-vector-highlighter mvn artifacts for 3.0 and 2.9

2009-12-05 Thread Uwe Schindler
I will regenerate both artifacts and publish in my people.a.o home (2.9.1 and 3.0, but not 2.9.0). Also 2.9.0? That’s not what you want! Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Simon Willna

[jira] Assigned: (LUCENE-2117) Fix SnowballAnalyzer casing behavior for Turkish Language

2009-12-05 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer reassigned LUCENE-2117: --- Assignee: Simon Willnauer > Fix SnowballAnalyzer casing behavior for Turkish Languag

[jira] Assigned: (LUCENE-2100) Make contrib analyzers final

2009-12-05 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer reassigned LUCENE-2100: --- Assignee: Simon Willnauer > Make contrib analyzers final > -

Re: Push fast-vector-highlighter mvn artifacts for 3.0 and 2.9

2009-12-05 Thread Simon Willnauer
On Sat, Dec 5, 2009 at 10:25 PM, Grant Ingersoll wrote: > I suppose we could put up the artifacts on a dev site and then we could vote > to release both of them pretty quickly.  I think that should be easy to do, > since it pretty much only involves verifying the jar and the signatures. Yep - th

Re: Push fast-vector-highlighter mvn artifacts for 3.0 and 2.9

2009-12-05 Thread Grant Ingersoll
I suppose we could put up the artifacts on a dev site and then we could vote to release both of them pretty quickly. I think that should be easy to do, since it pretty much only involves verifying the jar and the signatures. On Dec 5, 2009, at 1:03 PM, Simon Willnauer wrote: > hi folks, > The

[jira] Updated: (LUCENE-2100) Make contrib analyzers final

2009-12-05 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-2100: Attachment: LUCENE-2100.patch This patch marks all analyzers in contrib as final and remov

[jira] Commented: (LUCENE-2108) SpellChecker file descriptor leak - no way to close the IndexSearcher used by SpellChecker internally

2009-12-05 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786506#action_12786506 ] Simon Willnauer commented on LUCENE-2108: - bq. Thanks for fixing this! YW! very go

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786502#action_12786502 ] Robert Muir commented on LUCENE-1606: - bq. I'm not sure at the moment - but its wikipe

[jira] Updated: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-2110: -- Attachment: (was: LUCENE-2110.patch) > Change FilteredTermsEnum to work like Iterator, so

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1606: -- Attachment: (was: LUCENE-1606-flex.patch) > Automaton Query/Filter (scalable regex) >

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1606: -- Attachment: LUCENE-1606-flex.patch again - krr to the hell with the AM/PM bug in JIRA! It is *

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786498#action_12786498 ] Mark Miller commented on LUCENE-1606: - bq. how many uniq terms is the field you are te

[jira] Commented: (LUCENE-2100) Make contrib analyzers final

2009-12-05 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786499#action_12786499 ] Simon Willnauer commented on LUCENE-2100: - bq. Simon what do you suggest? Instead

[jira] Commented: (LUCENE-2108) SpellChecker file descriptor leak - no way to close the IndexSearcher used by SpellChecker internally

2009-12-05 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786497#action_12786497 ] Shalin Shekhar Mangar commented on LUCENE-2108: --- I ran into index corruption

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1606: -- Attachment: LUCENE-1606-flex.patch Here is the patch with the getEnum/getTermsEnum changes ins

[jira] Commented: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786494#action_12786494 ] Uwe Schindler commented on LUCENE-2110: --- +1, I reverted it here completely. It is no

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786493#action_12786493 ] Robert Muir commented on LUCENE-1606: - Mark, thanks for testing! Yes, the new wildcar

[jira] Issue Comment Edited: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786489#action_12786489 ] Mark Miller edited comment on LUCENE-1606 at 12/5/09 8:43 PM: --

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786489#action_12786489 ] Mark Miller commented on LUCENE-1606: - The new WildcardQuery is holding up very well u

[jira] Commented: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786487#action_12786487 ] Robert Muir commented on LUCENE-2110: - yeah compared to the last patch, the backwards

[jira] Commented: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786486#action_12786486 ] Uwe Schindler commented on LUCENE-2110: --- Robert and me analyzed the latest change. I

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1606: -- Attachment: LUCENE-1606-flex.patch Now the final one. I somehow need a test enum which does v

[jira] Updated: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-2110: -- Attachment: LUCENE-2110.patch Now the final one. I somehow need a test enum which does very s

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1606: -- Attachment: (was: LUCENE-1606-flex.patch) > Automaton Query/Filter (scalable regex) >

[jira] Updated: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-2110: -- Attachment: (was: LUCENE-2110.patch) > Change FilteredTermsEnum to work like Iterator, so

[jira] Commented: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786480#action_12786480 ] Uwe Schindler commented on LUCENE-2110: --- Stop everything I get a collaps! Again

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786479#action_12786479 ] Uwe Schindler commented on LUCENE-1606: --- Stop everything I get a collaps! Again

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1606: -- Attachment: LUCENE-1606-flex.patch There was a bug in the patch before, sorry. I will finish w

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1606: -- Attachment: (was: LUCENE-1606-flex.patch) > Automaton Query/Filter (scalable regex) >

[jira] Updated: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-2110: -- Attachment: LUCENE-2110.patch fixed patch - i have to stop for today. > Change FilteredTermsE

[jira] Updated: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-2110: -- Attachment: (was: LUCENE-2110.patch) > Change FilteredTermsEnum to work like Iterator, so

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1606: -- Attachment: LUCENE-1606-flex.patch An update with the changed nextSeekTerm() semantics from LU

[jira] Updated: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-2110: -- Attachment: LUCENE-2110.patch Attached is patch that allows the TermsEnum to go backwards and

[jira] Resolved: (LUCENE-2037) Allow Junit4 tests in our environment.

2009-12-05 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-2037. Resolution: Fixed > Allow Junit4 tests in our environment. > -

[jira] Commented: (LUCENE-2037) Allow Junit4 tests in our environment.

2009-12-05 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786460#action_12786460 ] Michael McCandless commented on LUCENE-2037: OK patch looks good, thanks Erick

[jira] Commented: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786457#action_12786457 ] Uwe Schindler commented on LUCENE-2110: --- I have a solution for this problem: If the

[jira] Updated: (LUCENE-2117) Fix SnowballAnalyzer casing behavior for Turkish Language

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2117: Attachment: LUCENE-2117.patch patch for the bug that: * for Turkish language, when Version >= 3.1,

[jira] Commented: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786452#action_12786452 ] Uwe Schindler commented on LUCENE-2110: --- It will work (theoretically) but can fail:

[jira] Commented: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786451#action_12786451 ] Michael McCandless commented on LUCENE-2110: bq. nextSeekTerm() should always

[jira] Commented: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786439#action_12786439 ] Robert Muir commented on LUCENE-2110: - Uwe, I really like what you have done here (as

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786438#action_12786438 ] Robert Muir commented on LUCENE-1606: - Hi Uwe, I ran my benchmarks, and with your patc

[jira] Commented: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786433#action_12786433 ] Mark Miller commented on LUCENE-2110: - No problem, we can get it after - its not reall

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1606: -- Attachment: (was: LUCENE-1606-flex.patch) > Automaton Query/Filter (scalable regex) >

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1606: -- Attachment: LUCENE-1606-flex.patch New patch, there was a lost private field. Also changed the

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786431#action_12786431 ] Robert Muir commented on LUCENE-1606: - bq. Robert: Can you do performance tests with t

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1606: -- Attachment: LUCENE-1606-flex.patch Here a flex patch for automaton. It contains LUCENE-2110, a

[jira] Commented: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786426#action_12786426 ] Uwe Schindler commented on LUCENE-2110: --- Mark: I do not know about what you are talk

[jira] Updated: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-2110: -- Attachment: LUCENE-2110.patch After porting Automaton, I realized, that the seeking code shoul

[jira] Updated: (LUCENE-2037) Allow Junit4 tests in our environment.

2009-12-05 Thread Erick Erickson (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erick Erickson updated LUCENE-2037: --- Attachment: LUCENE-2037.patch Had enough time this morning to reconcile this with Kay Kay's

[jira] Assigned: (LUCENE-2096) Investigate parallelizing Ant junit tests

2009-12-05 Thread Erick Erickson (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erick Erickson reassigned LUCENE-2096: -- Assignee: (was: Erick Erickson) Maybe for later > Investigate parallelizing Ant j

Release artifacts

2009-12-05 Thread DM Smith
I'm wondering about the size of the builds, which are surprisingly big to me. The src is 12M/13M and the bin is 17M/26M (tar.gz/zip) for 2.9.1, similar for 3.0.0. In looking at the binary artifact I see the following: * Every contrib jar has a corresponding javadoc jar, but there is no core-jav

[jira] Commented: (LUCENE-2110) Change FilteredTermsEnum to work like Iterator, so it is not positioned and next() must be always called first. Remove empty()

2009-12-05 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786411#action_12786411 ] Mark Miller commented on LUCENE-2110: - Hey Uwe, since your editing this code anyway, w

[jira] Commented: (LUCENE-2108) SpellChecker file descriptor leak - no way to close the IndexSearcher used by SpellChecker internally

2009-12-05 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786410#action_12786410 ] Simon Willnauer commented on LUCENE-2108: - Mike, I just realized that we need to c

[jira] Commented: (LUCENE-1526) For near real-time search, use paged copy-on-write BitVector impl

2009-12-05 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786408#action_12786408 ] John Wang commented on LUCENE-1526: --- Yes, we still see the issue. The performance/stress

[jira] Commented: (LUCENE-2108) SpellChecker file descriptor leak - no way to close the IndexSearcher used by SpellChecker internally

2009-12-05 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786406#action_12786406 ] Michael McCandless commented on LUCENE-2108: bq. We should backport this chang

[jira] Commented: (LUCENE-1526) For near real-time search, use paged copy-on-write BitVector impl

2009-12-05 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786404#action_12786404 ] Michael McCandless commented on LUCENE-1526: Jake, have you guys had a chance

[jira] Updated: (LUCENE-2112) Flex on non-flex emulation of TermsEnum incorrectly seeks/nexts beyond current field

2009-12-05 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2112: --- Attachment: LUCENE-2112.patch Attached patch; added 2 new test cases... fixed a few

[jira] Updated: (LUCENE-2116) Add link to irc channel #lucene on the website

2009-12-05 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-2116: Attachment: LUCENE-2116.patch Created a patch for the website. As mike mentioned in the c

Push fast-vector-highlighter mvn artifacts for 3.0 and 2.9

2009-12-05 Thread Simon Willnauer
hi folks, The maven artifacts for fast-vector-highlighter have never been pushed since it was released because there were no pom.xml.template inside the module. I added a pom file a day ago in the context of LUCENE-2107. I already talked to uwe and grant how to deal with this issues and if we shoul

[jira] Created: (LUCENE-2117) Fix SnowballAnalyzer casing behavior for Turkish Language

2009-12-05 Thread Simon Willnauer (JIRA)
Fix SnowballAnalyzer casing behavior for Turkish Language - Key: LUCENE-2117 URL: https://issues.apache.org/jira/browse/LUCENE-2117 Project: Lucene - Java Issue Type: Bug Comp

[jira] Resolved: (LUCENE-2102) LowerCaseFilter for Turkish language

2009-12-05 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer resolved LUCENE-2102. - Resolution: Fixed Committed in revision 887535 Thanks Ahmet / Robert! > LowerCaseFilte

  1   2   >