RE: [VOTE] Release Apache Lucene Java 3.0.0 (take #2)

2009-11-24 Thread Uwe Schindler
Here it was in my original mail: Manifest-Version: 1.0 Ant-Version: Apache Ant 1.7.0 Created-By: 1.5.0_22-b03 (Sun Microsystems Inc.) Specification-Title: Lucene Search Engine Specification-Version: 3.0.0 Specification-Vendor: The Apache Software Foundation Implementation-Title: org.apache.lucene

RE: Hudson Account for me refused

2009-11-24 Thread Uwe Schindler
So who has an account at the moment, I know of Mike and Grant? - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Mark Miller [mailto:markrmil...@gmail.com] > Sent: Tuesday, November 24, 2009 6:52 PM > To:

[jira] Commented: (LUCENE-2090) convert automaton to char[] based processing and TermRef / TermsEnum api

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782065#action_12782065 ] Robert Muir commented on LUCENE-2090: - Mike, I implemented this common suffix, but onl

Re: Hudson Account for me refused

2009-11-24 Thread Grant Ingersoll
Yes, I have one. What are you wanting to do? On Nov 24, 2009, at 1:13 PM, Uwe Schindler wrote: > So who has an account at the moment, I know of Mike and Grant? > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > >> -Orig

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782069#action_12782069 ] Michael McCandless commented on LUCENE-2075: OK as best I can tell, the reason

RE: Hudson Account for me refused

2009-11-24 Thread Uwe Schindler
Two things: https://issues.apache.org/jira/browse/LUCENE-1769 and setting the version number as stated in the release howto to "3.1-dev" for the future builds (which are not working at the moment because node down). The svn dir "nightly" was already changed. Uwe - Uwe Schindler H.-H.-Meier-

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782071#action_12782071 ] Jason Rutherglen commented on LUCENE-2075: -- {quote} And honestly I'm still tempte

[jira] Commented: (LUCENE-2090) convert automaton to char[] based processing and TermRef / TermsEnum api

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782075#action_12782075 ] Robert Muir commented on LUCENE-2090: - I guess now you have me starting to think about

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1606: Attachment: LUCENE-1606.patch this patch removes constant prefix, as its only used in dumb mode, a

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782085#action_12782085 ] Michael McCandless commented on LUCENE-2075: bq. Sounds better than the cachin

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782094#action_12782094 ] Robert Muir commented on LUCENE-2075: - Mike, I think you also might be seeing strangen

[jira] Updated: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2075: --- Attachment: LUCENE-2075.patch New patch attached -- restores (deprecated) TestSimple

Re: Hudson Account for me refused

2009-11-24 Thread Grant Ingersoll
On Nov 24, 2009, at 1:25 PM, Uwe Schindler wrote: > Two things: > > https://issues.apache.org/jira/browse/LUCENE-1769 > > and setting the version number as stated in the release howto to "3.1-dev" > for the future builds (which are not working at the moment because node > down). Done. > The

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782112#action_12782112 ] Uwe Schindler commented on LUCENE-2075: --- Should we additionally deprecate the Simple

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782117#action_12782117 ] Michael McCandless commented on LUCENE-2075: bq. hould we additionally depreca

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782116#action_12782116 ] Michael McCandless commented on LUCENE-2075: {quote} Mike, I think you also mi

[jira] Updated: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2075: --- Attachment: LUCENE-2075.patch Also deprecates SimpleMapCache. > Share the Term -> T

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782121#action_12782121 ] Robert Muir commented on LUCENE-2075: - {quote} But it sounds like you're talking about

[jira] Issue Comment Edited: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782121#action_12782121 ] Robert Muir edited comment on LUCENE-2075 at 11/24/09 7:45 PM: -

RE: Hudson Account for me refused

2009-11-24 Thread Uwe Schindler
> > and setting the version number as stated in the release howto to "3.1- > dev" > > for the future builds (which are not working at the moment because node > > down). > > Done. Thanks! > > The svn dir "nightly" was already changed. > > > I'm not following L-1769. Is there an action to take

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782137#action_12782137 ] Michael McCandless commented on LUCENE-2075: {quote} I guess I worded this wro

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782141#action_12782141 ] Robert Muir commented on LUCENE-2075: - bq. So it sort of plays ping pong w/ the terms

[jira] Issue Comment Edited: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782141#action_12782141 ] Robert Muir edited comment on LUCENE-2075 at 11/24/09 8:21 PM: -

[jira] Resolved: (LUCENE-1260) Norm codec strategy in Similarity

2009-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1260. Resolution: Fixed Thanks Johan! > Norm codec strategy in Similarity > ---

[jira] Commented: (LUCENE-2075) Share the Term -> TermInfo cache across threads

2009-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782152#action_12782152 ] Michael McCandless commented on LUCENE-2075: OK let's move this over to LUCENE

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782155#action_12782155 ] Michael McCandless commented on LUCENE-1606: Responding from LUCENE-2075... {

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782159#action_12782159 ] Robert Muir commented on LUCENE-1606: - {quote} I think I'm confused - if the query is

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782170#action_12782170 ] Michael McCandless commented on LUCENE-1606: But, take the abcd*1234 case -- y

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782169#action_12782169 ] Robert Muir commented on LUCENE-1606: - mike, here is a more complex example of the pin

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782173#action_12782173 ] Robert Muir commented on LUCENE-1606: - bq. But, take the abcd*1234 case - you first se

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782180#action_12782180 ] Robert Muir commented on LUCENE-1606: - I guess here is the big question Mike, pretend

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782184#action_12782184 ] Michael McCandless commented on LUCENE-1606: {quote} pretend ab* isn't rewritt

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782188#action_12782188 ] Robert Muir commented on LUCENE-1606: - bq. But the fixed trailing prefix case I think

[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782198#action_12782198 ] Robert Muir commented on LUCENE-1606: - benchmark results from mike's idea. I don't use

[jira] Issue Comment Edited: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782198#action_12782198 ] Robert Muir edited comment on LUCENE-1606 at 11/24/09 10:04 PM:

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1606: Attachment: (was: LUCENE-1606.patch) > Automaton Query/Filter (scalable regex) > -

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1606: Attachment: LUCENE-1606.patch in this patch, if the automaton is finite, always seek. if its infin

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1606: Attachment: LUCENE-1606.patch sorry, wrong file. getting lost in iterations of this patch. > Aut

Distributing index over N disks

2009-11-24 Thread Otis Gospodnetic
Hello, Would it make sense and be possible to spread different index files over multiple disks (without resorting to putting an index on a RAID)? For example, what if the index files didn't live in a single index dir, but were organized by their type in a snallow dir tree, like this: /path/to/i

RE: Distributing index over N disks

2009-11-24 Thread Uwe Schindler
It is technically doable since 2.9 with FileSwitchDirectory, where you can define file name endings as a filter to which underlying directory the requests go, see http://lucene.apache.org/java/2_9_1/api/core/org/apache/lucene/store/FileSwi tchDirectory.html To have more directories, just use anoth

[jira] Commented: (LUCENE-2091) Add BM25 Scoring to Lucene

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782235#action_12782235 ] Robert Muir commented on LUCENE-2091: - I am using a refactored version of this too, lo

[jira] Updated: (LUCENE-1313) Near Realtime Search (using a built in RAMDirectory)

2009-11-24 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-1313: - Attachment: LUCENE-1313.patch I went back to trying to utilize a RAM dir inside of IW. T

[jira] Commented: (LUCENE-1486) Wildcards, ORs etc inside Phrase queries

2009-11-24 Thread Ahmet Arslan (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782254#action_12782254 ] Ahmet Arslan commented on LUCENE-1486: -- Hi Mark, Up to now, I was consuming ComplexP

Re: Whither Query Norm?

2009-11-24 Thread Otis Gospodnetic
I'm late to the thread, and although it looks like the discussion is over, I'll inline a Q for Jake. >I should add in my $0.02 on whether to just get rid of queryNorm() altogether: >>> >>> -1 from me, even though it's confusing, because having that call there >>> (somewhere, at least) allows y

Re: Whither Query Norm?

2009-11-24 Thread Otis Gospodnetic
Hello, Regarding that monstrous term->idf map. Is this something that one could use to adjust the scores in http://wiki.apache.org/solr/DistributedSearch#Distributed_Searching_Limitations scenario? Say a map like that was created periodically for each shard and distributed to all other nodes (

Re: Whither Query Norm?

2009-11-24 Thread Jake Mannix
On Tue, Nov 24, 2009 at 9:18 PM, Otis Gospodnetic < otis_gospodne...@yahoo.com> wrote: > I'm late to the thread, and although it looks like the discussion is over, > I'll inline a Q for Jake. > > > > >References on how people do this *with Lucene*, or just how this is done > in general? There are

Re: Whither Query Norm?

2009-11-24 Thread Jake Mannix
On Tue, Nov 24, 2009 at 9:18 PM, Otis Gospodnetic < otis_gospodne...@yahoo.com> wrote: > I'm late to the thread, and although it looks like the discussion is over, > I'll inline a Q for Jake. > > You mentioned this about 3 times in this thread (contrib/queries wants > you!) > Yeah, I've got to

Re: Whither Query Norm?

2009-11-24 Thread Jake Mannix
On Tue, Nov 24, 2009 at 9:31 PM, Otis Gospodnetic < otis_gospodne...@yahoo.com> wrote: > Hello, > > Regarding that monstrous term->idf map. > Is this something that one could use to adjust the scores in > http://wiki.apache.org/solr/DistributedSearch#Distributed_Searching_Limitationsscenario? >

[jira] Updated: (LUCENE-2090) convert automaton to char[] based processing and TermRef / TermsEnum api

2009-11-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2090: Attachment: LUCENE-2090_TermRef_flex.patch Attached is a patch to TermRef to implement endsWith()

Re: Whither Query Norm?

2009-11-24 Thread Jake Mannix
Now that Otis reminded me that this thread existed (I've got a brain like a sieve these days, I tell you)... On Fri, Nov 20, 2009 at 10:08 AM, Grant Ingersoll wrote: > > -1 from me, even though it's confusing, because having that call there > (somewhere, at least) allows you to actually do comp

<    1   2