Re: regex-based query contribution

2005-10-13 Thread Paul Elschot
On Friday 14 October 2005 08:29, Chris Hostetter wrote: > > : A more general solution would be to use a subclass of BooleanQuery that > : provides a Weight that flattens all the weights of the subqueries, for example > : to the maximum weight, and for the rest works like the usual Weight of > : B

Re: regex-based query contribution

2005-10-13 Thread Chris Hostetter
: A more general solution would be to use a subclass of BooleanQuery that : provides a Weight that flattens all the weights of the subqueries, for example : to the maximum weight, and for the rest works like the usual Weight of : BooleanQuery. I'm not grasping all of the ideas in this thread comp

Re: regex-based query contribution

2005-10-13 Thread Paul Elschot
On Thursday 13 October 2005 20:15, markharw00d wrote: > Sounds like a very useful addition but as yet another variant of "term > expanding" queries (fuzzy/prefix/range/wildcard) now might be a good > time to re-raise the scoring issue I originally identified here with all > such queries: http://

Re: regex-based query contribution

2005-10-13 Thread markharw00d
Sounds like a very useful addition but as yet another variant of "term expanding" queries (fuzzy/prefix/range/wildcard) now might be a good time to re-raise the scoring issue I originally identified here with all such queries: http://issues.apache.org/jira/browse/LUCENE-329 The issue is that "

Re: regex-based query contribution

2005-10-13 Thread Erik Hatcher
On Oct 13, 2005, at 7:36 AM, Mikko Noromaa wrote: Hi, It would be possible to do a PatternQuery("*") that would enumerate every term. Does this work differently than the current logic where wildcard queries are constructed as BooleanQueries with many terms OR'ed together? I think this

RE: regex-based query contribution

2005-10-13 Thread Mikko Noromaa
Erik Hatcher [mailto:[EMAIL PROTECTED] > Sent: Thursday, October 13, 2005 1:54 PM > To: [email protected] > Subject: Re: regex-based query contribution > > > > On Oct 13, 2005, at 3:15 AM, Paul Elschot wrote: > >> The main negative to this query, just lik

Re: regex-based query contribution

2005-10-13 Thread Erik Hatcher
On Oct 13, 2005, at 3:15 AM, Paul Elschot wrote: The main negative to this query, just like with WildcardQuery and FuzzyQuery, is the possible performance issue. However, just like WildcardQuery, this really depends on how clever the indexing side of things is and matching that cleverness with

Re: regex-based query contribution

2005-10-13 Thread Paul Elschot
On Thursday 13 October 2005 01:44, Erik Hatcher wrote: > I've developed normal and span-based Query implementations that use > regex to match index terms rather than the simplified WildcardQuery. > This allows for queries like "abc[0-9]xyz" that would match abc1xyz, > but not abc12xyz for ex