Re: dash-words

2006-08-02 Thread Chris Hostetter
: with a query like this +arbeiterjugend +west-berlin I get no results. : : org.apache.lucene.queryParser.QueryParser.parse makes this query (with : WordDelimiterFilter) with Default QueryParser.AND_OPERATOR: : : +titel:arbeiterjugend +titel:"west (berlin westberlin)" : : with +arbeiterjugend +west

Re: dash-words

2006-08-01 Thread Martin Braun
Hi Yonik, >> So a Phrase search to "The xmen story" will fail. With a slop of 1 the >> doc will be found. >> >> But when generating the query I won't know when to use a slop. So adding >> slops isn't a nice solution. > > If you can't tolerate slop, this is a problem. I use the WordDelimiterFilte

Re: dash-words

2006-07-25 Thread karl wettin
On Tue, 2006-07-25 at 11:42 -0400, Yonik Seeley wrote: > > > Yes, it will fail without slop... I don't think there is a > > > practical way around that. It would of course be much easier if Lucene supported multiple token dimensions instead of position increment only. > the x-men are here >

Re: dash-words

2006-07-25 Thread Yonik Seeley
On 7/25/06, Martin Braun <[EMAIL PROTECTED]> wrote: Hi Yonik, >> I can't figure out what the parameters do. ;) > > Yes, it will fail without slop... I don't think there is a practical > way around that. I am trying to analyze your WordDelimiterFilter. If I have x-men, after analyzing (with c

Re: dash-words

2006-07-25 Thread Martin Braun
Hi Yonik, >> I can't figure out what the parameters do. ;) > > Yes, it will fail without slop... I don't think there is a practical > way around that. I am trying to analyze your WordDelimiterFilter. If I have x-men, after analyzing (with catenateAll) I get this: Analyzing "The x-men story

Re: dash-words

2006-07-25 Thread Yonik Seeley
On 7/25/06, karl wettin <[EMAIL PROTECTED]> wrote: On Mon, 2006-07-24 at 21:16 -0400, Yonik Seeley wrote: > > > I can't figure out what the parameters do. ;) > > Hopefully the wiki link I gave before will explain the parameters. Oh, I so totally missed that. Do you want me to java-doc it up a

Re: dash-words

2006-07-25 Thread karl wettin
On Mon, 2006-07-24 at 21:16 -0400, Yonik Seeley wrote: > > > I can't figure out what the parameters do. ;) > > Hopefully the wiki link I gave before will explain the parameters. Oh, I so totally missed that. Do you want me to java-doc it up and send you the patch?

Re: dash-words

2006-07-24 Thread Yonik Seeley
> I can't figure out what the parameters do. ;) Hopefully the wiki link I gave before will explain the parameters. -Yonik http://incubator.apache.org/solr Solr, the open-source Lucene search server

Re: dash-words

2006-07-24 Thread Yonik Seeley
On 7/24/06, karl wettin <[EMAIL PROTECTED]> wrote: On Mon, 2006-07-24 at 15:17 +0200, karl wettin wrote: > On Mon, 2006-07-24 at 15:15 +0200, karl wettin wrote: > > Yes, it affects PhraseQuery. Only "the x men are" will match. > > I'm stupid. Forget about it. I should of course analyze the query

Re: dash-words

2006-07-24 Thread Yonik Seeley
On 7/24/06, karl wettin <[EMAIL PROTECTED]> wrote: > WordDelimiterFilter from Solr does this > It also has the false match problem you mention... Will it affect a phrase query? Yes... adding some slop to phrase queries is the best way to deal with that. -Yonik

Re: dash-words

2006-07-24 Thread karl wettin
On Mon, 2006-07-24 at 15:17 +0200, karl wettin wrote: > On Mon, 2006-07-24 at 15:15 +0200, karl wettin wrote: > > Yes, it affects PhraseQuery. Only "the x men are" will match. > > I'm stupid. Forget about it. I should of course analyze the query too. But still it fails on xmen. Could it have some

Re: dash-words

2006-07-24 Thread karl wettin
On Mon, 2006-07-24 at 15:15 +0200, karl wettin wrote: > Yes, it affects PhraseQuery. Only "the x men are" will match. I'm stupid. Forget about it. I should of course analyze the query too.

Re: dash-words

2006-07-24 Thread karl wettin
On Mon, 2006-07-24 at 13:51 +0200, karl wettin wrote: > On Mon, 2006-07-24 at 00:34 -0400, Yonik Seeley wrote: > > > filter words with a dash > > > > > > ["x-men"] > > > ["xmen"] > > > ["x", "men"] > > > > > > The problem is ["x", "men"] requiring a distance between the terms > > > and thus also ma

Re: dash-words

2006-07-24 Thread karl wettin
On Mon, 2006-07-24 at 00:34 -0400, Yonik Seeley wrote: > > filter words with a dash > > > > ["x-men"] > > ["xmen"] > > ["x", "men"] > > > > The problem is ["x", "men"] requiring a distance between the terms > > and thus also matching "x-men men". > > WordDelimiterFilter from Solr does this > It a

Re: dash-words

2006-07-24 Thread Martin Braun
Yonik Seeley wrote: > On 7/23/06, karl wettin <[EMAIL PROTECTED]> wrote: >> I want to filter words with a dash in them. >> >> ["x-men"] >> ["xmen"] >> ["x", "men"] >> >> All of the above should be synonyms. The problem is ["x", "men"] requiring a >> distance between the terms and thus also matching

Re: dash-words

2006-07-23 Thread Yonik Seeley
On 7/23/06, karl wettin <[EMAIL PROTECTED]> wrote: I want to filter words with a dash in them. ["x-men"] ["xmen"] ["x", "men"] All of the above should be synonyms. The problem is ["x", "men"] requiring a distance between the terms and thus also matching "x-men men". WordDelimiterFilter from Sol

dash-words

2006-07-23 Thread karl wettin
I want to filter words with a dash in them. ["x-men"] ["xmen"] ["x", "men"] All of the above should be synonyms. The problem is ["x", "men"] requiring a distance between the terms and thus also matching "x-men men". Or? How about storing ["x", "men"] as the first term and the use set a negative pos
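
The slop behaviour discussed in this thread can be illustrated with a small Python sketch. It is not Lucene or Solr code. The functions analyze and phrase_matches are hypothetical helpers, and two things are assumed: that the catenated form ("xmen") is indexed at the same position as the last sub-word, as the parsed query "west (berlin westberlin)" quoted in the thread suggests, and that slop can be approximated as the spread between document and query position offsets, which is a simplification of how a real sloppy phrase query is scored.

```python
from itertools import product


def analyze(text):
    """Toy analyzer (assumption, not WordDelimiterFilter itself).

    Lowercases, splits on whitespace, splits dash-words into sub-words,
    and adds the catenated form at the position of the last sub-word.
    Returns a list of (term, position) pairs.
    """
    tokens = []
    pos = -1
    for word in text.lower().split():
        parts = [p for p in word.split("-") if p]
        for part in parts:
            pos += 1
            tokens.append((part, pos))
        if len(parts) > 1:
            # Catenated token shares the position of the last part.
            tokens.append(("".join(parts), pos))
    return tokens


def phrase_matches(doc_text, query_text, slop=0):
    """Simplified phrase match: terms at the same query position are
    alternatives; the phrase matches when some choice of document
    positions keeps all (doc_pos - query_pos) offsets within `slop`."""
    doc_positions = {}
    for term, pos in analyze(doc_text):
        doc_positions.setdefault(term, []).append(pos)

    query_slots = {}
    for term, pos in analyze(query_text):
        query_slots.setdefault(pos, []).append(term)

    candidates = []
    for qpos, terms in sorted(query_slots.items()):
        offsets = [d - qpos for t in terms for d in doc_positions.get(t, [])]
        if not offsets:
            return False  # no alternative for this slot occurs in the doc
        candidates.append(offsets)

    return any(max(c) - min(c) <= slop for c in product(*candidates))


doc = "The x-men story"
print(phrase_matches(doc, "the xmen story"))          # False
print(phrase_matches(doc, "the xmen story", slop=1))  # True
print(phrase_matches(doc, "the x-men story"))         # True
```

Under these assumptions the document tokens are the(0) x(1) men(2) xmen(2) story(3), so the query "the xmen story" sees "xmen" one position later than an exact phrase expects, which is why it only matches once a slop of 1 is allowed.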