Re: shingles work in analyzer but not real data

2010-09-07 Thread Chris Hostetter
: Hi Robert, thanks for the response. I've looked into the query parsers a bit and I did find that using the raw parser on a matching multi-word keyword works correctly. I need to have shingling though, in order to support query phrases. It seems odd to have the query parser emitting

Re: shingles work in analyzer but not real data

2010-09-03 Thread Jeff Rose
Thanks Steven and Jonathan, we got it working by using a combination of quoting and the PositionFilterFactory, as shown below. The documentation for the position filter doesn't make much sense without understanding more about how positioning of tokens is taken into account, but it appears to

Re: shingles work in analyzer but not real data

2010-09-03 Thread Jeff Rose
, Flat, and Crowded' Laugh at http://www.yert.com/film.php --- On Fri, 9/3/10, Jeff Rose j...@globalorange.nl wrote: From: Jeff Rose j...@globalorange.nl Subject: Re: shingles work in analyzer but not real data To: solr-user@lucene.apache.org Date: Friday, September 3, 2010, 1:48 AM

RE: shingles work in analyzer but not real data

2010-09-03 Thread Steven A Rowe
, 2010 5:06 AM To: solr-user@lucene.apache.org Subject: Re: shingles work in analyzer but not real data Anyone got a definitive, authoritative link to the definition of a 'shingle' in search engine results/technology? Dennis Gearon Signature Warning EARTH has a Right

Re: shingles work in analyzer but not real data

2010-09-03 Thread Lance Norskog
Steve -Original Message- From: Dennis Gearon [mailto:gear...@sbcglobal.net] Sent: Friday, September 03, 2010 5:06 AM To: solr-user@lucene.apache.org Subject: Re: shingles work in analyzer but not real data Anyone got a definitive, authoritative link to the definition of a 'shingle

Re: shingles work in analyzer but not real data

2010-09-03 Thread Dennis Gearon
...@gmail.com Subject: Re: shingles work in analyzer but not real data To: solr-user@lucene.apache.org Date: Friday, September 3, 2010, 9:55 PM http://en.wikipedia.org/wiki/W-shingling On Fri, Sep 3, 2010 at 6:19 AM, Steven A Rowe sar...@syr.edu wrote: Hi Dennis, I took a stab at answering

Re: shingles work in analyzer but not real data

2010-09-02 Thread Jeff Rose
On Wed, Sep 1, 2010 at 3:35 PM, Robert Muir rcm...@gmail.com wrote: On Wed, Sep 1, 2010 at 8:21 AM, Jeff Rose j...@globalorange.nl wrote: Hi, We are using SOLR to match query strings with a keyword database, where some of the keywords are actually more than one word. For example a

RE: shingles work in analyzer but not real data

2010-09-02 Thread Steven A Rowe
Subject: Re: shingles work in analyzer but not real data On Wed, Sep 1, 2010 at 3:35 PM, Robert Muir rcm...@gmail.com wrote: On Wed, Sep 1, 2010 at 8:21 AM, Jeff Rose j...@globalorange.nl wrote: Hi, We are using SOLR to match query strings with a keyword database, where some

Re: shingles work in analyzer but not real data

2010-09-02 Thread Jonathan Rochkind
I've run into this before too. Both the dismax and solr-lucene _query parsers_ will tokenize a query on whitespace _before_ they pass the query to any field analyzers. There are reasons for this: lots of things wouldn't work if they didn't. But it makes your approach kind of
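The effect described here can be illustrated with a toy model. This is only a sketch in Python, not Solr's actual code: `shingles`, `analyze`, and `parse_then_analyze` are hypothetical names, and the toy analyzer emits only two-word shingles (no single-word tokens) to keep the point visible. It shows why an analysis chain that produces the shingle on the analysis page can produce nothing useful once the query has already been split on whitespace.

```python
def shingles(tokens, size=2):
    """Return word n-grams (shingles) of the given size from a token list."""
    return [" ".join(tokens[i:i + size]) for i in range(len(tokens) - size + 1)]


def analyze(text, size=2):
    """Toy field analyzer (assumption): whitespace-tokenize, lowercase, shingle."""
    return shingles(text.lower().split(), size)


def parse_then_analyze(query, size=2):
    """Toy query parser (assumption): split on whitespace first, then run the
    analyzer on each piece separately, so it never sees two adjacent words."""
    result = []
    for chunk in query.split():
        result.extend(analyze(chunk, size))
    return result


# The analyzer alone sees both words and can build the shingle.
print(analyze("apple pie"))             # ['apple pie']

# After the parser has split the query, each piece is a single word,
# so no two-word shingle can be formed.
print(parse_then_analyze("apple pie"))  # []
```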

Re: shingles work in analyzer but not real data

2010-09-02 Thread Dennis Gearon
, Jonathan Rochkind rochk...@jhu.edu wrote: From: Jonathan Rochkind rochk...@jhu.edu Subject: Re: shingles work in analyzer but not real data To: solr-user@lucene.apache.org solr-user@lucene.apache.org Cc: Vishal Patel vishal_pa...@silvertouch.com, Michiel Willekens michiel.willek

shingles work in analyzer but not real data

2010-09-01 Thread Jeff Rose
Hi, We are using SOLR to match query strings with a keyword database, where some of the keywords are actually more than one word. For example a keyword might be "apple pie" and we only want it to match for a query containing that word pair, but not one only containing "apple". Here is the relevant

Re: shingles work in analyzer but not real data

2010-09-01 Thread Robert Muir
On Wed, Sep 1, 2010 at 8:21 AM, Jeff Rose j...@globalorange.nl wrote: Hi, We are using SOLR to match query strings with a keyword database, where some of the keywords are actually more than one word. For example a keyword might be "apple pie" and we only want it to match for a query

Re: shingles work in analyzer but not real data

2010-09-01 Thread Markus Jelsma
If your use-case is limited to this, why don't you encapsulate all queries in double quotes? On Wednesday 01 September 2010 14:21:47 Jeff Rose wrote: Hi, We are using SOLR to match query strings with a keyword database, where some of the keywords are actually more than one word. For
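A minimal sketch of the quoting suggestion, written client-side in Python. The helper names `quote_phrase` and `build_params` are hypothetical, and the parameter name `q` is assumed to be the usual Solr query parameter; the escaping covers only backslashes and embedded double quotes, which may not be everything a given query parser needs.

```python
from urllib.parse import urlencode


def quote_phrase(user_query):
    """Wrap the whole user query in double quotes so the query parser
    treats it as one phrase instead of splitting it on whitespace."""
    # Escape backslashes first, then embedded double quotes.
    escaped = user_query.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def build_params(user_query):
    """Build a URL-encoded query string with the quoted phrase as `q`."""
    return urlencode({"q": quote_phrase(user_query)})


print(quote_phrase("apple pie"))  # "apple pie"
print(build_params("apple pie"))  # q=%22apple+pie%22
```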