Re: Confused with NGRAM results

2008-09-01 Thread gaz77
his was in. > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > - Original Message >> From: gaz77 <[EMAIL PROTECTED]> >> To: java-user@lucene.apache.org >> Sent: Friday, August 29, 2008 12:50:46 AM >>

Re: Confused with NGRAM results

2008-08-28 Thread Otis Gospodnetic
lucene.apache.org > Sent: Friday, August 29, 2008 12:50:46 AM > Subject: Re: Confused with NGRAM results > > > Thanks for the pointer. > > I've gone into this in some depth, using the AnalyzerUtils class from the > lucene in action book. > > It seems that the NG

Re: Confused with NGRAM results

2008-08-28 Thread gaz77
Thanks for the pointer. I've gone into this in some depth, using the AnalyzerUtils class from the lucene in action book. It seems that the NGramTokenFilter is only processing part of the string that goes in. It stops tokenising the words part way through. That's why the documents weren't found i

RE: Confused with NGRAM results

2008-08-28 Thread Steven A Rowe
Hi gaz77, Here's a good place to start: Steve On 08/28/2008 at 10:52 AM, gaz77 wrote: > > Hi, > > I'd appreciate if someone could explain the results I'm getting. > > I've written a simple custom analyzer that applies the > NGramToken