Thanks. That got the search working. Do you know if there's a trick for
using FastVectorHighlighter with ngrams?  I followed that doc's advice to
use NGramTokenizer, and right now if the search matches "1234" it will only
highlight "123".

Rob


On Fri, Aug 29, 2014 at 12:18 AM, Jack Krupansky <j...@basetechnology.com>
wrote:

> Use the ngram token filter, and the a query of 512 would match by itself:
> http://lucene.apache.org/core/4_9_0/analyzers-common/org/
> apache/lucene/analysis/ngram/NGramTokenFilter.html
>
> -- Jack Krupansky
>
> -----Original Message----- From: Erick Erickson
> Sent: Thursday, August 28, 2014 11:52 PM
> To: java-user
> Subject: Re: indexing all suffixes to support leading wildcard?
>
>
> The "usual" approach is to index to a second field but.... backwards.
> See ReverseStringFilter... Then all your leading wildcards
> are really trailing wildcards in the reversed field.
>
> Best,
> Erick
>
>
> On Thu, Aug 28, 2014 at 10:38 AM, Rob Nikander <rob.nikan...@gmail.com>
> wrote:
>
>  Hi,
>>
>> I've got some short fields (phone num, email) that I'd like to search
>> using
>> good old string matching.  (The full query is a boolean "or" that also
>> uses
>> real text fields.) I see the warnings about wildcard queries that start
>> with *, and I'm wondering... do you think it would be a good idea to index
>> all the suffixes?  Eg, a phone num 5551234, would become 7 values for the
>> "phoneNum" field: 4, 34, 234, etc.  So "512*" would be a hit.
>>
>> And maybe do something with the boosts so it doesn't overvalue the match
>> when it hits multiple values.  ?
>>
>> Rob
>>
>>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-user-h...@lucene.apache.org
>
>

Reply via email to