I suggest you write your own analyzer that doesn't remove non-letter
characters at index time. There might be one out there already, but
not that I can think of off hand.

Instead of leaving the non-letters in place you might consider doing
something with position increments.  I think that would prevent phrase
queries from matching.


--
Ian.


On Fri, Nov 28, 2008 at 5:05 PM, Ng Vinny <[EMAIL PROTECTED]> wrote:
> Hi,
>
> I'm having an issue with PhraseQuery in which a query for the phrase
> "information technology" has among of its matches the strings "information,
> technology" and "information. Technology",  which should not be considered
> as matches.
> Both StopAnalyzer  StandardAnalyzer removes non-letter character at index
> time.
>
> Any suggestions?
>
> Thanks.
>

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to