"This is because the StandardAnalyzer must be splitting the words on
"SPACES" and since there is no space present here. The entire string is
converted into 1 token."
Those statements are inconsistent! I mean, what code is converting the
entire string to 1 token and eliminating white space? Is
text at query time. What is the exact query text and what are the exact
analyzer tokens for that query text and how many are there?
-- Jack Krupansky
-Original Message-
From: Ankit Murarka
Sent: Monday, July 22, 2013 10:29 AM
To: java-user@lucene.apache.org
Subject: Re: Trying to search
First thing first : Same analyzer is being used to index and to search.
Now, I am not using any custom analyzer to split the string and get the
tokens. I was assuming StandardAnalyzer might be using whitespaces to
split the content. If that is not the case then I must have got it
completely wr
I can easily split on periods. The standard analyzer is splitting on
spaces and I can implement a custom analyzer that will split on periods.
However this string will be present say suppose 3-4 times in a huge
file. Now if I proceed to search other terms in this file then I may not
get the pro
Even though you're on the Lucene list, consider installing Solr
just to see the admin/analysis page to see how your index and
query analysis works. There's no reason you couldn't split this
up on periods into separate words and then just use phrase query
to find java.lang.NullPointerException, but