Give us some examples of tokens that you are expecting that pattern to
tokenize. And express the pattern in simple English as well. Show some
actual input data.
I suspect that Solr is working fine - but you may not have precisely
specified your pattern. But we don't know what your pattern is supposed to
recognize.
Maybe some of your previous hits had punctuation adjacent to the terms,
and your pattern doesn't recognize that punctuation as a delimiter.
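To make the punctuation point concrete, here is a rough sketch, not Solr itself. It uses Python's re module as a stand-in for java.util.regex, on the assumption that this particular pattern behaves the same in both. The sample sentence and the helper names tokenize and edit_distance are made up for illustration. The distance is plain Levenshtein, whereas Solr's fuzzy matching also counts a transposition as one edit by default, and the stop filter is not modeled.

```python
import re

# Pattern value from the schema in the original message.
PATTERN = re.compile(r"[^a-zA-Z0-9&\-']|\d{0,4}s:")


def tokenize(text):
    """Split on the pattern, drop empty pieces, and lowercase the rest."""
    return [piece.lower() for piece in PATTERN.split(text) if piece]


def edit_distance(a, b):
    """Plain Levenshtein distance (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # delete from a
                           cur[j - 1] + 1,              # insert into a
                           prev[j - 1] + (ca != cb)))   # substitute
        prev = cur
    return prev[-1]


# Hypothetical input: quotes and hyphens are NOT delimiters for this pattern.
sample = "A 'word' here, word-based there, and words."

for tok in tokenize(sample):
    print(tok, edit_distance("worde", tok))
```

In this sketch the quoted and hyphenated forms stay glued together as single tokens, so they end up more than one edit away from "worde", while the bare "words" is still within one edit. If earlier hits depended on such terms being split apart, that could account for some of the drop.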
And use the Solr Admin UI Analysis page to see how your sample input data is
analyzed.
One other thing... without a "group", the pattern specifies what delimiter
sequence will "split" the rest of the input into tokens. I suspect you
didn't mean this.
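As a rough illustration of the difference (again only a sketch, with Python's re module standing in for java.util.regex, and with a made-up sample string), compare using the pattern as a delimiter, which is what happens when no "group" is given, with using a pattern that describes the tokens themselves, which is roughly what group="0" would give you. The token pattern below is hypothetical, just one guess at what was intended.

```python
import re

# Hypothetical sample input.
text = "Tom's re-run words: 1990s: R&D"

# No "group" (default -1): the pattern describes the DELIMITERS, and
# whatever lies between matches becomes a token. Empty pieces are dropped.
delimiter = re.compile(r"[^a-zA-Z0-9&\-']|\d{0,4}s:")
split_tokens = [piece for piece in delimiter.split(text) if piece]

# group="0" style: the pattern describes the TOKENS themselves.
# This token pattern is an assumption, not taken from the original schema.
token = re.compile(r"[a-zA-Z0-9&\-']+")
match_tokens = token.findall(text)

print(split_tokens)
print(match_tokens)
```

Note what the second alternative does in split mode: since \d{0,4} can match zero digits, any "s:" is treated as a delimiter, so "words:" is indexed as "word" and "1990s:" vanishes entirely. That may or may not be what you wanted.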
-- Jack Krupansky
-----Original Message-----
From: meghana
Sent: Friday, April 19, 2013 9:01 AM
To: solr-user@lucene.apache.org
Subject: fuzzy search issue with PatternTokenizer Factory
I'm using Solr 4.2. I have changed my text field definition to use
solr.PatternTokenizerFactory instead of solr.StandardTokenizerFactory, and
changed my schema definition as below:
<fieldType name="text_token" class="solr.TextField"
           positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.PatternTokenizerFactory"
               pattern="[^a-zA-Z0-9&amp;\-']|\d{0,4}s:" />
    <filter class="solr.StopFilterFactory" ignoreCase="true"
            words="stopwords.txt" enablePositionIncrements="false" />
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.PatternTokenizerFactory"
               pattern="[^a-zA-Z0-9&amp;\-']|\d{0,4}s:" />
    <filter class="solr.StopFilterFactory" ignoreCase="true"
            words="stopwords_extra_query.txt" enablePositionIncrements="false" />
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
            ignoreCase="true" expand="true"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
After doing so, fuzzy search does not seem to work properly as it did
before.
I'm searching with the search term: worde~1
Before the change, this search returned around 300 records, but now it
returns only 5 records. I'm not sure what the issue could be.
Can anybody help me make it work?