Hi Shamik:

One thing that might help is to set the "replaceWhitespaceWith" parameter
both on the QParserPlugin and on your index-time AutoPhrasingTokenFilter.
In my solrconfig.xml I have:

  <queryParser name="autophrasingParser"
               class="com.lucidworks.analysis.AutoPhrasingQParserPlugin">
      <str name="phrases">autophrases.txt</str>
      <str name="replaceWhitespaceWith">_</str>
  </queryParser>

Then, in your fieldType in schema.xml, use the same value:

  <filter class="com.lucidworks.analysis.AutoPhrasingTokenFilterFactory"
          phrases="autophrases.txt" includeTokens="true"
          replaceWhitespaceWith="_" />

The reason for this is that the QParserPlugin tricks the Solr QParser
(lucene, edismax, what have you) into treating the autophrased phrase as a
single token. This is a workaround for LUCENE-2605 (and, as with all
workarounds, it is admittedly a bit kludgy), so it needs to intercept the
phrase and turn it into a single token before the QParser sees it. The
replacement character has to be the same at index time and at query time so
that the tokens match.
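
To make the idea concrete, here is a rough Python sketch of what "turning
the phrase into a single token" amounts to on the query side. It is only an
illustration and not the plugin's actual code: the function name
autophrase_query and its arguments are made up for this example, and the
phrase list stands in for the contents of autophrases.txt.

```python
import re


def autophrase_query(query, phrases, replace_whitespace_with="_"):
    """Rewrite each known multi-word phrase in `query` as one token.

    Hypothetical helper for illustration; `phrases` plays the role of the
    entries in autophrases.txt.
    """
    # Handle longer phrases first so "new york city" wins over "new york".
    for phrase in sorted(phrases, key=len, reverse=True):
        words = phrase.split()
        if len(words) < 2:
            continue  # single words need no rewriting
        # Match the words separated by any run of whitespace.
        pattern = r"\b" + r"\s+".join(re.escape(w) for w in words) + r"\b"
        joined = replace_whitespace_with.join(words)
        # A function replacement avoids backslash-escape surprises in `joined`.
        query = re.sub(pattern, lambda m: joined, query, flags=re.IGNORECASE)
    return query


if __name__ == "__main__":
    print(autophrase_query("seat cushions for new york",
                           ["seat cushions", "new york"]))
```

With "_" as the replacement, a whitespace-splitting query parser sees each
rewritten phrase as one term, which can then match the underscore-joined
token produced at index time.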

I think this was covered in my second blog post, on using the autophrasing
filter to solve the multi-term synonyms problem. It may not have been in the
original blog post - my bad!

Let me know if this solves your problem.

Ted




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Have-anyone-used-Automatic-Phrase-Tokenization-AutoPhrasingTokenFilterFactory-tp4173808p4174100.html
Sent from the Solr - User mailing list archive at Nabble.com.
