we've played with HyphenationCompoundWordTokenFilterFactory it works better than maintaining a word dictionary to split (although we ended up not using it for reasons i can't recall)
see http://lucene.apache.org/solr/api/org/apache/solr/analysis/HyphenationCompoundWordTokenFilterFactory.html On 9 June 2011 06:42, Gora Mohanty <g...@mimirtech.com> wrote: > On Thu, Jun 9, 2011 at 4:37 AM, Mark Mandel <mark.man...@gmail.com> wrote: >> Not sure if this possible, but figured I would ask the question. >> >> Basically, we have some users who do some pretty rediculous things ;o) >> >> Rather than writing "red jacket", they write "redjacket", which obviously >> returns no results. > [...] > > Have you tried using synonyms, > http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory > It seems like they should fit your use case. > > Regards, > Gora >