Thank you very much,
this information is worht it's weight in gold. So far, we've used the
asterisk method because it seemed logical and straight-forward. We will
slowly migrate to a version using EdgeNGramFilterFactory.
Thanks a bunch.
Am 07.10.2014 14:42 schrieb Alexandre Rafalovitch:
On 7 October 2014 08:25, Thomas Michael Engelke
<thomas.enge...@posteo.de> wrote:
So the culprit is the asterisk at the end. As far as we can read from
the docs, an asterisk is just 0 or more characters, which means that
the literal word in front of the asterisk should match the query.
Not quite: http://wiki.apache.org/solr/MultitermQueryAnalysis [1]
It's actually quite complicated and even depends on exact version of
Solr you are using. In fact, out of all the analyzers you showed
above, I think only LowerCase will be present on the chain. Look for
(multi) marker at: http://www.solr-start.com/info/analyzers/ [2] for
more
details.
On a higher level, I would suggest getting away from *-based expansion
and looking at EdgeNGrams instead. You can see an example of
autocomplete at
http://www.solr-start.com/javadoc/solr-lucene/index.html [3] and the
matching configuration at:
https://github.com/arafalov/Solr-Javadoc/blob/master/JavadocIndex/JavadocCollection/conf/schema.xml#L24
[4]
Or a dedicated Suggester module, though information on that is a bit
harder to find.
Regards,
Alex.
Personal: http://www.outerthoughts.com/ [5] and @arafalov
Solr resources and newsletter: http://www.solr-start.com/ [6] and
@solrstart
Solr popularizers community:
https://www.linkedin.com/groups?gid=6713853 [7]
Links:
------
[1] http://wiki.apache.org/solr/MultitermQueryAnalysis
[2] http://www.solr-start.com/info/analyzers/
[3] http://www.solr-start.com/javadoc/solr-lucene/index.html
[4]
https://github.com/arafalov/Solr-Javadoc/blob/master/JavadocIndex/JavadocCollection/conf/schema.xml#L24
[5] http://www.outerthoughts.com/
[6] http://www.solr-start.com/
[7] https://www.linkedin.com/groups?gid=6713853