Re: Weird Problem (possible bug?) with german stemming and wildcard search

Thomas Michael Engelke Wed, 15 Oct 2014 00:52:00 -0700

Thank you very much,

this information is worth its weight in gold. So far, we've used the asterisk method because it seemed logical and straightforward. We will slowly migrate to a version using EdgeNGramFilterFactory.


Thanks a bunch.

On 07.10.2014 14:42, Alexandre Rafalovitch wrote:

On 7 October 2014 08:25, Thomas Michael Engelke
<thomas.enge...@posteo.de> wrote:
So the culprit is the asterisk at the end. As far as we can read from the docs, an asterisk is just 0 or more characters, which means that the literal word in front of the asterisk should match the query.
Not quite: http://wiki.apache.org/solr/MultitermQueryAnalysis [1]

It's actually quite complicated and even depends on the exact version of
Solr you are using. In fact, out of all the analyzers you showed
above, I think only LowerCase will be present on the chain. Look for
the (multi) marker at: http://www.solr-start.com/info/analyzers/ [2] for
more details.
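To illustrate the point above, here is a minimal Python sketch of why a wildcard query can miss a stemmed term. It is not Solr code: toy_stem is a hypothetical suffix stripper standing in for the real German stemmer, and the assumption is that only lowercasing is applied to the wildcard query, as described above.

```python
# Toy model (assumption): index-time analysis lowercases and stems,
# while wildcard (multiterm) query analysis only lowercases.

def toy_stem(token: str) -> str:
    """Hypothetical stemmer: strips one common German suffix."""
    for suffix in ("en", "er", "e", "n", "s"):
        if token.endswith(suffix) and len(token) > len(suffix) + 2:
            return token[: -len(suffix)]
    return token


def index_analyze(text: str) -> list[str]:
    """Full analysis chain used at index time: lowercase, then stem."""
    return [toy_stem(tok.lower()) for tok in text.split()]


def wildcard_analyze(query: str) -> str:
    """Reduced chain for wildcard queries: lowercase only, no stemming."""
    return query.rstrip("*").lower()


def prefix_search(indexed_terms: list[str], query: str) -> list[str]:
    """Return indexed terms that start with the analyzed query prefix."""
    prefix = wildcard_analyze(query)
    return [term for term in indexed_terms if term.startswith(prefix)]


indexed = index_analyze("Katzen")              # stored as the stem "katz"
miss = prefix_search(indexed, "Katzen*")       # prefix "katzen" is longer than the stem
hit = prefix_search(indexed, "Katz*")          # prefix "katz" matches the stem
print(indexed, miss, hit)
```

In this toy model the full word followed by an asterisk finds nothing, because the index only holds the shorter stem, while a shorter prefix still matches.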

On a higher level, I would suggest getting away from *-based expansion
and looking at EdgeNGrams instead. You can see an example of
autocomplete at
http://www.solr-start.com/javadoc/solr-lucene/index.html [3] and the
matching configuration at:
https://github.com/arafalov/Solr-Javadoc/blob/master/JavadocIndex/JavadocCollection/conf/schema.xml#L24 [4]
Or a dedicated Suggester module, though information on that is a bit
harder to find.
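The edge n-gram idea mentioned above can be sketched in a few lines of Python. This is only an illustration of the technique, not the EdgeNGramFilterFactory implementation; the function names and the min_gram/max_gram defaults are assumptions made for the example.

```python
# Sketch (assumption): every prefix of a token is indexed as its own term,
# so a typed prefix becomes an exact term lookup and needs no wildcard.

def edge_ngrams(token: str, min_gram: int = 1, max_gram: int = 10) -> list[str]:
    """Return the leading substrings of a lowercased token."""
    token = token.lower()
    upper = min(max_gram, len(token))
    return [token[:n] for n in range(min_gram, upper + 1)]


def build_index(docs: dict[int, str]) -> dict[str, set[int]]:
    """Map each edge n-gram to the ids of the documents containing it."""
    index: dict[str, set[int]] = {}
    for doc_id, text in docs.items():
        for token in text.split():
            for gram in edge_ngrams(token):
                index.setdefault(gram, set()).add(doc_id)
    return index


def search(index: dict[str, set[int]], typed: str) -> list[int]:
    """Look up the typed text as a plain term (lowercased only)."""
    return sorted(index.get(typed.lower(), set()))


docs = {1: "Katzen", 2: "Kater"}
index = build_index(docs)
print(search(index, "Kat"))    # both documents share this prefix
print(search(index, "Katze"))  # only document 1 has this prefix
```

Because the prefixes are produced at index time, the query side needs only the same lowercasing, which avoids the mismatch between the index and wildcard analysis chains.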

Regards,
Alex.

Personal: http://www.outerthoughts.com/ [5] and @arafalov
Solr resources and newsletter: http://www.solr-start.com/ [6] and @solrstart
Solr popularizers community: https://www.linkedin.com/groups?gid=6713853 [7]



Links:
------
[1] http://wiki.apache.org/solr/MultitermQueryAnalysis
[2] http://www.solr-start.com/info/analyzers/
[3] http://www.solr-start.com/javadoc/solr-lucene/index.html

[4] https://github.com/arafalov/Solr-Javadoc/blob/master/JavadocIndex/JavadocCollection/conf/schema.xml#L24

[5] http://www.outerthoughts.com/
[6] http://www.solr-start.com/
[7] https://www.linkedin.com/groups?gid=6713853
