On Wed, Aug 28, 2013 at 3:37 PM, Paul Selitskas <[email protected]>wrote:

> Will it be set as the search backend further on Wikimedia projects?
>

Yes.  I'm not sure when though.


> Is there source code available for Elasticsearch on Gerrit?


Our plugin that interacts with Elasticsearch is called CirrusSearch and
lives in gerrit here:
<https://gerrit.wikimedia.org/r/#/projects/mediawiki/extensions/CirrusSearch,dashboards/default>
https://gerrit.wikimedia.org/r/#/admin/projects/mediawiki/extensions/CirrusSearch
Elasticsearch lives in github here:
https://github.com/elasticsearch/elasticsearch


> Stemming doesn't work for some languages at all, thus
> searching exact matches only.
>

Stemming is done based on the language of the wiki.  I expect only English
stemming to work on mediawiki.org.  Right now we use the default language
analysers for all the languages that Elasticsearch supports out of the box (
http://www.elasticsearch.org/guide/reference/index-modules/analysis/lang-analyzer/)
with some customizations for English.  Languages that aren't better
supported get a "default" analyser that doesn't do any stemming and splits
on spaces.  I expect we'll have to add build some more analysers in the
future.

Nik
_______________________________________________
Wikitech-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to