[Bug 62058] CirrusSearch: Can't find text in specific heading
https://bugzilla.wikimedia.org/show_bug.cgi?id=62058

Nik Everett neverett+bugzi...@wikimedia.org changed:

What     | Removed | Added
See Also |         | https://github.com/elasticsearch/elasticsearch/issues/5648

--- Comment #6 from Nik Everett neverett+bugzi...@wikimedia.org ---

I've gotten input: we should push aggressive splitting everywhere we can sensibly do it. I've filed https://github.com/elasticsearch/elasticsearch/issues/5648 upstream so we can more easily edit the analyzers built into Elasticsearch. Right now, editing them requires rebuilding them as custom analyzers by hand, which is error-prone. The issue would let us instruct Elasticsearch to rebuild them as custom analyzers, and then we could make incremental changes to them.

We don't actually need the issue closed upstream to work on this here, but we will need it for a few languages because some of the language analyzers can't actually be rebuilt as custom analyzers: Persian, Thai, and German, I believe.

--
You are receiving this mail because:
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
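To make "rebuilding a built-in analyzer as a custom analyzer by hand" concrete, here is a rough Python sketch that assembles index settings for an English-like analyzer and puts a splitting filter at the front of the chain. It is only an illustration under stated assumptions, not CirrusSearch's actual configuration: the filter chain is modelled on the one the Elasticsearch documentation lists for rebuilding `english` (without the optional keyword marker filter), the filter name `aggressive_splitting`, its `word_delimiter` type and its position in the chain are assumptions, and the analyzer name `text` is taken from the `_analyze` request quoted elsewhere in this thread.

```python
import json

# Filter chain modelled on the Elasticsearch documentation's hand-built
# equivalent of the "english" analyzer (keyword marker filter omitted).
ENGLISH_FILTERS = [
    "english_possessive_stemmer",
    "lowercase",
    "english_stop",
    "english_stemmer",
]


def build_settings(split_filter="aggressive_splitting"):
    """Return index settings for a custom analyzer with a splitting filter.

    The name `split_filter` and its definition are hypothetical; they only
    illustrate an incremental change on top of the rebuilt analyzer.
    """
    return {
        "analysis": {
            "filter": {
                "english_possessive_stemmer": {
                    "type": "stemmer",
                    "language": "possessive_english",
                },
                "english_stop": {"type": "stop", "stopwords": "_english_"},
                "english_stemmer": {"type": "stemmer", "language": "english"},
                # Assumption: a word_delimiter filter breaks tokens such as
                # "Kategorie:Stolpersteine" at the punctuation.
                split_filter: {
                    "type": "word_delimiter",
                    "preserve_original": False,
                },
            },
            "analyzer": {
                "text": {
                    "type": "custom",
                    "tokenizer": "standard",
                    # Splitting runs first so later filters see the parts.
                    "filter": [split_filter] + ENGLISH_FILTERS,
                }
            },
        }
    }


settings = build_settings()
print(json.dumps(settings, indent=2))
```

Every filter has to be spelled out by hand, so a single omission silently changes search behaviour. That is the error-prone part the upstream issue is meant to remove.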
[Bug 62058] CirrusSearch: Can't find text in specific heading
https://bugzilla.wikimedia.org/show_bug.cgi?id=62058

Nik Everett neverett+bugzi...@wikimedia.org changed:

What     | Removed                        | Added
Status   | NEW                            | ASSIGNED
Assignee | wikibugs-l@lists.wikimedia.org | neverett+bugzilla@wikimedia.org
[Bug 62058] CirrusSearch: Can't find text in specific heading
https://bugzilla.wikimedia.org/show_bug.cgi?id=62058

--- Comment #3 from Nik Everett neverett+bugzi...@wikimedia.org ---

Not quite sure what is going on, but this actually works in dev and not in production. Neither enwiki nor dewiki splits on the ":", but my dev machines do.

http://localhost:1234/dewiki_content/_analyze?analyzer=text&text=Kategorie:Stolpersteine

{
  "tokens": [
    {
      "token": "kategorie:stolperstein",
      "start_offset": 0,
      "end_offset": 23,
      "type": "<ALPHANUM>",
      "position": 1
    }
  ]
}
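The check in the comment above can be repeated programmatically. The following is a minimal Python sketch, not part of CirrusSearch: the helper names are hypothetical, the query string assumes `analyzer` and `text` are two separate parameters, and the sample response is the one quoted above with JSON quoting (and the angle brackets around the token type) assumed. It builds the `_analyze` URL and reports whether any returned token still contains a colon.

```python
import json
from urllib.parse import urlencode


def analyze_url(base, index, analyzer, text):
    """Build an _analyze request URL with a properly encoded query string."""
    query = urlencode({"analyzer": analyzer, "text": text})
    return f"{base}/{index}/_analyze?{query}"


def tokens_from_response(body):
    """Return the token strings from an _analyze JSON response body."""
    return [entry["token"] for entry in json.loads(body)["tokens"]]


def splits_on_colon(tokens):
    """True when no emitted token still contains a colon."""
    return all(":" not in token for token in tokens)


# Sample response with the values reported in the comment; the quoting and
# the "<ALPHANUM>" spelling of the type are assumptions.
sample = """
{"tokens": [{"token": "kategorie:stolperstein", "start_offset": 0,
             "end_offset": 23, "type": "<ALPHANUM>", "position": 1}]}
"""

url = analyze_url("http://localhost:1234", "dewiki_content", "text",
                  "Kategorie:Stolpersteine")
tokens = tokens_from_response(sample)
print(url)
print(tokens, "splits on colon:", splits_on_colon(tokens))
```

With the sample above the analyzer emits a single token containing the colon, so a search for the part after the colon would not match it on its own.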
[Bug 62058] CirrusSearch: Can't find text in specific heading
https://bugzilla.wikimedia.org/show_bug.cgi?id=62058

--- Comment #4 from Nik Everett neverett+bugzi...@wikimedia.org ---

Ah, what is saving me in dev is the $wgCirrusSearchUseAggressiveSplitting setting, which _is_ enabled on mediawiki.org but only works in English. The problem with enabling it everywhere is that it only works in English right now and might make it harder to find things. Let me see what I can do about that.
[Bug 62058] CirrusSearch: Can't find text in specific heading
https://bugzilla.wikimedia.org/show_bug.cgi?id=62058

--- Comment #5 from Nik Everett neverett+bugzi...@wikimedia.org ---

Stalling this for a moment while I wait on input from Dan and Chad. The question is whether to get aggressive splitting working everywhere or to use a smaller fix that handles just colons. I'd like to unify everywhere on aggressive splitting to make regression testing easier and to avoid the confusion of some environments having it and some not.
[Bug 62058] CirrusSearch: Can't find text in specific heading
https://bugzilla.wikimedia.org/show_bug.cgi?id=62058

Nik Everett neverett+bugzi...@wikimedia.org changed:

What     | Removed | Added
See Also |         | https://bugzilla.wikimedia.org/show_bug.cgi?id=52905

--- Comment #2 from Nik Everett neverett+bugzi...@wikimedia.org ---

Added some See Also bugs which might be the cause. Or might not.
[Bug 62058] CirrusSearch: Can't find text in specific heading
https://bugzilla.wikimedia.org/show_bug.cgi?id=62058

Nik Everett neverett+bugzi...@wikimedia.org changed:

What     | Removed       | Added
Priority | Unprioritized | High
See Also |               | https://bugzilla.wikimedia.org/show_bug.cgi?id=61965

--- Comment #1 from Nik Everett neverett+bugzi...@wikimedia.org ---

Filing high because I'm not sure what is up with it.