https://bugzilla.wikimedia.org/show_bug.cgi?id=70950

Nik Everett <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|Provide a way to find       |CirrusSearch should provide
                   |"really exact" matches, as  |a way to find hyphenated
                   |Lucene-search always has    |words, as Lucene-search
                   |                            |always has

--- Comment #4 from Nik Everett <[email protected]> ---
I think he's right in that hyphenated words are the only thing that lsearchd
has special handling for.  There could be more - the code is vast and I haven't
read it all - but I don't think there are.  I've set the summary back to how
[email protected] originally filed it.  Are there any constructs
other than hyphenated words that have this problem?

The problem with adding lsearchd's support for hyphenated words to Cirrus is
that it relies on some pretty gnarly hacks that we can't easily replicate.  My
hope was that regexes would give you more power to find more things and that
they'd be tolerably fast.

At this point I'm not willing to reimplement the hyphenation hack - its just
too much work and it only handles the hyphens.  I'm very happy to work to make
the regex search faster.  Adding another clause (<<insource:"follow-up"
insource:/follow-up/>> for example) speeds it up but if there are other regex
searches in front of you (there is a queue that all users share) it gets slow
again.  I can certainly work on that.

Even when Cirrus is the primary search backend for enwiki you'll still be able
to use lsearchd for a few months with a url parameter (&srbackend=LuceneSearch)
and we'll monitor which queries still hit that system before we disable it
entirely.  We're in no hurry there.

As to the discussion being in three places - I'm not sure what to say.  I have
trouble keeping track of anything outside of bugzilla.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to