https://bugzilla.wikimedia.org/show_bug.cgi?id=47770

Michael M. <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Priority|Low                         |Normal
                 CC|                            |[email protected]
            Summary|Searching for Unicode       |Searching for strings
                   |private code point and      |without letters or numbers
                   |characters above BMP        |returns error without
                   |returns error without       |description ("An error has
                   |description ("An error has  |occurred while searching:
                   |occurred while searching:   |The search backend returned
                   |The search backend returned |an error:")
                   |an error:")                 |
           Severity|minor                       |normal

--- Comment #2 from Michael M. <[email protected]> ---
Rephrasing summary and raising priority according to observations on
https://de.wikipedia.org/wiki/Wikipedia:Fragen_zur_Wikipedia#Hat_die_.28Volltext-.29Suche_ein_Unicode-Problem.3F:

It seems that all characters that are neither letters nor numbers (according to
their Unicode General Category) are removed from the search string. This
includes private-use characters as well as surrogates, so every character
outside the BMP is removed, even if it is a letter. If the resulting string is
empty, the error quoted in the summary is returned. Some examples:

https://en.wikipedia.org/w/index.php?title=Special%3ASearch&profile=default&search=a%EE%80%80&fulltext=Search
(search from first comment, with an "a" prepended, is treated like a search for
"a" alone)

https://en.wikipedia.org/w/index.php?title=Special%3ASearch&profile=default&search=%24&fulltext=Search
(search for "$", throws error)

https://en.wikipedia.org/w/index.php?title=Special%3ASearch&profile=default&search=!%22%C2%A7%24%25%26%2F%28%29%3D%3F%3C%3E*%2B%27%23%2C.%3B%3A-_&fulltext=Search
(search for a string of punctuation chars, throws error)
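
To make the suspected behaviour concrete, here is a minimal Python sketch of
the filtering described above. It is only a model of what I observe, not the
search backend's actual code: the function names are made up for illustration,
the assumption that the filter works on UTF-16 code units is inferred from the
surrogate behaviour, and whitespace/tokenization is not modelled at all.

```python
import unicodedata
from urllib.parse import unquote


def utf16_units(text):
    """Split text into UTF-16 code units, each returned as a 1-char string."""
    data = text.encode("utf-16-be", "surrogatepass")
    return [chr(int.from_bytes(data[i:i + 2], "big"))
            for i in range(0, len(data), 2)]


def strip_non_alphanumeric(query):
    """Hypothetical model: keep only code units whose General Category is
    a letter (L*) or a number (N*). Surrogates are category Cs and
    private-use characters are Co, so both are dropped."""
    kept = [unit for unit in utf16_units(query)
            if unicodedata.category(unit)[0] in ("L", "N")]
    return "".join(kept)


# The "search" parameter values of the first two example URLs, percent-decoded.
for encoded in ("a%EE%80%80", "%24"):
    query = unquote(encoded)
    print(repr(query), "->", repr(strip_non_alphanumeric(query)))
```

Under these assumptions, a query made up only of punctuation, private-use
characters or non-BMP characters ends up as an empty string, which matches the
cases where the error appears.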

Namespace prefixes are probably removed from the search string, too, which
would explain bug 49675, where the same error is returned.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
