"Nikerabbit" changed the status of MediaWiki.r104635 to "ok"
URL: http://www.mediawiki.org/wiki/Special:Code/MediaWiki/104635

Old status:  new
> New status: ok

Commit summary for MediaWiki.r104635:

* (bug 32712) Fix for search indexing of pages with certain unicode chars 
following URL

A regex in SearchUpdate was built for ancient pure ISO 8859-1 and looked for 
\xa0-\xff bytes -- this caused the regex to cut off partway through if there 
was a char containing a byte in the \x80-\x9f range.
Fixed regex to pass \x80-\xff instead.

Added a test case to SearchUpdateTest which checks for this case (example text 
run through the update squash algo, then run through preg_replace with a /u 
param to make sure it gets treated as UTF-8 and checking whether it breaks.)

_______________________________________________
MediaWiki-CodeReview mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/mediawiki-codereview

Reply via email to