[Bug 28748] One multibyte character in parser function output kills all article content
https://bugzilla.wikimedia.org/show_bug.cgi?id=28748

Krinkle krinklem...@gmail.com changed:

What    |Removed |Added
Version |1.16.4  |1.16

--
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
Krinkle krinklem...@gmail.com changed:

What    |Removed |Added
Version |1.16    |1.16.x
Brion Vibber br...@wikimedia.org changed:

What       |Removed |Added
Status     |NEW     |RESOLVED
Resolution |        |INVALID

--- Comment #1 from Brion Vibber br...@wikimedia.org 2011-04-29 18:05:03 UTC ---
\226 is not a multibyte character; it's an invalid UTF-8 byte sequence. That causes PCRE to discard the input at some stage, and this has been the case for several years, since PHP 5.1 or so. You must only return valid UTF-8 strings.
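To make comment #1 concrete, here is a minimal sketch in Python (chosen only for illustration; the thread itself concerns PHP and PCRE). It assumes `\226` is an octal escape, i.e. the single byte 0x96, which is a UTF-8 continuation byte and cannot stand on its own. The helper name `is_valid_utf8` is hypothetical and not part of MediaWiki.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if `data` decodes as strict UTF-8, False otherwise."""
    try:
        data.decode("utf-8")  # strict error handling is the default
        return True
    except UnicodeDecodeError:
        return False


# Assumption: "\226" in the bug report is an octal escape for the byte 0x96.
lone_byte = b"\226"

# For comparison, a genuine multibyte character: U+2013 (en dash)
# is encoded in UTF-8 as the three bytes E2 80 93.
en_dash = "\u2013".encode("utf-8")

print(lone_byte == b"\x96", is_valid_utf8(lone_byte))
print(len(en_dash), is_valid_utf8(en_dash))
```

A parser function could run an equivalent check on its output before returning it; in PHP, `mb_check_encoding($text, 'UTF-8')` plays a similar role.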
--- Comment #2 from Dan Barrett mediaw...@blazemonger.com 2011-04-29 18:06:54 UTC ---
FYI, the problem does not occur in PHP 5.1.6.
--- Comment #4 from Brion Vibber br...@wikimedia.org 2011-04-29 18:07:35 UTC ---
There's no PHP error; it's internal to PCRE.
--- Comment #5 from Dan Barrett mediaw...@blazemonger.com 2011-04-29 18:44:57 UTC ---
Does PCRE return any sort of status/result to MediaWiki that could be emitted as an error? It would be so helpful if *some* component in the whole system could identify the problem when it happens.
--- Comment #6 from Brion Vibber br...@wikimedia.org 2011-04-29 18:47:03 UTC ---
The function returns null, which could be detected. However, *every single use* of preg_replace or preg_replace_callback, and a large share of the preg_match and preg_match_all calls, might need to have checks added. I'm not convinced that it solves much, since the 'entire page goes blank when a bad multibyte char shows up' symptom has been very stable for several years.
--- Comment #7 from Dan Barrett mediaw...@blazemonger.com 2011-04-29 18:53:28 UTC ---
Thanks, Brion. We encountered this problem in an extension that reads from a SQL Server database via an ODBC driver that claims to return UTF-8. Unfortunately, whenever I construct a test case with that ODBC driver talking to PHP directly, I cannot get preg_match() to fail. It fails only when the data goes through MediaWiki, hence the bug report. Thanks for your reasoned commentary about this.
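Comment #7 does not establish which encoding the ODBC driver really returns, so the following is only a sketch of one defensive option at the extension boundary, written in Python for illustration. It assumes, without confirmation from the thread, that the stray byte comes from Windows-1252, where 0x96 is an en dash. The function name `to_valid_utf8` and the `fallback` parameter are hypothetical.

```python
def to_valid_utf8(raw: bytes, fallback: str = "cp1252") -> bytes:
    """Return `raw` unchanged if it is valid UTF-8; otherwise reinterpret it.

    Assumption: bytes that are not valid UTF-8 were produced in the
    `fallback` encoding. Bytes undefined in that encoding become U+FFFD.
    """
    try:
        raw.decode("utf-8")  # strict decode; raises on invalid sequences
        return raw
    except UnicodeDecodeError:
        text = raw.decode(fallback, errors="replace")
        return text.encode("utf-8")


already_utf8 = "caf\u00e9".encode("utf-8")  # valid input passes through
legacy = b"a\x96b"                          # 0x96 is an en dash in Windows-1252

print(to_valid_utf8(already_utf8) == already_utf8)
print(to_valid_utf8(legacy))
```

Converting at the point where external data enters the extension keeps every later regular-expression call working on valid UTF-8, which is the requirement stated in comment #1.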