[Bug 28748] One multibyte character in parser function output kills all article content

2012-04-12 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=28748

Krinkle krinklem...@gmail.com changed:

   What|Removed |Added
------------------------
Version|1.16.4  |1.16

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 28748] One multibyte character in parser function output kills all article content

2012-04-12 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=28748

Krinkle krinklem...@gmail.com changed:

   What|Removed |Added
------------------------
Version|1.16    |1.16.x



[Bug 28748] One multibyte character in parser function output kills all article content

2011-04-29 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=28748

Brion Vibber br...@wikimedia.org changed:

        What|Removed |Added
-----------------------------
      Status|NEW     |RESOLVED
  Resolution|        |INVALID

--- Comment #1 from Brion Vibber br...@wikimedia.org 2011-04-29 18:05:03 UTC ---
\226 is not a multibyte character; it's an invalid UTF-8 byte sequence. That
causes pcre to discard the input at some stage, and this has been the case for
several years since PHP 5.1 or so.

You must only return valid UTF-8 strings.
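
A minimal sketch of the point above, written in Python rather than PHP so that it does not depend on any particular PCRE build. It shows that the octal escape \226 is the single byte 0x96, which is not valid UTF-8 on its own, and how producer code might validate or convert its output before returning it. The helper names `is_valid_utf8` and `to_valid_utf8` are hypothetical, and the Windows-1252 fallback is only an assumption about where such a byte might come from (0x96 is an en dash in that encoding).

```python
# Octal escape \226 is the single byte 0x96 (decimal 150).
RAW = b"caf\226"


def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def to_valid_utf8(data: bytes, fallback: str = "cp1252") -> str:
    """Decode as UTF-8; otherwise decode with an assumed legacy encoding.

    Assumption: bytes that are not valid UTF-8 came from the fallback
    encoding. Bytes undefined in the fallback become U+FFFD.
    """
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError:
        return data.decode(fallback, errors="replace")


if __name__ == "__main__":
    print(is_valid_utf8(RAW))             # a lone 0x96 is rejected
    print(to_valid_utf8(RAW).encode("utf-8"))
```

Validating once, at the point where external data enters the extension, keeps the check in one place instead of at every regex call.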



[Bug 28748] One multibyte character in parser function output kills all article content

2011-04-29 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=28748

--- Comment #2 from Dan Barrett mediaw...@blazemonger.com 2011-04-29 18:06:54 UTC ---
FYI, problem does not occur in PHP 5.1.6.



[Bug 28748] One multibyte character in parser function output kills all article content

2011-04-29 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=28748

--- Comment #4 from Brion Vibber br...@wikimedia.org 2011-04-29 18:07:35 UTC ---
There's no PHP error, it's internal to PCRE.



[Bug 28748] One multibyte character in parser function output kills all article content

2011-04-29 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=28748

--- Comment #5 from Dan Barrett mediaw...@blazemonger.com 2011-04-29 18:44:57 UTC ---
Does PCRE return any sort of status/result to MediaWiki that could be emitted
as an error? It would be so helpful if *some* component in the whole system
could identify the problem when it happens.



[Bug 28748] One multibyte character in parser function output kills all article content

2011-04-29 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=28748

--- Comment #6 from Brion Vibber br...@wikimedia.org 2011-04-29 18:47:03 UTC ---
The function returns null, which could be detected. However *every single use*
of preg_replace or preg_replace_callback, and large chunks of preg_match and
preg_match_all calls might need to have checks added, and I'm not convinced
that it solves much since the 'entire page goes blank when a bad multibyte char
shows up' symptom has been very stable for several years.
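
The trade-off described above can be sketched as follows, again in Python as a stand-in. `preg_replace_like` is a hypothetical function that only mimics the "return null on bad input" behaviour discussed in this thread; it is not PHP's or MediaWiki's implementation. `checked`, `RegexInputError` and `safe_replace` are likewise illustrative names. The sketch shows one way a single wrapper could turn a silent null into a visible error without editing each call site by hand.

```python
import re
from typing import Callable, Optional


class RegexInputError(ValueError):
    """Raised when a regex helper signals failure by returning None."""


def preg_replace_like(pattern: str, repl: str, subject: bytes) -> Optional[str]:
    """Hypothetical stand-in for a replace function that returns None on failure.

    Returns None when subject is not valid UTF-8, else the substituted text.
    """
    try:
        text = subject.decode("utf-8")
    except UnicodeDecodeError:
        return None
    return re.sub(pattern, repl, text)


def checked(func: Callable[..., Optional[str]]) -> Callable[..., str]:
    """Wrap func so that a None result raises instead of propagating silently."""
    def wrapper(*args, **kwargs) -> str:
        result = func(*args, **kwargs)
        if result is None:
            raise RegexInputError(
                f"{func.__name__} failed; input may not be valid UTF-8"
            )
        return result
    return wrapper


# Callers use the wrapped version and get an exception rather than a blank result.
safe_replace = checked(preg_replace_like)
```

Even with such a wrapper, every caller still has to switch to it, which is the cost the comment above is weighing.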



[Bug 28748] One multibyte character in parser function output kills all article content

2011-04-29 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=28748

--- Comment #7 from Dan Barrett mediaw...@blazemonger.com 2011-04-29 18:53:28 UTC ---
Thanks, Brion. We encountered this problem in an extension that reads from a SQL
Server database via an ODBC driver that claims to return UTF-8.

Unfortunately, whenever I construct a test case with that ODBC driver talking
to PHP directly, I cannot get preg_match() to fail. Only when it talks to
MediaWiki. Hence the bug report.

Thanks for your reasoned commentary about this.
