Edit report at https://bugs.php.net/bug.php?id=66618&edit=1
ID: 66618 Updated by: [email protected] Reported by: francois dot gannaz at silecs dot info Summary: UTF-8 encoding error -Status: Open +Status: Closed Type: Bug Package: Website problem PHP Version: Irrelevant -Assigned To: +Assigned To: cmb Block user comment: N Private report: N New Comment: The mentioned user notes have been removed in the meantime; there might be others with non UTF-8 encoding, but these can be handled on a case by case basis. Previous Comments: ------------------------------------------------------------------------ [2014-02-03 15:00:36] francois dot gannaz at silecs dot info As I wrote in the first note: | The problem with "zero-width spaces" that get printed is probably related to CSS, not to encoding bugs. Sorry, my bad, I thought that the encoding bugs that cause the W3C validator to refuse to validate the page also caused the display error. As I wrote, I later discovered that they were separated problem. See bug 66196 for the display error. I still think that serving well encoded pages would be better, even if it means deleting (or applying iconv) to old comments. ------------------------------------------------------------------------ [2014-02-03 14:59:50] cmbecker69 at gmx dot de I have not opened the ticket, but it seems reasonable to convert all old user contributed notes to UTF-8; the newer ones seem to be anyway. ------------------------------------------------------------------------ [2014-02-03 14:50:11] [email protected] thanks for the explaination, but I'm still missing the actual bug here. Is this ticket for making sure all user contributed notes are utf-8? ------------------------------------------------------------------------ [2014-02-03 14:47:08] cmbecker69 at gmx dot de If a browser shall display a page in UTF-8 encoding, and there are unrecognized code points, it will substitute them by the Unicode replacement character U+FFFD[1]. You can see that yourself if you visit the array_search man page[2], and search for "greetz Udo". Just below this text is an "ANSI" encoded non-breaking space character (0xA0), which is displayed as <?>. [1] <http://www.fileformat.info/info/unicode/char/0fffd/index.htm> [2] <http://php.net/manual/en/function.array-search.php> ------------------------------------------------------------------------ [2014-02-03 14:23:18] [email protected] I can't imagine that a handful of latin-1 encoded characters in the user submitted data are causing your browser to present you weird data or broken navigation. ------------------------------------------------------------------------ The remainder of the comments for this report are too long. To view the rest of the comments, please view the bug report online at https://bugs.php.net/bug.php?id=66618 -- Edit this bug report at https://bugs.php.net/bug.php?id=66618&edit=1 -- PHP Webmaster List Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php
