From: public at christopheringram dot com
Operating system: Linux
PHP version: 4.3.4
PHP Bug Type: Unknown/Other Function
Bug description: htmlentities breaks entities it already encoded

Description:
------------
When htmlentities() is applied to data containing characters above code point 127, those characters are translated into &#nnnn;, where nnnn is the character code. The characters appear to be translated correctly to &#nnnn; first, but the ampersand is then itself translated into &amp;, which yields &amp;#nnnn; and makes the translation of non-ASCII characters pointless.

Reproduce code:
---------------
echo htmlentities("私はガラスを食べられます。それは私を傷つけません", ENT_QUOTES, 'UTF-8');

Expected result:
----------------
私はガラスを食べられます。それは私を傷つけません。

--
Edit bug report at http://bugs.php.net/?id=27691&edit=1
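
For anyone trying to confirm the behaviour on their own build, here is a minimal sketch of a check for the symptom described above. It is not taken from the report. The variable names and the regular expression are illustrative, and the second half only demonstrates the mechanism the reporter suspects (an existing &#nnnn; reference having its ampersand escaped again) by calling htmlspecialchars() on a string that already contains a numeric reference. The fourth argument (double_encode) is assumed to be available; it was added in PHP 5.2.3 and does not exist in 4.3.4.

```php
<?php
// String from the "Reproduce code" section of the report.
$input = "私はガラスを食べられます。それは私を傷つけません";

$encoded = htmlentities($input, ENT_QUOTES, 'UTF-8');

// Symptom check (illustrative): a numeric character reference whose
// leading ampersand has itself been escaped, e.g. "&amp;#31169;".
$doubleEncoded = (bool) preg_match('/&amp;#[0-9]+;/', $encoded);

echo $encoded, "\n";
echo $doubleEncoded ? "symptom present\n" : "symptom not present\n";

// Suspected mechanism, shown in isolation: escaping text that already
// holds a numeric reference. U+79C1 (私) is 31169 in decimal.
$alreadyEncoded = '&#31169;';

// Default behaviour escapes the ampersand again.
$twice = htmlspecialchars($alreadyEncoded, ENT_QUOTES, 'UTF-8');

// Assumption: PHP >= 5.2.3, where double_encode = false leaves
// existing entities untouched.
$once = htmlspecialchars($alreadyEncoded, ENT_QUOTES, 'UTF-8', false);

echo $twice, "\n";
echo $once, "\n";
```

If the first check prints "symptom present", the build shows the double encoding reported here. On versions that support it, passing false for double_encode is one way to avoid re-escaping input that may already contain entities.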