ID: 43347
User updated by: pschmandra at hotmail dot com
Reported By: pschmandra at hotmail dot com
Status: Open
Bug Type: XML related
Operating System: Red Hat Linux Enterprise 5
PHP Version: 5.1.6
New Comment:
List of Traditional Chinese RSS Feeds provided by the BBC that make
xml_parse() function error out.
newsrss.bbc.co.uk/rss/chinese/trad/news/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/world/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/china_news/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/taiwan_hk/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/uk/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/learn_english/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/business/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/sci/tech/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/press/rss.xml
Previous Comments:
------------------------------------------------------------------------
[2007-11-21 14:54:59] pschmandra at hotmail dot com
Sorry, every time I try to send the link to the BBC feeds or I put in
XML I get "Please do not SPAM our bug system".
------------------------------------------------------------------------
[2007-11-21 06:00:10] [EMAIL PROTECTED]
Please show an example of your XML
------------------------------------------------------------------------
[2007-11-20 18:49:11] pschmandra at hotmail dot com
All supported encodings listed below cause the xml_parse() function to
error out parsing any Chinese Traditional RSS Feeds published by the BBC
using PHP5.1.6 with a UTF-8 default_charset.
xml_parser_create()
xml_parser_create('')
xml_parser_create('UTF-8')
xml_parser_create('ISO-8859-1')
xml_parser_create('US-ASCII')
------------------------------------------------------------------------
[2007-11-20 17:56:22] [EMAIL PROTECTED]
Please also read
http://php.net/manual/en/function.xml-parser-create.php
carefully, especially:
***
The optional encoding specifies the character encoding for the
input/output in PHP 4. Starting from PHP 5, the input encoding is
automatically detected, so that the encoding parameter specifies only
the output encoding. In PHP 4, the default output encoding is the same
as the input charset. If empty string is passed, the parser attempts to
identify which encoding the document is encoded in by looking at the
heading 3 or 4 bytes. In PHP 5.0.0 and 5.0.1, the default output
charset
is ISO-8859-1, while in PHP 5.0.2 and upper is UTF-8. The supported
encodings are ISO-8859-1, UTF-8 and US-ASCII.
***
------------------------------------------------------------------------
[2007-11-20 16:49:39] [EMAIL PROTECTED]
What was the full configure line used to configure PHP in this case?
And I don't see any crash there, just a normal error for passing data
with encoding which isn't supported by the xml library in use..
------------------------------------------------------------------------
The remainder of the comments for this report are too long. To view
the rest of the comments, please view the bug report online at
http://bugs.php.net/43347
--
Edit this bug report at http://bugs.php.net/?id=43347&edit=1