ID: 18387 Updated by: [EMAIL PROTECTED] Reported By: [EMAIL PROTECTED] -Status: Open +Status: Feedback Bug Type: Documentation problem Operating System: Windows (98,2000) PHP Version: 4.2.1 -Assigned To: +Assigned To: k.schroeder New Comment:
Not enough information was provided for us to be able to handle this bug. Please re-read the instructions at http://bugs.php.net/how-to-report.php If you can provide more information, feel free to add it to this bug and change the status back to "Open". Thank you for your interest in PHP. Please try install sablotron complete package (including iconv). Regards, Kai Previous Comments: ------------------------------------------------------------------------ [2002-07-17 16:11:16] [EMAIL PROTECTED] ooops. the text goes even further on that page: "More over when compiled on an Unix platform with iconv support the full set of encodings supported by iconv can be instantly be used by libxml. On a linux machine with glibc-2.1 the list of supported encodings and aliases fill 3 full pages, and include UCS-4, the full set of ISO-Latin encodings, and the various Japanese ones." But that won't help the original poster since he's using windows.. chregu ------------------------------------------------------------------------ [2002-07-17 16:09:05] [EMAIL PROTECTED] Just for the record: >From http://xmlsoft.org/encoding.html: Default supported encodings [by libxml2] libxml has a set of default converters for the following encodings (located in encoding.c): 1. UTF-8 is supported by default (null handlers) 2. UTF-16, both little and big endian 3. ISO-Latin-1 (ISO-8859-1) covering most western languages 4. ASCII, useful mostly for saving 5. HTML, a specific handler for the conversion of UTF-8 to ASCII with HTML predefined entities like © for the Copyright sign. chregu ------------------------------------------------------------------------ [2002-07-17 14:01:18] [EMAIL PROTECTED] reclassified ------------------------------------------------------------------------ [2002-07-17 13:48:52] [EMAIL PROTECTED] I think it is expected behaviour. According to XML specification the only encodings, XML parsers must support are UTF-8 and UTF-16. libxml2 by default supports them and iso-8859-1. So you should workaround your problem by converting your XML documents to UTF-8 (you can also do it on the fly using iconv() function). 2 PHP Developers: I think this bug should be moved to "Documentation problem" category, because documentation miss this important note about list of supported encodings and how to handle documents with other encodings. ------------------------------------------------------------------------ [2002-07-17 03:55:14] [EMAIL PROTECTED] short script: <?php if(!$dom = domxml_open_file("b00000000001.xml")) { echo "Error while parsing the document\n"; exit; } $root = $dom->document_element(); print_r($root); ?> return: "Error while parsing the document" when xml document has encoding option, like this: <?xml version="1.0" encoding='WINDOWS-1251' ?> without "encoding='WINDOWS-1251'" it work ok. ------------------------------------------------------------------------ -- Edit this bug report at http://bugs.php.net/?id=18387&edit=1 -- PHP Documentation Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php
