From:             daniel dot oconnor at gmail dot com
Operating system: Windows
PHP version:      5.2.5
PHP Bug Type:     DOM XML related
Bug description:  DOMDocument::baseURI parsing is out of whack

Description:
------------
The W3C clarified a few xml:base issues when publishing the GRDDL spec.

You can see the tests at
http://www.w3.org/TR/grddl-tests/#ambiguous-infoset.

Basically:
 * DOMDocument::loadXML does not detect xml:base attributes
 * simplexml_load_file does not detect xml:base attributes (or they are
lost during the importNode phase)
 * simplexml_load_string does not detect xml:base attributes (or they are
lost during the importNode phase)
 * DOMDocument does not deal with nested xml:base
 * DOMDocument does not deal with redirected xml:base locations

To clarify on the redirect-xml:base stuff...

If I request http://foo.com/example.xml
and that redirects me to http://bar.com/example.xml
and bar.com/example.xml said xml:base = http://foo.com/example.xml

... then http://bar.com/example.xml's baseURI should be
http://bar.com/example.xml

Reproduce code:
---------------
<?php
$url = 'http://www.w3.org/2001/sw/grddl-wg/td/base/xmlWithBase.xml';
$xml = file_get_contents($url);

//Load a url
$doc = DOMDocument::load($url);
var_dump($doc->baseURI);    //Expected
http://www.w3.org/2001/sw/grddl-wg/td/base/xmlWithBase.xml

//Load an xml document with xml:base
$doc = DOMDocument::loadXML($xml);
var_dump($doc->baseURI);    //Expected
http://www.w3.org/2001/sw/grddl-wg/td/base/xmlWithBase.xml



//Does it work with importNode?
$sxe = simplexml_load_file($url);
$dom_sxe = dom_import_simplexml($sxe);

$dom = new DOMDocument('1.0');
$dom_sxe = $dom->importNode($dom_sxe, true);
$dom_sxe = $dom->appendChild($dom_sxe);
var_dump($doc->baseURI);    //Expected (maybe)
http://www.w3.org/2001/sw/grddl-wg/td/base/xmlWithBase.xml

// Alternative?
$sxe = simplexml_load_string($xml);
$dom_sxe = dom_import_simplexml($sxe);

$dom = new DOMDocument('1.0');
$dom_sxe = $dom->importNode($dom_sxe, true);
$dom_sxe = $dom->appendChild($dom_sxe);
var_dump($doc->baseURI);   //Expected (maybe)
http://www.w3.org/2001/sw/grddl-wg/td/base/xmlWithBase.xml



//What about documents with an invalid xml:base (not on the top level
element)?
$doc =
DOMDocument::load('http://www.w3.org/2001/sw/grddl-wg/td/inline-rdf6.xml');
var_dump($doc->baseURI);    //Expected http://wwww.example.org/

//What about documents with a *redirected xml:base* ?
//Note: this test case is a little broken because of a W3C server change -
it *should* redirect to
'http://www.w3.org/2001/sw/grddl-wg/td/base/xmlWithBase.xml'
//      and thus have a funky new xml:base value
$doc =
DOMDocument::load('http://www.w3.org/2001/sw/grddl-wg/td/xmlWithBase.xml');
var_dump($doc->baseURI);    //Expected
http://www.w3.org/2001/sw/grddl-wg/td/base/xmlWithBase.xml

Expected result:
----------------
See reproduce code

Actual result:
--------------
See reproduce code

-- 
Edit bug report at http://bugs.php.net/?id=44367&edit=1
-- 
Try a CVS snapshot (PHP 5.2): 
http://bugs.php.net/fix.php?id=44367&r=trysnapshot52
Try a CVS snapshot (PHP 5.3): 
http://bugs.php.net/fix.php?id=44367&r=trysnapshot53
Try a CVS snapshot (PHP 6.0): 
http://bugs.php.net/fix.php?id=44367&r=trysnapshot60
Fixed in CVS:                 http://bugs.php.net/fix.php?id=44367&r=fixedcvs
Fixed in release:             
http://bugs.php.net/fix.php?id=44367&r=alreadyfixed
Need backtrace:               http://bugs.php.net/fix.php?id=44367&r=needtrace
Need Reproduce Script:        http://bugs.php.net/fix.php?id=44367&r=needscript
Try newer version:            http://bugs.php.net/fix.php?id=44367&r=oldversion
Not developer issue:          http://bugs.php.net/fix.php?id=44367&r=support
Expected behavior:            http://bugs.php.net/fix.php?id=44367&r=notwrong
Not enough info:              
http://bugs.php.net/fix.php?id=44367&r=notenoughinfo
Submitted twice:              
http://bugs.php.net/fix.php?id=44367&r=submittedtwice
register_globals:             http://bugs.php.net/fix.php?id=44367&r=globals
PHP 4 support discontinued:   http://bugs.php.net/fix.php?id=44367&r=php4
Daylight Savings:             http://bugs.php.net/fix.php?id=44367&r=dst
IIS Stability:                http://bugs.php.net/fix.php?id=44367&r=isapi
Install GNU Sed:              http://bugs.php.net/fix.php?id=44367&r=gnused
Floating point limitations:   http://bugs.php.net/fix.php?id=44367&r=float
No Zend Extensions:           http://bugs.php.net/fix.php?id=44367&r=nozend
MySQL Configuration Error:    http://bugs.php.net/fix.php?id=44367&r=mysqlcfg

Reply via email to