Edit report at https://bugs.php.net/bug.php?id=63769&edit=1
ID: 63769 Updated by: paj...@php.net Reported by: hanskrentel at yahoo dot de Summary: file:// protocol does not support percent-encoded characters Status: Not a bug Type: Bug Package: Streams related Operating System: Windows PHP Version: 5.4.9 Block user comment: N Private report: N New Comment: it is your job to decode it, file:// does not have and does not follow the % used in other areas. btw, paths on windows can contain the %, see http://msdn.microsoft.com/en- us/library/windows/desktop/aa365247(v=vs.85).aspx for a list of not allowed characters. Previous Comments: ------------------------------------------------------------------------ [2013-01-16 16:18:41] hanskrentel at yahoo dot de @ab: Consider you have a file containing a space in the filename, and you *need to* specify the filename in form of a file:// URI for which space is a special character that needs proper URL-encoding. That file://-URI btw is set in an environemnt variable that requires it (XML_CATALOG_FILES). Domdocument in PHP internally is then using that file://-URI and can't process it properly because the wrapper is not able to properly decode the path information. You actually pretty well demonstrate the problem in your example: php -r "echo file_get_contents('file://C:/my/path/catalog%202.xml');" percent filename Is obviously wrong. %20 in a file://-URI is an ecoded space, so the content space filename needs to be output instead. The filename you meant is properly written as: php -r "echo file_get_contents('file://C:/my/path/catalog%25202.xml');" percent filename compare: http://tools.ietf.org/html/rfc3986#section-2.1 Please add that example to yours because only if you have the two opposite cases (encoded *and* decoded) you can actually work out concrete results. You are just having two times the same example, of which I think both shows the same form of wrong: Missing encoding in those URIs. Which brings me to the point: Is there actually any interest to fix this? I mean there is not much standing in the way if you ask me. Normally users are not using the file:// URIs at all. Those who did most likely used the space (or would have complained earlier here, but I could find no bug-report). The only edge-case I can see is with files containing percent-signs, however how likely is that at all? Let me know if I would sponsor some well written patch how the chances would be to get this fixed. ------------------------------------------------------------------------ [2013-01-08 17:12:54] a...@php.net @hanskrentel That's my test: - create file 'catalog%202.xml' with content "percent filename" - create file 'catalog 2.xml' with content "space filename" - then run php -r "echo file_get_contents('file://C:/my/path/catalog%202.xml');" percent filename - then run php -r "echo file_get_contents('file://C:/my/path/catalog 2.xml');" space filename That's pretty straight forward. That's what I mean - no decoding, both are valid filenames. The decoding should be done in your app depending on what it needs. In your example - you create 'catalog 2.xml' and are trying to stat 'catalog%202.xml', literally. But 'catalog%202.xml' doesn't exist. ------------------------------------------------------------------------ [2013-01-06 07:03:56] anon at anon dot anon Actually, hold on a sec, plus signs are *not* supposed to be decoded here. That means that file names containing plus signs would not be broken by a fix, and only file names containing a '%xx' (where x is a hexit) sequence would be affected, which is probably uncommon. Perhaps you have a chance. ------------------------------------------------------------------------ [2013-01-06 06:38:45] anon at anon dot anon >You would have wanted to access it via 'file:///C:/temp/catalog%%25202.xml' Actually, 'catalog%25202.xml', but I know, I'm agreeing with you. I'm just pointing out that this erroneous behavior may be depended on somewhere in some PHP script, where the author, in good faith, did whatever made things work. I assume you're going to pass your path through urldecode (or not encode it in the first place), and then you'll be one of them. In any case, you're unlikely to get any support here. The reviewers here don't do much except dismiss things as 'Not a bug' and once they've successfully done that they lose interest. C'est le PHP. ------------------------------------------------------------------------ [2013-01-06 01:29:30] hanskrentel at yahoo dot de If you would create a file named catalog%202.xml. You would have wanted to access it via: 'file:///C:/temp/catalog%%25202.xml' which as well does not work. I'm not doing a bug report here to be treated like an idiot. I better suggest you piss completely off instead of leaving such an idiotic comment. My 2 cents. ------------------------------------------------------------------------ The remainder of the comments for this report are too long. To view the rest of the comments, please view the bug report online at https://bugs.php.net/bug.php?id=63769 -- Edit this bug report at https://bugs.php.net/bug.php?id=63769&edit=1