[issue8633] tarfile doesn't support undecodable filename in PAX format

2010-05-17 Thread Lars Gustäbel
Lars Gustäbel l...@gustaebel.de added the comment: I added support for the hdrcharset method and a workaround for the GNU tar bug, see r81273. -- resolution: - accepted status: open - closed ___ Python tracker rep...@bugs.python.org

[issue8633] tarfile doesn't support undecodable filename in PAX format

2010-05-16 Thread Lars Gustäbel
Changes by Lars Gustäbel l...@gustaebel.de: -- assignee: - lars.gustaebel ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue8633 ___ ___

[issue8633] tarfile doesn't support undecodable filename in PAX format

2010-05-06 Thread Lars Gustäbel
Lars Gustäbel l...@gustaebel.de added the comment: Victor, you misunderstood the pax definition, but it wouldn't harm tarfile if it knew how to handle these corrupt GNU tar archives. In the meantime I filed a bug report on bug-...@gnu.org for this. I said in msg105085 that POSIX gives no

[issue8633] tarfile doesn't support undecodable filename in PAX format

2010-05-06 Thread Lars Gustäbel
Lars Gustäbel l...@gustaebel.de added the comment: I am currently working on a patch to let tarfile use the hdrcharset field. -- ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue8633 ___

[issue8633] tarfile doesn't support undecodable filename in PAX format

2010-05-05 Thread STINNER Victor
New submission from STINNER Victor victor.stin...@haypocalc.com: tarfile is unable to open a TAR archive in PAX format embedding invalid filenames (filename not encoded in utf8, an undecodable filename). Attached file is an example (contain the file b'z/\xff', not decodable from utf8). PAX

[issue8633] tarfile doesn't support undecodable filename in PAX format

2010-05-05 Thread Martin v . Löwis
Martin v. Löwis mar...@v.loewis.de added the comment: I think you are misinterpreting the spec. A PAX file MUST encode its file names in UTF-8. The invalid flag only applies when these invalid names cannot map to file names - either because they are not supported in the locale, or because

[issue8633] tarfile doesn't support undecodable filename in PAX format

2010-05-05 Thread STINNER Victor
STINNER Victor victor.stin...@haypocalc.com added the comment: I didn't read the whole spec, only read quickly the invalid option. The idead behind this issue is to be able to read a file generated by GNU tar which keeps the filename unchanged if it's not encodable to utf8. (z-pax.tar