* Simos Xenitellis [2005-06-02 19:16]:
> The ZIP format (http://www.info-zip.org/pub/infozip/doc/) appears not
> to specify the text encoding of the filenames of the compressed files,
> which causes a problem with unzip utilities when they try to
> uncompress .ZIP files that include filenames in non-UTF-8 encodings.

I encountered this problem recently, when I tried to unpack a zip file
with greek filenames created with WinZip. I didn't try any graphical
decompression software, only command-line unzip, and discovered that
while the filenames were stored in the zipfile in CodePage 737, unzip
tried to map them using a CP-437 to latin-1 translation table on
extraction, and the result was a complete mess...

I found that I could display the stored filenames correctly with the
following command:

zipnote file.zip | iconv -f cp737 -t utf-8

Then, I just renamed the extracted files by hand to the correct names.
There weren't so many, and I could see which file matched which
filename from the order in which they were extracted.

I looked through the unzip docs, but couldn't find an option to avoid
filename translation when unzipping.

By the way, I think that RAR understands filename encodings, because I
never had a problem opening .rar files with greek filenames created on
Windows.

-- 
Alexandros Diamantidis * [EMAIL PROTECTED]

--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to