Package: file
Version: 1:5.25-2
Severity: normal

Hello,
file will confuse a WINDOWS-1253 encoded text file
for a ISO_8859-7 encoded file. If one then tries to
convert it, the conversion will not be accurate.

Here is how to reproduce this:

~/foo$ echo $LANG
en_US.utf8
~/foo$ echo 
"αΑάΆβΒγΓδΔεΕέΈζΖηΗήΉθΘιΙίΊϊΐκΚλΛμΜνΝξΞοΟόΌπΠρΡσΣτΤυΥύΎϋΰφΦχΧψΨωΩώΏ" > 
greek_chars.UTF-8.txt
~/foo$ cat greek_chars.UTF-8.txt | iconv -f UTF-8 -t ISO_8859-7 > 
greek_chars.ISO_8859-7.txt
~/foo$ cat greek_chars.UTF-8.txt | iconv -f UTF-8 -t WINDOWS-1253 > 
greek_chars.WINDOWS-1253.txt
~/foo$ ls -1
greek_chars.ISO_8859-7.txt
greek_chars.UTF-8.txt
greek_chars.WINDOWS-1253.txt
~/foo$ file *.txt
greek_chars.ISO_8859-7.txt:   ISO-8859 text
greek_chars.UTF-8.txt:        UTF-8 Unicode text
greek_chars.WINDOWS-1253.txt: ISO-8859 text

For convenience, the file greek_chars.UTF-8.txt
has been attached to this bug report.

Sophoklis

-- System Information:
Debian Release: stretch/sid
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'unstable'), (500, 'testing')
Architecture: i386 (i686)

Kernel: Linux 3.13-1-686-pae (SMP w/2 CPU cores)
Locale: LANG=en_US.utf8, LC_CTYPE=en_US.utf8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)

Versions of packages file depends on:
ii  libc6      2.19-22
ii  libmagic1  1:5.25-2
ii  zlib1g     1:1.2.8.dfsg-2+b1

file recommends no packages.

file suggests no packages.

-- no debconf information
αΑάΆβΒγΓδΔεΕέΈζΖηΗήΉθΘιΙίΊϊΐκΚλΛμΜνΝξΞοΟόΌπΠρΡσΣτΤυΥύΎϋΰφΦχΧψΨωΩώΏ

Reply via email to