Package: moreutils
Version: 0.25
Severity: normal
From the isutf8(1) manpage:
,----
| EXIT STATUS
|        If the file is valid UTF-8, the exit status is zero.  If the
|        file is not valid UTF-8, or there is some error, the exit
|        status is non-zero.
`----

From experience:
,----
| [EMAIL PROTECTED] /tmp/test $ locale
| LANG=fr_FR.UTF-8
| LC_CTYPE="fr_FR.UTF-8"
| LC_NUMERIC="fr_FR.UTF-8"
| LC_TIME="fr_FR.UTF-8"
| LC_COLLATE="fr_FR.UTF-8"
| LC_MONETARY="fr_FR.UTF-8"
| LC_MESSAGES="fr_FR.UTF-8"
| LC_PAPER="fr_FR.UTF-8"
| LC_NAME="fr_FR.UTF-8"
| LC_ADDRESS="fr_FR.UTF-8"
| LC_TELEPHONE="fr_FR.UTF-8"
| LC_MEASUREMENT="fr_FR.UTF-8"
| LC_IDENTIFICATION="fr_FR.UTF-8"
| LC_ALL=
| [EMAIL PROTECTED] /tmp/test $ echo "e with acute is é" > utf8.txt
| [EMAIL PROTECTED] /tmp/test $ iconv --from=utf8 --to=latin1 < utf8.txt > latin1.txt
| [EMAIL PROTECTED] /tmp/test $ file *
| latin1.txt: ISO-8859 text
| utf8.txt:   UTF-8 Unicode text
| [EMAIL PROTECTED] /tmp/test $ isutf8 latin1.txt ; echo $?
| latin1.txt: line 1, char 1, byte offset 17: invalid UTF-8 code
| 0
| [EMAIL PROTECTED] /tmp/test $ isutf8 utf8.txt ; echo $?
| 1
| [EMAIL PROTECTED] /tmp/test $ 
`----

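The observed exit statuses are the reverse of what the manpage
documents: the invalid Latin-1 file yields 0 and the valid UTF-8 file
yields 1.  Either the manpage or the program should be fixed.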
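For comparison, here is a minimal sketch of the documented contract
(zero for valid UTF-8, non-zero otherwise).  It assumes iconv can serve
as an independent validator by converting UTF-8 to UTF-8; the function
name is_valid_utf8 is hypothetical and this is not how isutf8 itself is
implemented.

```shell
#!/bin/sh
# Assumption: iconv exits non-zero when its input is not valid in the
# source encoding, so a UTF-8 to UTF-8 conversion acts as a validity check.
# is_valid_utf8 is a hypothetical helper, not part of moreutils.
is_valid_utf8() {
    iconv -f UTF-8 -t UTF-8 < "$1" > /dev/null 2>&1
}

dir=$(mktemp -d)
# Same test sentence as in the transcript, written byte by byte so the
# result does not depend on the terminal's encoding.
printf 'e with acute is \303\251\n' > "$dir/utf8.txt"    # e-acute as UTF-8 (0xC3 0xA9)
printf 'e with acute is \351\n'     > "$dir/latin1.txt"  # e-acute as Latin-1 (0xE9)

for f in "$dir/utf8.txt" "$dir/latin1.txt"; do
    if is_valid_utf8 "$f"; then
        echo "$(basename "$f"): valid, documented exit status is zero"
    else
        echo "$(basename "$f"): invalid, documented exit status is non-zero"
    fi
done
rm -rf "$dir"
```

Running "isutf8 FILE; echo $?" on the same two files should agree with
this check if the program followed the manpage.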

Roland.

-- System Information:
Debian Release: lenny/sid
  APT prefers unstable
  APT policy: (500, 'unstable')
Architecture: i386 (i686)

Kernel: Linux 2.6.22-3-k7 (SMP w/1 CPU core)
Locale: LANG=fr_FR.UTF-8, LC_CTYPE=fr_FR.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages moreutils depends on:
ii  libc6                         2.7-2      GNU C Library: Shared libraries
ii  perl                          5.8.8-12   Larry Wall's Practical Extraction 

moreutils recommends no packages.

-- no debconf information

