Bug#555331: [col] improperly fails with Invalid or incomplete multibyte or wide character

2009-11-09 Thread Raphael Hertzog
Package: bsdmainutils
Version: 8.0.1
Severity: serious

Since today I gets lots of lintian warnings (manpage-has-errors-from-man)
on my dpkg builds because col fails with:
col: Invalid or incomplete multibyte or wide character

You can reproduce it by doing this:
LANG=C man --warnings -E UTF-8 -l /usr/share/man/man8/update-alternatives.8.gz 
/dev/null

I don't know if it's col's fault or if it's man-db that does not use col
properly but since col changed recently (and not man-db), I filed the bug
against col. Note that dropping LANG=C makes the warning go away so it's
most certainly locale related. Using any other locale seems to work, even
one that is not UTF-8.

Severity serious to avoid propagation to testing until we know more on the
nature of the problem. 

Cheers,

-- System Information:
Debian Release: squeeze/sid
  APT prefers unstable
  APT policy: (500, 'unstable'), (500, 'testing'), (500, 'stable'), (150, 
'experimental')
Architecture: i386 (x86_64)

Kernel: Linux 2.6.30-2-amd64 (SMP w/2 CPU cores)
Locale: LANG=fr_FR.UTF-8, LC_CTYPE=fr_FR.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages bsdmainutils depends on:
ii  bsdutils  1:2.16.1-4 Basic utilities from 4.4BSD-Lite
ii  debianutils   3.2.1  Miscellaneous utilities specific t
ii  libc6 2.10.1-5   GNU C Library: Shared libraries
ii  libncurses5   5.7+20090803-2 shared libraries for terminal hand

bsdmainutils recommends no packages.

Versions of packages bsdmainutils suggests:
ii  cpp   4:4.3.4-1  The GNU C preprocessor (cpp)
pn  vacation  none (no description available)
ii  wamerican [wordlist]  6-3American English dictionary words 
ii  wfrench [wordlist]1.2.3-7French dictionary words for /usr/s
ii  whois 4.7.36 an intelligent whois client

-- no debconf information

-- 
Raphaƫl Hertzog



--
To UNSUBSCRIBE, email to debian-lint-maint-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#555330: Bug#555331: [col] improperly fails with Invalid or incomplete multibyte or wide character

2009-11-09 Thread Colin Watson
On Mon, Nov 09, 2009 at 12:48:03PM +0100, Raphael Hertzog wrote:
 Package: bsdmainutils
 Version: 8.0.1
 Severity: serious
 
 Since today I gets lots of lintian warnings (manpage-has-errors-from-man)
 on my dpkg builds because col fails with:
 col: Invalid or incomplete multibyte or wide character
 
 You can reproduce it by doing this:
 LANG=C man --warnings -E UTF-8 -l 
 /usr/share/man/man8/update-alternatives.8.gz /dev/null
 
 I don't know if it's col's fault or if it's man-db that does not use col
 properly but since col changed recently (and not man-db), I filed the bug
 against col. Note that dropping LANG=C makes the warning go away so it's
 most certainly locale related. Using any other locale seems to work, even
 one that is not UTF-8.
 
 Severity serious to avoid propagation to testing until we know more on the
 nature of the problem. 

This bug is somewhere in the intersection of bsdmainutils, man-db,
lintian, and locales. Have fun. :-)

The proximate cause is that man uses -Tutf8 and thus outputs UTF-8
hyphens even under LANG=C (compare #547695), and that confuses col now
that it knows about the encoding of its input data.

However, the upstream patch referred to in #547695 is not sufficient
here. lintian uses the '-E UTF-8' option, which forces man to use UTF-8,
overriding the default. This used to work fine when col was dumb; now
that it's smart, things are a bit more problematic. The reason that
lintian does this is that it needs to force UTF-8 output somehow or else
CJK manual pages tend not to work properly, but there is no UTF-8 locale
that's guaranteed to be available on all systems.

In the short term, I think the best approach would be for man to set
LC_CTYPE to some appropriate locale that matches the encoding requested
by -E while running col. I'll see if I can arrange for this. However,
such a locale is not actually guaranteed to exist. Perhaps lintian needs
to generate a UTF-8 locale if it can't find one otherwise, a bit like
the hack in installation-locale; or perhaps we should just make sure
that there's always a C.UTF-8 locale on the system, which could be used
to get UTF-8 character type semantics without implying a particular
language or country.

-- 
Colin Watson   [cjwat...@debian.org]




-- 
To UNSUBSCRIBE, email to debian-lint-maint-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Processed: Re: Bug#555331: [col] improperly fails with Invalid or incomplete multibyte or wide character

2009-11-09 Thread Debian Bug Tracking System
Processing commands for cont...@bugs.debian.org:

 reassign 555331 man-db 2.5.6-3
Bug #555331 [bsdmainutils] [col] improperly fails with Invalid or incomplete 
multibyte or wide character
Bug reassigned from package 'bsdmainutils' to 'man-db'.
Bug No longer marked as found in versions bsdmainutils/8.0.1.
Bug #555331 [man-db] [col] improperly fails with Invalid or incomplete 
multibyte or wide character
Bug Marked as found in versions man-db/2.5.6-3.
 user man...@packages.debian.org
Setting user to man...@packages.debian.org (was cjwat...@debian.org).
 usertags 555331 target-2.5.7
Bug#555331: [col] improperly fails with Invalid or incomplete multibyte or wide 
character
There were no usertags set.
Usertags are now: target-2.5.7.
 tags 555331 fixed-upstream
Bug #555331 [man-db] [col] improperly fails with Invalid or incomplete 
multibyte or wide character
Added tag(s) fixed-upstream.
 clone 555331 -1
Bug#555331: [col] improperly fails with Invalid or incomplete multibyte or wide 
character
Bug 555331 cloned as bug 555408.

 reassign -1 lintian 2.2.17
Bug #555408 [man-db] [col] improperly fails with Invalid or incomplete 
multibyte or wide character
Bug reassigned from package 'man-db' to 'lintian'.
Bug No longer marked as found in versions man-db/2.5.6-3.
Bug #555408 [lintian] [col] improperly fails with Invalid or incomplete 
multibyte or wide character
Bug Marked as found in versions lintian/2.2.17.
 retitle -1 lintian: ensure that there's always a UTF-8 locale for use when 
 running man?
Bug #555408 [lintian] [col] improperly fails with Invalid or incomplete 
multibyte or wide character
Changed Bug title to 'lintian: ensure that there's always a UTF-8 locale for 
use when running man?' from '[col] improperly fails with Invalid or incomplete 
multibyte or wide character'
 severity -1 wishlist
Bug #555408 [lintian] lintian: ensure that there's always a UTF-8 locale for 
use when running man?
Severity set to 'wishlist' from 'serious'

 thanks
Stopping processing here.

Please contact me if you need assistance.

Debian bug tracking system administrator
(administrator, Debian Bugs database)


-- 
To UNSUBSCRIBE, email to debian-lint-maint-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org