Bug#555331: [col] improperly fails with Invalid or incomplete multibyte or wide character
Package: bsdmainutils Version: 8.0.1 Severity: serious Since today I gets lots of lintian warnings (manpage-has-errors-from-man) on my dpkg builds because col fails with: col: Invalid or incomplete multibyte or wide character You can reproduce it by doing this: LANG=C man --warnings -E UTF-8 -l /usr/share/man/man8/update-alternatives.8.gz /dev/null I don't know if it's col's fault or if it's man-db that does not use col properly but since col changed recently (and not man-db), I filed the bug against col. Note that dropping LANG=C makes the warning go away so it's most certainly locale related. Using any other locale seems to work, even one that is not UTF-8. Severity serious to avoid propagation to testing until we know more on the nature of the problem. Cheers, -- System Information: Debian Release: squeeze/sid APT prefers unstable APT policy: (500, 'unstable'), (500, 'testing'), (500, 'stable'), (150, 'experimental') Architecture: i386 (x86_64) Kernel: Linux 2.6.30-2-amd64 (SMP w/2 CPU cores) Locale: LANG=fr_FR.UTF-8, LC_CTYPE=fr_FR.UTF-8 (charmap=UTF-8) Shell: /bin/sh linked to /bin/dash Versions of packages bsdmainutils depends on: ii bsdutils 1:2.16.1-4 Basic utilities from 4.4BSD-Lite ii debianutils 3.2.1 Miscellaneous utilities specific t ii libc6 2.10.1-5 GNU C Library: Shared libraries ii libncurses5 5.7+20090803-2 shared libraries for terminal hand bsdmainutils recommends no packages. Versions of packages bsdmainutils suggests: ii cpp 4:4.3.4-1 The GNU C preprocessor (cpp) pn vacation none (no description available) ii wamerican [wordlist] 6-3American English dictionary words ii wfrench [wordlist]1.2.3-7French dictionary words for /usr/s ii whois 4.7.36 an intelligent whois client -- no debconf information -- Raphaƫl Hertzog -- To UNSUBSCRIBE, email to debian-lint-maint-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#555330: Bug#555331: [col] improperly fails with Invalid or incomplete multibyte or wide character
On Mon, Nov 09, 2009 at 12:48:03PM +0100, Raphael Hertzog wrote: Package: bsdmainutils Version: 8.0.1 Severity: serious Since today I gets lots of lintian warnings (manpage-has-errors-from-man) on my dpkg builds because col fails with: col: Invalid or incomplete multibyte or wide character You can reproduce it by doing this: LANG=C man --warnings -E UTF-8 -l /usr/share/man/man8/update-alternatives.8.gz /dev/null I don't know if it's col's fault or if it's man-db that does not use col properly but since col changed recently (and not man-db), I filed the bug against col. Note that dropping LANG=C makes the warning go away so it's most certainly locale related. Using any other locale seems to work, even one that is not UTF-8. Severity serious to avoid propagation to testing until we know more on the nature of the problem. This bug is somewhere in the intersection of bsdmainutils, man-db, lintian, and locales. Have fun. :-) The proximate cause is that man uses -Tutf8 and thus outputs UTF-8 hyphens even under LANG=C (compare #547695), and that confuses col now that it knows about the encoding of its input data. However, the upstream patch referred to in #547695 is not sufficient here. lintian uses the '-E UTF-8' option, which forces man to use UTF-8, overriding the default. This used to work fine when col was dumb; now that it's smart, things are a bit more problematic. The reason that lintian does this is that it needs to force UTF-8 output somehow or else CJK manual pages tend not to work properly, but there is no UTF-8 locale that's guaranteed to be available on all systems. In the short term, I think the best approach would be for man to set LC_CTYPE to some appropriate locale that matches the encoding requested by -E while running col. I'll see if I can arrange for this. However, such a locale is not actually guaranteed to exist. Perhaps lintian needs to generate a UTF-8 locale if it can't find one otherwise, a bit like the hack in installation-locale; or perhaps we should just make sure that there's always a C.UTF-8 locale on the system, which could be used to get UTF-8 character type semantics without implying a particular language or country. -- Colin Watson [cjwat...@debian.org] -- To UNSUBSCRIBE, email to debian-lint-maint-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Processed: Re: Bug#555331: [col] improperly fails with Invalid or incomplete multibyte or wide character
Processing commands for cont...@bugs.debian.org: reassign 555331 man-db 2.5.6-3 Bug #555331 [bsdmainutils] [col] improperly fails with Invalid or incomplete multibyte or wide character Bug reassigned from package 'bsdmainutils' to 'man-db'. Bug No longer marked as found in versions bsdmainutils/8.0.1. Bug #555331 [man-db] [col] improperly fails with Invalid or incomplete multibyte or wide character Bug Marked as found in versions man-db/2.5.6-3. user man...@packages.debian.org Setting user to man...@packages.debian.org (was cjwat...@debian.org). usertags 555331 target-2.5.7 Bug#555331: [col] improperly fails with Invalid or incomplete multibyte or wide character There were no usertags set. Usertags are now: target-2.5.7. tags 555331 fixed-upstream Bug #555331 [man-db] [col] improperly fails with Invalid or incomplete multibyte or wide character Added tag(s) fixed-upstream. clone 555331 -1 Bug#555331: [col] improperly fails with Invalid or incomplete multibyte or wide character Bug 555331 cloned as bug 555408. reassign -1 lintian 2.2.17 Bug #555408 [man-db] [col] improperly fails with Invalid or incomplete multibyte or wide character Bug reassigned from package 'man-db' to 'lintian'. Bug No longer marked as found in versions man-db/2.5.6-3. Bug #555408 [lintian] [col] improperly fails with Invalid or incomplete multibyte or wide character Bug Marked as found in versions lintian/2.2.17. retitle -1 lintian: ensure that there's always a UTF-8 locale for use when running man? Bug #555408 [lintian] [col] improperly fails with Invalid or incomplete multibyte or wide character Changed Bug title to 'lintian: ensure that there's always a UTF-8 locale for use when running man?' from '[col] improperly fails with Invalid or incomplete multibyte or wide character' severity -1 wishlist Bug #555408 [lintian] lintian: ensure that there's always a UTF-8 locale for use when running man? Severity set to 'wishlist' from 'serious' thanks Stopping processing here. Please contact me if you need assistance. Debian bug tracking system administrator (administrator, Debian Bugs database) -- To UNSUBSCRIBE, email to debian-lint-maint-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org