Bug#483189: manpages/docbook.xsl: LESS-THAN OR EQUAL TO character is converted incorrectly
reassign 483189 man-db 2.5.5-2 retitle 483189 man-db: does not really support UTF-8 manpages severity 483189 important clone 483189 -1 reassign -1 docbook-xsl 1.73.2.dfsg.1-5 retitle -1 manpages/docbook.xsl: please add encoding declarations severity -1 wishlist thanks * Daniel Leidert daniel.leid...@wgdd.de, 2008-06-15, 20:28: The file its actually UTF-8-encoded, but man interprets it as if it was ISO-8859-1. The file itself looks perfectly here. iconv doesn't seem to be happy about the UTF-8 character itself and because of this, man output does not show the character. After some investigation and reading: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=519209#51 I'm quite sure that the bug has nothing to do with iconv. What is happening here is as follows: manconv tries to convert the manpage from UTF-8 to ISO-8859-1, which obviously fails; thus, it assumes the encoding is ISO-8859-1. To solve the issue: - man should support UTF-8-encoded manual pages with characters not covered by legacy encodings. - docbook stylesheet should be adding encoding declarations to the generated manpages. -- Jakub Wilk -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#534095: Bug#483189: manpages/docbook.xsl: LESS-THAN OR EQUAL TO character is converted incorrectly
reassign 483189 groff-base forcemerge 322760 48318 thanks On Sun, Jun 21, 2009 at 06:01:26PM +0200, Jakub Wilk wrote: - man should support UTF-8-encoded manual pages with characters not covered by legacy encodings. man-db works just fine assuming that the underlying groff installation has sufficient support. I plan to upgrade Debian's groff packages to 1.20 fairly soon (I have test packaging of it available via bzr at http://bzr.debian.org/users/cjwatson/groff/experimental/), which has proper UTF-8 support and will fix this bug. I've already subjected man-db to quite extensive testing against groff 1.20 to confirm that this class of bugs vanishes with it. - docbook stylesheet should be adding encoding declarations to the generated manpages. As I noted in the comment you linked to earlier (http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=519209#51), that's only necessary if you're using something other than UTF-8 or the legacy encoding for the relevant language. There's no actual *problem* with adding a UTF-8 encoding declaration, but it will not help with this bug, and once this bug is fixed it will not be necessary for an encoding declaration to be present in order to take advantage of the fix. manconv already has pretty reliable heuristic detection of UTF-8; in fact, this is a large part of its purpose in life. Regards, -- Colin Watson [cjwat...@debian.org] -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#483189: manpages/docbook.xsl: LESS-THAN OR EQUAL TO character is converted incorrectly
reassign 483189 libc6 retitle 483189 iconv fails on UTF-8 input with -f UTF-8 thanks Sorry, I forgot to answer you. Am Mittwoch, den 28.05.2008, 00:15 +0200 schrieb Jakub Wilk: * Daniel Leidert [EMAIL PROTECTED], 2008-05-27, 23:15: [..] But I cannot find a bug. Can you send me the manpage, it creates for you (preferably gzipped)? The file its actually UTF-8-encoded, but man interprets it as if it was ISO-8859-1. The file itself looks perfectly here. iconv doesn't seem to be happy about the UTF-8 character itself and because of this, man output does not show the character. This could be a duplicate of #342132. PS: To avoid locale issues, you can (and I recommend it) use the suggested parameter. Regards, Daniel -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]
Bug#483189: manpages/docbook.xsl: LESS-THAN OR EQUAL TO character is converted incorrectly
Package: docbook-xsl Version: 1.73.2.dfsg.1-4 Severity: normal $ ?xml version='1.0'? !DOCTYPE refentry PUBLIC '-//OASIS//DTD DocBook XML V4.5//EN' 'http://www.docbook.org/xml/4.5/docbookx.dtd' refentry refentryinfo titletmp/title productnametmp/productname date2008-05-27/date /refentryinfo refmeta refentrytitletmp/refentrytitle manvolnum1/manvolnum refmiscinfo class='version'0/refmiscinfo /refmeta refnamediv refnametmp/refname /refnamediv refsection title#x2264;/title para/para /refsection /refentry $ xsltproc /usr/share/xml/docbook/stylesheet/nwalsh/manpages/docbook.xsl tmp.xml Note: Writing tmp.1 $ man -l tmp.1 TMP(1)tmp TMP(1) NAME tmp ⤠tmp 0 2008-05-27TMP(1) -- System Information: Debian Release: lenny/sid APT prefers testing APT policy: (900, 'testing'), (600, 'unstable'), (500, 'experimental') Architecture: i386 (i686) Kernel: Linux 2.6.24-1-686 (SMP w/2 CPU cores) Locale: LANG=C, LC_CTYPE=pl_PL.utf8 (charmap=UTF-8) Shell: /bin/sh linked to /bin/dash Versions of packages docbook-xsl depends on: ii xml-core 0.11 XML infrastructure and XML catalog Versions of packages docbook-xsl recommends: ii docbook-xml 4.5-5 standard XML documentation system, pn docbook-xsl-doc-html | docboo none (no description available) -- debconf-show failed -- Jakub Wilk -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]
Bug#483189: manpages/docbook.xsl: LESS-THAN OR EQUAL TO character is converted incorrectly
Am Dienstag, den 27.05.2008, 19:55 +0200 schrieb Jakub Wilk: [..] refsection title#x2264;/title para/para /refsection [..] ⤠It outputs the correct character for me. Note, that not all special characters are replaced by their GROFF escape codes by default. See file:///usr/share/doc/docbook-xsl/doc/manpages/man.charmap.use.subset.html, which you should use to not rely on a special character set. But I cannot find a bug. Can you send me the manpage, it creates for you (preferably gzipped)? Regards, Daniel -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]