Bug#483189: manpages/docbook.xsl: LESS-THAN OR EQUAL TO character is converted incorrectly

2009-06-21 Thread Jakub Wilk

reassign 483189 man-db 2.5.5-2
retitle 483189 man-db: does not really support UTF-8 manpages
severity 483189 important
clone 483189 -1
reassign -1 docbook-xsl 1.73.2.dfsg.1-5
retitle -1 manpages/docbook.xsl: please add encoding declarations
severity -1 wishlist
thanks

* Daniel Leidert daniel.leid...@wgdd.de, 2008-06-15, 20:28:

The file its actually UTF-8-encoded, but man interprets it as if it was
ISO-8859-1.


The file itself looks perfectly here. iconv doesn't seem to be happy
about the UTF-8 character itself and because of this, man output does
not show the character.


After some investigation and reading:
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=519209#51
I'm quite sure that the bug has nothing to do with iconv.

What is happening here is as follows:
manconv tries to convert the manpage from UTF-8 to ISO-8859-1, which 
obviously fails; thus, it assumes the encoding is ISO-8859-1.


To solve the issue:
- man should support UTF-8-encoded manual pages with characters not 
covered by legacy encodings.
- docbook stylesheet should be adding encoding declarations to the 
generated manpages.


--
Jakub Wilk



--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#534095: Bug#483189: manpages/docbook.xsl: LESS-THAN OR EQUAL TO character is converted incorrectly

2009-06-21 Thread Colin Watson
reassign 483189 groff-base
forcemerge 322760 48318
thanks

On Sun, Jun 21, 2009 at 06:01:26PM +0200, Jakub Wilk wrote:
 - man should support UTF-8-encoded manual pages with characters not  
 covered by legacy encodings.

man-db works just fine assuming that the underlying groff installation
has sufficient support. I plan to upgrade Debian's groff packages to
1.20 fairly soon (I have test packaging of it available via bzr at
http://bzr.debian.org/users/cjwatson/groff/experimental/), which has
proper UTF-8 support and will fix this bug. I've already subjected
man-db to quite extensive testing against groff 1.20 to confirm that
this class of bugs vanishes with it.

 - docbook stylesheet should be adding encoding declarations to the  
 generated manpages.

As I noted in the comment you linked to earlier
(http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=519209#51), that's
only necessary if you're using something other than UTF-8 or the legacy
encoding for the relevant language.

There's no actual *problem* with adding a UTF-8 encoding declaration,
but it will not help with this bug, and once this bug is fixed it will
not be necessary for an encoding declaration to be present in order to
take advantage of the fix. manconv already has pretty reliable heuristic
detection of UTF-8; in fact, this is a large part of its purpose in
life.

Regards,

-- 
Colin Watson   [cjwat...@debian.org]



-- 
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#483189: manpages/docbook.xsl: LESS-THAN OR EQUAL TO character is converted incorrectly

2008-06-15 Thread Daniel Leidert
reassign 483189 libc6
retitle 483189 iconv fails on UTF-8 input with -f UTF-8
thanks

Sorry, I forgot to answer you.

Am Mittwoch, den 28.05.2008, 00:15 +0200 schrieb Jakub Wilk:
 * Daniel Leidert [EMAIL PROTECTED], 2008-05-27, 23:15:

[..]
 But I cannot find a bug. Can you send me the manpage, it creates for you
 (preferably gzipped)?
 The file its actually UTF-8-encoded, but man interprets it as if it was 
 ISO-8859-1.

The file itself looks perfectly here. iconv doesn't seem to be happy
about the UTF-8 character itself and because of this, man output does
not show the character.

This could be a duplicate of #342132.

PS: To avoid locale issues, you can (and I recommend it) use the
suggested parameter.

Regards, Daniel




--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#483189: manpages/docbook.xsl: LESS-THAN OR EQUAL TO character is converted incorrectly

2008-05-27 Thread Jakub Wilk

Package: docbook-xsl
Version: 1.73.2.dfsg.1-4
Severity: normal

$ ?xml version='1.0'?
!DOCTYPE refentry PUBLIC '-//OASIS//DTD DocBook XML V4.5//EN' 
'http://www.docbook.org/xml/4.5/docbookx.dtd'
refentry
refentryinfo
titletmp/title
productnametmp/productname
date2008-05-27/date
/refentryinfo
refmeta
refentrytitletmp/refentrytitle
manvolnum1/manvolnum
refmiscinfo class='version'0/refmiscinfo
/refmeta
refnamediv
refnametmp/refname
/refnamediv
refsection
title#x2264;/title
para/para
/refsection
/refentry

$ xsltproc /usr/share/xml/docbook/stylesheet/nwalsh/manpages/docbook.xsl tmp.xml
Note: Writing tmp.1

$ man -l tmp.1
TMP(1)tmp   TMP(1)

NAME
   tmp

â¤
tmp 0 2008-05-27TMP(1)


-- System Information:
Debian Release: lenny/sid
  APT prefers testing
  APT policy: (900, 'testing'), (600, 'unstable'), (500, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.24-1-686 (SMP w/2 CPU cores)
Locale: LANG=C, LC_CTYPE=pl_PL.utf8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages docbook-xsl depends on:
ii  xml-core  0.11   XML infrastructure and XML catalog

Versions of packages docbook-xsl recommends:
ii  docbook-xml   4.5-5  standard XML documentation system,
pn  docbook-xsl-doc-html | docboo none (no description available)

-- debconf-show failed

--
Jakub Wilk



--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#483189: manpages/docbook.xsl: LESS-THAN OR EQUAL TO character is converted incorrectly

2008-05-27 Thread Daniel Leidert
Am Dienstag, den 27.05.2008, 19:55 +0200 schrieb Jakub Wilk:

[..]
 refsection
  title#x2264;/title
  para/para
 /refsection

[..]
 â¤

It outputs the correct character for me. Note, that not all special
characters are replaced by their GROFF escape codes by default. See
file:///usr/share/doc/docbook-xsl/doc/manpages/man.charmap.use.subset.html, 
which you should use to not rely on a special character set.

But I cannot find a bug. Can you send me the manpage, it creates for you
(preferably gzipped)?

Regards, Daniel 




--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]