Re: [LFS Trac] #2148: Man-DB 2.5.1

LFS Trac Sun, 17 Feb 2008 12:23:22 -0800

#2148: Man-DB 2.5.1
--------------------------+-------------------------------------------------
 Reporter:  LydianKnight  |        Owner:  [email protected]
     Type:  task          |       Status:  new                          
 Priority:  normal        |    Milestone:  7.0                          
Component:  Book          |      Version:  SVN                          
 Severity:  normal        |   Resolution:                               
 Keywords:                |  
--------------------------+-------------------------------------------------
Comment (by [EMAIL PROTECTED]):


 Not something I normally use, but I needed to build by-the-book to check
 some other things. Initially, this looks interesting - the included man
 pages seem to be in UTF-8, so I thought I'd give it a whirl using what it
 provides, and also NOT converting the UTF-8 pages from shadow.  There are
 also comments about allowing UTF-8 for Korean, but looking at the
 ChangeLog I found it hard to understand what, if anything, had changed for
 the expected format of pages.  Anybody diffing it against 2.5.0 will find
 it now includes gnulib, and frankly the other changes got swamped by that.

 My test results are extremely strange.

 In a regular tty, set for en_GB.UTF-8 and then overridden with LC_ALL, any
 non-ascii characters render as xFFFD (inverse question-mark for 'invalid
 unicode').  Of the versions of 'apropos.1' the package supplies, this
 affects de, es, fr, it.  All render acceptably in the appropriate .UTF-8
 locales.

 Of the shadow UTF-8 manpages, the picture is similar and also affects the
 bullet character as well as all the accents and diacriticals.  I tested
 passwd.1, or passwd.5 if not available.

 The situation with the vim pages is similar - fr, it, pl, ru only display
 ASCII characters correctly in non-UTF-8 locales, but are ok in UTF-8
 (ru_RU.UTF-8 vim.1 has some odd formatting for command-line option
 translations).

 Now that I've built X and urxvt, the position is murkier:

 Latin1 languages seem to work in urxvt as either de_DE or de_DE.UTF-8,
 Latin2 languages either have problems with a few characters but are ok in
 UTF-8 (cs, pl), or are ok in latin2 but unreadable in UTF-8 (hu). The
 bullet is gibberish in it_IT.UTF-8.  pt_BR.UTF-8 passwd.5 is gibberish.
 ru_RU{,.UTF-8} passwd.1 are both gibberish.  sv_SE.UTF-8 passwd.1
 misrenders A-diaeresis and A-ring.  tr_TR passwd.1 renders I-dot-above as
 Y-acute.  tr_TR.UTF-8 renders passwd.1 as gibberish.  ru_RU vim.1 is
 gibberish but ru_RU.UTF-8 vim.1 looks reasonable (that is a KOI8-R source
 page).

 Of the "hard" languages in urxvt, zh_CN.UTF-8 mostly renders passwd.5
 (initial character is not in my screen fonts), I'm on manually-installed
 locales and I didn't know what to define for non-UTF-8 zh_CN.  Didn't test
 zh_TW.  ko_KR is total gibberish, ko_KR.UTF-8 only displays the ascii
 characters, the ideograms are not present, neither are the empty boxes for
 characters not in any font.

 At first, I thought there had been a switch to "pages are now expected to
 be in UTF-8, but we haven't bothered to document that", but the errors I
 can see with UTF-8 in urxvt make me suspect there is something else
 required.

-- 
Ticket URL: <http://wiki.linuxfromscratch.org/lfs/ticket/2148#comment:1>
LFS Trac <http://wiki.linuxfromscratch.org/lfs/>
Linux From Scratch: Your Distro, Your Rules.
-- 
http://linuxfromscratch.org/mailman/listinfo/lfs-book
FAQ: http://www.linuxfromscratch.org/lfs/faq.html
Unsubscribe: See the above information page

Re: [LFS Trac] #2148: Man-DB 2.5.1

Reply via email to