#2148: Man-DB 2.5.1
--------------------------+-------------------------------------------------
Reporter: LydianKnight | Owner: [email protected]
Type: task | Status: new
Priority: normal | Milestone: 7.0
Component: Book | Version: SVN
Severity: normal | Resolution:
Keywords: |
--------------------------+-------------------------------------------------
Comment (by [EMAIL PROTECTED]):
Not something I normally use, but I needed to build by-the-book to check
some other things. Initially, this looks interesting - the included man
pages seem to be in UTF-8, so I thought I'd give it a whirl using what it
provides, and also NOT converting the UTF-8 pages from shadow. There are
also comments about allowing UTF-8 for Korean, but looking at the
ChangeLog I found it hard to understand what, if anything, had changed for
the expected format of pages. Anybody diffing it against 2.5.0 will find
it now includes gnulib, and frankly the other changes got swamped by that.
My test results are extremely strange.
In a regular tty, set for en_GB.UTF-8 and then overridden with LC_ALL, any
non-ascii characters render as xFFFD (inverse question-mark for 'invalid
unicode'). Of the versions of 'apropos.1' the package supplies, this
affects de, es, fr, it. All render acceptably in the appropriate .UTF-8
locales.
Of the shadow UTF-8 manpages, the picture is similar and also affects the
bullet character as well as all the accents and diacriticals. I tested
passwd.1, or passwd.5 if not available.
The situation with the vim pages is similar - fr, it, pl, ru only display
ASCII characters correctly in non-UTF-8 locales, but are ok in UTF-8
(ru_RU.UTF-8 vim.1 has some odd formatting for command-line option
translations).
Now that I've built X and urxvt, the position is murkier:
Latin1 languages seem to work in urxvt as either de_DE or de_DE.UTF-8,
Latin2 languages either have problems with a few characters but are ok in
UTF-8 (cs, pl), or are ok in latin2 but unreadable in UTF-8 (hu). The
bullet is gibberish in it_IT.UTF-8. pt_BR.UTF-8 passwd.5 is gibberish.
ru_RU{,.UTF-8} passwd.1 are both gibberish. sv_SE.UTF-8 passwd.1
misrenders A-diaeresis and A-ring. tr_TR passwd.1 renders I-dot-above as
Y-acute. tr_TR.UTF-8 renders passwd.1 as gibberish. ru_RU vim.1 is
gibberish but ru_RU.UTF-8 vim.1 looks reasonable (that is a KOI8-R source
page).
Of the "hard" languages in urxvt, zh_CN.UTF-8 mostly renders passwd.5
(initial character is not in my screen fonts), I'm on manually-installed
locales and I didn't know what to define for non-UTF-8 zh_CN. Didn't test
zh_TW. ko_KR is total gibberish, ko_KR.UTF-8 only displays the ascii
characters, the ideograms are not present, neither are the empty boxes for
characters not in any font.
At first, I thought there had been a switch to "pages are now expected to
be in UTF-8, but we haven't bothered to document that", but the errors I
can see with UTF-8 in urxvt make me suspect there is something else
required.
--
Ticket URL: <http://wiki.linuxfromscratch.org/lfs/ticket/2148#comment:1>
LFS Trac <http://wiki.linuxfromscratch.org/lfs/>
Linux From Scratch: Your Distro, Your Rules.
--
http://linuxfromscratch.org/mailman/listinfo/lfs-book
FAQ: http://www.linuxfromscratch.org/lfs/faq.html
Unsubscribe: See the above information page