Alexander E. Patrakov wrote:
Or, better, drop the UTF-8 support completely from LFS. It is
unmaintainable, and in fact not ready in "vanilla upstream". This also
means dropping Nautilus CD Burner from BLFS.
I realise that was in <troll> markers, but still, I have to disagree.
Part of my motivation for documenting the statuses of our patches is
exactly so that, by ensuring those patches are submitted, "vanilla
upstream" will work correctly in UTF-8 locales and folks (including book
editors) won't have to worry about i18n at all - it'll Just Work.
Actually, there are two problems.
1) Man forgets to recode its output from the translator's charset to the
user's locale charset. Thus, error messages are readable only in
traditional 8-bit locales. Workaround for the current version of Man:
configure with "+lang none". Work is in progress to implement this
conversion. I have no access to the source code of development versions
of Man, but the result will be shown to me when it is ready.
OK, I guess that's progress of sorts.
2) We still need to decide on a configuration file format, so that none
of the existing configurations are broken, but that the possibility of
integrating Man into UTF-8 systems currently using Man-DB is gained.
Why is a configuration file needed at all? Is this because, IIUC,
man-pages don't store their character encodings anywhere, therefore we
need to tell groff and man what encodings we need to translate between?
If so, that sucks, but I guess there's not much we can do about it
(save from translating all man pages to UTF-8, but that's infeasible
when it comes to BLFS, I guess).
In an ideal world, how would you go about being able to view both UTF-8
and non-UTF-8 man pages? I'm thinking it should be:
1) `groff` with -K support and
2) `man` that remembers to recode its output from the translator's
charset to the user's locale charset.
If that's correct, I think we should seriously consider dropping the
existing configuration and implement the above, regardless of whether it
requires patches backported from upstream. At least then, when upstream
eventually do release packages that support both UTF-8 and non-UTF-8 man
pages all we have to do is upgrade to those releases and drop the patches.
And that's really bad as we obviously miss out on any bug fixes made
in groff-1.19.x. For example, CVE-2004-0969 was fixed in 1.19.2!
And in Debian too! Just follow their patch :)
Ah, you mean the patch that's currently in the book? If so that's good
news.
Thanks,
Matt.
--
http://linuxfromscratch.org/mailman/listinfo/lfs-dev
FAQ: http://www.linuxfromscratch.org/faq/
Unsubscribe: See the above information page