On Wed, 15 Jun 2005, Peter Dalgaard wrote:

Robin Hankin <[EMAIL PROTECTED]> writes:

Hi

I'm having difficulty following the advice in section 2.7 of R-exts.

In one of my packages, there is a function called mobius().

I want to refer to it in the Rd file as the M�bius function, and to
illustrate the
  M�bius  inversion formula (just to be explicit: this is "Mobius" but
with two dots over the second letter).

R-exts section 2.7  gives

\enc{J�reskog}{Joreskog}

as an example, but when I cut-and-paste this, the dvi file (as produced
by R CMD Rd2dvi)
shows the umlauted "o" as A and Z with some diacritical marks, not the
desired o with
two dots on.

Using \"{o} is fine for the dvi output but not the ascii output.

How do I put an umlauted "o" in an Rd file in such a way as to have a
nice
ascii help page and nice dvi files?

Well... You can't. There's no odiaeresis in ASCII. That's exactly the
problem. In UTF-8 or ISO-Latin-1/9 (aka 8859-1 or ditto with the
addition of the Euro) you can display the character and we did
previously implicitly assume Latin-1. However this is of no use to
people in say Latin-2 locales, and in fact we can no longer spell the
entire R Core Team correctly using any of the Latin-N locales (we
lose either M{\"a}chler or {\v S}imon).

As far as I understand the current situation, we recommend that text
files be pure ASCII (which has also led us to introduce deliberate
misspellings of various people in the NEWS file and similar places).

What is happening to you is something else though: The double
characters are a tell-tale sign that you have provided UTF-8 to
something that expected an 8-bit encoding like Latin-1. The fix for
that should be to put \encoding{UTF-8} somewhere at the beginning of
the .Rd file.

(I may well have gotten some detail wrong here, Brian probably knows
the best.)

UTF-8 for latex does not work well (as yet, at least: there is now a utf8 encoding that allows at least the first plane (Latin-1) to work). So it would be much better to use Latin-1 for the file and mark it with \encoding{latin1} and mark specifically with \enc{M�bius}{Mobius} or your preferred transliteration.

The problem is not really for Latin-2 (which does have a and o diaeresis), but languages such as Japanese and Chinese which only have ASCII. So the transliteration is for people without any accents in their charset.


--
Brian D. Ripley,                  [EMAIL PROTECTED]
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595
______________________________________________
[email protected] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html

Reply via email to