Your message dated Tue, 15 Apr 2008 20:09:30 -0500
with message-id <[EMAIL PROTECTED]>
and subject line Re: Bug#473453: r-base-core: Cyrillic characters cannot be
used in Rd files
has caused the Debian Bug report #473453,
regarding r-base-core: Cyrillic characters cannot be used in Rd files
to be marked as done.
This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.
(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [EMAIL PROTECTED]
immediately.)
--
473453: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=473453
Debian Bug Tracking System
Contact [EMAIL PROTECTED] with problems
--- Begin Message ---
Package: r-base-core
Version: 2.6.2-2
Severity: normal
To reproduce:
1) Create a package stub by calling
package.skeleton('test', list = 'ls')
2) Open file ./test/man/ls.Rd and add the following line to the very
beginning of the document:
\encoding{utf8}
3) Add some cyrillic characters (e.g. тест) to say details section.
4) Run
R CMD check test
It will fail with the following diagnostics:
! Package inputenc Error: Unicode char \u8:т not set up for use with LaTeX.
You can see that R correctly processes the \encoding directive in
ls.Rd file (\usepackage[utf8]{inputenc} is added to the preamble of
test-manual.tex) but that is not enough for LaTeX to correctly compile
such a file.
-- System Information:
Debian Release: lenny/sid
APT prefers testing
APT policy: (1000, 'testing'), (200, 'unstable'), (1, 'experimental')
Architecture: i386 (i686)
Kernel: Linux 2.6.22-3-486
Locale: LANG=ru_RU.UTF-8, LC_CTYPE=ru_RU.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Versions of packages r-base-core depends on:
ii libblas3gf [libblas.so 1.2-1.5 Basic Linear Algebra Subroutines 3
ii libbz2-1.0 1.0.5-0.1 high-quality block-sorting file co
ii libc6 2.7-6 GNU C Library: Shared libraries
ii libgcc1 1:4.3.0-1 GCC support library
ii libgfortran3 4.3.0-1 Runtime library for GNU Fortran ap
ii libice6 2:1.0.4-1 X11 Inter-Client Exchange library
ii libjpeg62 6b-14 The Independent JPEG Group's JPEG
ii libpaper-utils 1.1.23 library for handling paper charact
ii libpcre3 7.4-1+lenny1 Perl 5 Compatible Regular Expressi
ii libpng12-0 1.2.15~beta5-3 PNG library - runtime
ii libreadline5 5.2-3 GNU readline and history libraries
ii libsm6 2:1.0.3-1+b1 X11 Session Management library
ii libx11-6 2:1.0.3-7 X11 client-side library
ii libxt6 1:1.0.5-3 X11 toolkit intrinsics library
ii perl 5.8.8-12 Larry Wall's Practical Extraction
ii tcl8.4 8.4.18-2 Tcl (the Tool Command Language) v8
ii tk8.4 8.4.18-1 Tk toolkit for Tcl and X11, v8.4 -
ii unzip 5.52-11 De-archiver for .zip files
ii zip 2.32-1 Archiver for .zip files
ii zlib1g 1:1.2.3.3.dfsg-11 compression library - runtime
Versions of packages r-base-core recommends:
ii r-base-dev 2.6.2-2 GNU R installation of auxiliary GN
ii r-recommended 2.6.2-2 GNU R collection of recommended pa
-- no debconf information
--- End Message ---
--- Begin Message ---
Upon reflection, closing this now: seems like a mismatch between supported
and used locale settings.
If I misunderstood, feel free to reopen but please also re-state why this
should be considered a bug.
Dirk
On 30 March 2008 at 23:31, wrote:
| 2008/3/30, Dirk Eddelbuettel <[EMAIL PROTECTED]>:
| > May well be but before you go overboard and file a dozen more bug reports
| > with Debian --- would you consider discussing this first on the r-devel
list?
| >
|
| Sorry if it looked unpolite, I'm not going to report any more bugs today ;-)
|
| > None of this is germane to Debian so we may as well get R folks involved.
As
| > for the issue in this bug report, I simply do not know where utf-8 is or is
| > not supported. I tend to just use English, so sorry. Will have to pass
this
| > on.
|
| An excerpt from "Writing R Extensions", section 2.10 "Encoding":
|
| > Wherever possible, avoid non-ASCII chars in Rd files, and even symbols
such as
| > '<', '>', '$', '^', '&', '|', '@', '~', and '*' outside verbatim
environments (since
| > they may disappear in fonts designed to render text).
|
| With that I fully agree, but in my case I *want* to use cyrillics. My
| package is created to overcome some problems with cyrillics in R
| graphics, so I want to insert an actual example of how to workaround a
| problem.
|
| Another except made me optimistic:
|
| > For convenience, encoding names 'latin1' and 'latin2' are always recognized:
| > these and 'UTF-8' are likely to work fairly widely.
|
| Looks like utf8 is generally supported.
|
| Andrey
--
I was mostly off email over the last few days, and am now catching
up. My apologies for delayed responses and/or brevity.
--
Three out of two people have difficulties with fractions.
--- End Message ---