On 19/07/2012 17:14, Hans-Jörg Bibiko wrote:
Hi,

[Since I have not a Windows machine at hand - I cannot say if it's a general R 
issue.]

It's nothing to do with Windows, and everything to do with Apple's contempt for standards conformance (which these days vies with Microsoft's).

I'm dealing with some texts written in languages which are using e.g. this 
character:

Ɂ U+0241 LATIN CAPITAL LETTER GLOTTAL STOP   UTF-8: C9 81

An naïve example:

print("Ɂ")

and I get:

[1] "\u0241"
Warning message:
In print.default("\u0241") :
   it is not known that wchar_t is Unicode on this platform


utf8ToInt("Ɂ") will be processed correctly.

If I do something like:

print(intToUtf8(32:5000))

I see lots of \u....

That's a different issue. Quite a lot of those are not printable Unicode points (some are not even valid ones).

Ok, I could use "cat()" to print such characters but ...

As far as I can see in e.g. "printutils.c" the decision is made due to the returned value of 
"iswprint()". "iswprint()" depends on locale's setting and I set locale to en_US.UTF-8.

Would it be possible to "update" the code to be able to print normal Unicode 
characters?
This issue also arises if I try to name e.g. rows and/or columns of R-objects.

You need to correct the information given by the OS. If it doesn't know if its wide characters are Unicode, there is no portable way in C99 to convert Unicode points to glyphs.

Define __STDC_ISO_10646__ if you know better than your OS headers.

Unicode does publish tables to determine if Unicode points are printable. Does Apple use them? Given how much of its OS is based on pre-internationalized FreeBSD, I doubt it. (I am talking here about ISO C99 functions, not Apple's own API.)



Kind regards,
--Hans


--------
Mac OS 10.7.4

R version 2.14.2 (2012-02-29)
Platform: x86_64-apple-darwin9.8.0/x86_64 (64-bit)

locale:
[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

loaded via a namespace (and not attached):
[1] tools_2.14.2

_______________________________________________
R-SIG-Mac mailing list
R-SIG-Mac@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-mac



--
Brian D. Ripley,                  rip...@stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595

_______________________________________________
R-SIG-Mac mailing list
R-SIG-Mac@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-mac

Reply via email to