Hi,
[I don't have a Windows machine at hand, so I cannot say whether this is a
general R issue.]
I'm working with texts written in languages that use characters such as this
one:
Ɂ U+0241 LATIN CAPITAL LETTER GLOTTAL STOP UTF-8: C9 81
A naive example:
print("Ɂ")
and I get:
[1] "\u0241"
Warning message:
In print.default("\u0241") :
it is not known that wchar_t is Unicode on this platform
By contrast, utf8ToInt("Ɂ") is processed correctly.
If I do something like:
print(intToUtf8(32:5000))
I see lots of \u.... escapes in the output.
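To get a rough idea of how many code points are affected, something like the following sketch may help. It assumes that encodeString() applies the same escaping as print() does for character strings; the variable names are only illustrative.

```r
# Code points from the example above, one string per code point.
chars <- intToUtf8(32:5000, multiple = TRUE)

# encodeString() returns each string the way it would be rendered when
# printed (assumption: it shares its escaping logic with print()).
shown <- encodeString(chars)

# TRUE where the rendered form differs from the character itself,
# i.e. where the character was escaped.
escaped <- shown != chars

# Number of escaped code points on this platform and locale.
sum(escaped)

# The first few affected code points, as integers so they print unescaped.
head((32:5000)[escaped], 10)
```

The backslash (code point 92) always shows up as escaped, so the count is never zero, even on a platform without this problem.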
Ok, I could use "cat()" to print such characters but ...
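For completeness, a minimal sketch of that workaround. It only shows that the string itself is stored correctly and that cat() and writeLines() do not apply the escaping that print() does; whether the glyph actually appears still depends on the terminal and font.

```r
# The character from the example, written as an escape so the source
# file stays ASCII-only.
glottal <- "\u0241"

# The string holds exactly one code point, U+0241 (decimal 577).
code_point <- utf8ToInt(glottal)
code_point

# cat() and writeLines() send the string to the console without the
# \u.... escaping that print() applies.
cat(glottal, "\n")
writeLines(glottal)
```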
As far as I can see in e.g. "printutils.c", the decision is based on the
return value of "iswprint()". "iswprint()" depends on the locale setting, and I
have set the locale to en_US.UTF-8.
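The locale that R actually sees can be checked from within the session. This is only a quick sketch; it inspects LC_CTYPE, the category that C character classification functions such as iswprint() depend on.

```r
# The character-type locale currently in effect for this R session.
ctype <- Sys.getlocale("LC_CTYPE")
ctype

# l10n_info() reports what R has detected about the current locale.
info <- l10n_info()
info$MBCS          # TRUE if the locale uses a multi-byte character set
info[["UTF-8"]]    # TRUE if R considers the locale to be UTF-8
```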
Would it be possible to update the code so that ordinary Unicode characters
like this one are printed as-is?
The issue also arises when I use such characters in the row and/or column
names of R objects.
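A small sketch of the dimnames case, with a made-up matrix. The names are stored and matched correctly; only the printed representation is affected.

```r
# Hypothetical example matrix whose row and column names contain U+0241.
glottal <- "\u0241"
m <- matrix(1:4, nrow = 2,
            dimnames = list(c(glottal, "b"), c("x", glottal)))

# Printing the matrix goes through the same escaping as print("...").
print(m)

# The names themselves are intact: the first row name is the single
# code point 577 (U+0241).
stored_ok <- identical(utf8ToInt(rownames(m)[1]), 577L)
stored_ok

# Indexing by the name works as expected.
value <- m[glottal, "x"]
value

# The names can be shown unescaped with writeLines().
writeLines(rownames(m))
```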
Kind regards,
--Hans
--------
Mac OS 10.7.4
R version 2.14.2 (2012-02-29)
Platform: x86_64-apple-darwin9.8.0/x86_64 (64-bit)
locale:
[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
attached base packages:
[1] stats graphics grDevices utils datasets methods base
loaded via a namespace (and not attached):
[1] tools_2.14.2
_______________________________________________
R-SIG-Mac mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/r-sig-mac