Hi!
I'm wondering what's the use of the 'encoding' argument to readLines(x),
as opposed to readLines(file(x, encoding=)). The same question applies
to read.table()'s 'encoding' vs 'fileEncoding' arguments. AFAIK only the
latter is able to re-encode the read text into the internal
representation
UTF-8 is treated specially by readLines(), originally to allow for UTF-8
strings on Windows. See the NEWS for 2.12.0.
That is not the case for encoding = latin1.
If you have a Latin-1 file in a UTF-8 locale, then
readLines(x, encoding = latin1)
stores the strings in Latin-1 and marks them,
Le samedi 05 avril 2014 à 14:16 +0100, Prof Brian Ripley a écrit :
UTF-8 is treated specially by readLines(), originally to allow for UTF-8
strings on Windows. See the NEWS for 2.12.0.
That is not the case for encoding = latin1.
If you have a Latin-1 file in a UTF-8 locale, then
3 matches
Mail list logo