[R] Purpose of readLines(..., encoding=)?

2014-04-05 Thread Milan Bouchet-Valat
Hi! I'm wondering what's the use of the 'encoding' argument to readLines(x), as opposed to readLines(file(x, encoding=)). The same question applies to read.table()'s 'encoding' vs 'fileEncoding' arguments. AFAIK only the latter is able to re-encode the read text into the internal representation

Re: [R] Purpose of readLines(..., encoding=)?

2014-04-05 Thread Prof Brian Ripley
UTF-8 is treated specially by readLines(), originally to allow for UTF-8 strings on Windows. See the NEWS for 2.12.0. That is not the case for encoding = latin1. If you have a Latin-1 file in a UTF-8 locale, then readLines(x, encoding = latin1) stores the strings in Latin-1 and marks them,

Re: [R] Purpose of readLines(..., encoding=)?

2014-04-05 Thread Milan Bouchet-Valat
Le samedi 05 avril 2014 à 14:16 +0100, Prof Brian Ripley a écrit : UTF-8 is treated specially by readLines(), originally to allow for UTF-8 strings on Windows. See the NEWS for 2.12.0. That is not the case for encoding = latin1. If you have a Latin-1 file in a UTF-8 locale, then