Hi,

I have a problem when indexing the corpus. I used the following syntax:

> Setwd ("c :/....")
> Library (tm)
> Txt = Corpus (DirSource ("."); readerControl = list (language = "frensh"))

an error message comes:

>>> Messages d'avis :
1: In readLines(y, encoding = x$Encoding) :
  ligne finale incomplète trouvée dans './n3.txt'
2: In readLines(y, encoding = x$Encoding) :
  ligne finale incomplète trouvée dans './n32.

another question:
 how can I read different document types (. pdf,. "...) html using the
package "tm"?

Thanks very well for help



--
View this message in context: 
http://r.789695.n4.nabble.com/text-mining-tp3560367p3560367.html
Sent from the R help mailing list archive at Nabble.com.

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to