[R] text mining

rgui wa7.mej at gmail.com
Mon May 30 12:17:41 CEST 2011


Hi,

I have a problem when indexing the corpus. I used the following syntax:

> Setwd ("c :/....")
> Library (tm)
> Txt = Corpus (DirSource ("."); readerControl = list (language = "frensh"))

an error message comes:

>>> Messages d'avis :
1: In readLines(y, encoding = x$Encoding) :
  ligne finale incomplète trouvée dans './n3.txt'
2: In readLines(y, encoding = x$Encoding) :
  ligne finale incomplète trouvée dans './n32.

another question:
 how can I read different document types (. pdf,. "...) html using the
package "tm"?

Thanks very well for help



--
View this message in context: http://r.789695.n4.nabble.com/text-mining-tp3560367p3560367.html
Sent from the R help mailing list archive at Nabble.com.



More information about the R-help mailing list