[R] How to read plain text documents into a vector?

Dieter Menne dieter.menne at menne-biomed.de
Wed Oct 14 08:37:07 CEST 2009




Richard Liu wrote:
> 
>  There are actually two vignettes.  Both have examples of a vector of
> characters being made into a tm corpus, but neither shows how to read
> documents on the file system into the vectors.  I tried the other two
> suggestions, but paste seemed not to "glue" the separate lines together
> into one character string.  Perhaps I missed something (collapse?). 
> Perhaps I'll have another look.
> 

I admit, an example to read in external data is missing. Maybe inform the
author.
Try if this works; I have not use the special functions in tm, so there
might be another problem, but readPlain looks like a good place to continue

Dieter

library(tm)
filenames = list.files(path=".",pattern="\\.txt")
docs = ""
for (filename in filenames){
  docs = c(docs,paste(readLines(file(filename)),collapse="\n"))
}
docs
## continue as in example
vs = VectorSource(docs)

-- 
View this message in context: http://www.nabble.com/How-to-read-plain-text-documents-into-a-vector--tp25867792p25886104.html
Sent from the R help mailing list archive at Nabble.com.




More information about the R-help mailing list