[R] The KJV

Ben Bolker bolker at ufl.edu
Sun Feb 7 02:06:40 CET 2010


Jim Lemon <jim <at> bitwrit.com.au> writes:

> 
> On 02/06/2010 06:57 PM, Charlotte Maia wrote:
> > Hey all,
> >
> > Does anyone know if there are any R packages with a copy of the KJV?
> > I'm guessing the answer is no...
> >
> > So the next question, and the more important one is:
> > Does anyone think it would be useful (e.g. for text-mining purposes)?
> > I know almost nothing about theology,
> > so I'm not sure what kind of questions theologists might have (that R
> > could answer).
> >
> > An alternative, that would achieve a similar result (I think),
> > would be an R interface to another open source system, such as Sword.
> >
> Hi Charlotte,
> Try
> 
> http://www.gutenberg.org/etext/10
> 
> Jim
> 

 I couldn't help it:

x <- url("http://www.gutenberg.org/dirs/etext90/kjv10.txt",open="r")
X <- readLines(x,n=20000)
z <- grep("First Book of Moses",X)
X <- X[-(1:z)]
X <- X[nchar(X)>0]
length(X) ## 15058
words <- tolower(unlist(strsplit(X,"[ .,:;()]")))
words2 <- grep("[^0-9]",words,value=TRUE)
tt <- rev(sort(table(words2)))
barplot(rev(tt[1:100]),horiz=TRUE,las=1,cex.names=0.4,log="x")



More information about the R-help mailing list