[R] Troubles with stemming (tm + Snowball packages) under MacOS

Milan Bouchet-Valat nalimilan at club.fr
Sun Jan 15 15:52:10 CET 2012


On Friday, 13 January 2012 at 15:49 +0100, Julien Velcin wrote:
> Dear all,
> 
> I am having trouble using the stemming algorithm provided by the tm  
> (text mining) + Snowball packages.
> Here is my configuration:
> 
> MacOS 10.5
> R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions)
> 
> I have installed all the required packages (tm, rJava, RWeka, Snowball)  
> + dependencies. I have deactivated AWT (as described in http://r.789695.n4.nabble.com/Problem-with-Snowball-amp-RWeka-td3402126.html) 
>   with:
> 
> Sys.setenv(NOAWT=TRUE)
> 
> The command tm_map(reuters, stemDocument) gives the following error:
> 
> - First time:
> Error in .jnew(name) :
>    java.lang.InternalError: Can't start the AWT because Java was  
> started on the first thread.  Make sure StartOnFirstThread is not  
> specified in your application's Info.plist or on the command line
> Refreshing GOE props...

In my experience, there's no clean solution to this problem for now.
There's a good workaround, though: run your code from JGR, which is a
GUI written in Java. Snowball works well this way.
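
For illustration, here is a minimal sketch of that workaround. It assumes
that Java and the JGR, tm and Snowball packages are already installed, and
it uses the "crude" sample corpus shipped with tm as a stand-in for the
"reuters" object from the original post (an assumption on my side, since
the post does not show how that object was built). The first two lines are
commented out because they are only meant to be run once from a plain R
session in order to launch JGR.

```r
# From a regular R session: load and start JGR (opens the Java-based console).
# library(JGR)
# JGR()

# Then, inside the JGR console:
library(tm)

# Sample corpus bundled with tm; stand-in for the poster's "reuters" object.
data("crude")

# Same call as in the original post, applied to the sample corpus.
stemmed <- tm_map(crude, stemDocument)

# Quick look at the first two stemmed documents.
inspect(stemmed[1:2])
```

If the stemming call completes without the AWT error inside JGR, the same
code can be applied to your own corpus.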

Cheers


