[R] SparksR

Jeff Reichman reichm@nj @ending from @bcglob@l@net
Fri Apr 13 23:40:22 CEST 2018


R-Help

 

I'm working in my first large database (53,098,492,383 records).  When I
select the db via something like

 

Library(SparkR)

mydata <- sql("SELECT * FROM <table name>")

 

is "mydata" a SparkDataFrame,  and do I work with SparkDataFrames like I
would regular df (per say); because I can't image I would ever create a 53
billion record df.  I'm starting to acquaint myself with e SparkR package,
but I get confuse because it appears df and SparkDtaFrame are use
interchangeable. Or maybe not.

 

Looking for a good intro to SparkDataFrame.

 

Jeff Reichman


	[[alternative HTML version deleted]]




More information about the R-help mailing list