R-Help

 

I'm working in my first large database (53,098,492,383 records).  When I
select the db via something like

 

Library(SparkR)

mydata <- sql("SELECT * FROM <table name>")

 

is "mydata" a SparkDataFrame,  and do I work with SparkDataFrames like I
would regular df (per say); because I can't image I would ever create a 53
billion record df.  I'm starting to acquaint myself with e SparkR package,
but I get confuse because it appears df and SparkDtaFrame are use
interchangeable. Or maybe not.

 

Looking for a good intro to SparkDataFrame.

 

Jeff Reichman


        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to