I am changing the subject of this thread from GSOC to Spark. I was just looking around and found this:
https://github.com/d9w/Spark.jl The real question is with all the various systems out there, what is the level of abstraction that julia should work with. Julia's DataFrames is one level of abstraction, which could also transparently map to csv files (rather than doing readtable), or a database table, or an HBase table. Why would Spark users want Julia, and why would Julia users want Spark? I guess if we can nail this down - the rest of the integration is probably easy to figure out. -viral On Saturday, March 28, 2015 at 1:14:51 AM UTC+5:30, Jeff Waller wrote: > > Well with very little time lest (15 minutes?) I'd like to reiterate the > idea > of Julia-Spark. Simply put, the basic idea is too allow big data to flow > into Julia > via Spark. >
