I've since concluded that what I'm trying to compare it to is a 'data grid', e.g. GigaSpaces.
We want a large, evolving data structure that is essentially cached in memory and split across nodes.

On Tue, Sep 6, 2011 at 5:09 PM, Jake Mannix <[email protected]> wrote:
> On Tue, Sep 6, 2011 at 1:52 PM, Benson Margulies <[email protected]> wrote:
>
>> A quick look at Spark seems to suggest that the data it's working on
>> is read-only. If you wanted to have an evolving data set, would any of
>> these others be more useful?
>>
>
> The original data set is read-only, but the same could be said for HDFS,
> right?
> Or are you saying that you don't see a lot of support for outputting back to
> HDFS the results of a computation?
>
> -jake
>
