There is a "Bring your own Hadoop" for DSE as well: http://www.datastax.com/documentation/datastax_enterprise/4.5/datastax_enterprise/byoh/byohIntro.html
Can also run hadoop against your backup/snapshots: https://github.com/Netflix/aegisthus https://github.com/fullcontact/hadoop-sstable Chris On Aug 26, 2014, at 8:41 AM, Russell Bradberry <[email protected]> wrote: > If you want true integration of Cassandra and Hadoop and Spark then you will > need to use Datastax Enterprise (DSE). There are connectors that will allow > MapReduce over vanilla Cassandra, however, they are just making requests to > Cassandra under the covers while DSE uses CFS which is similar to HDFS. > > > > On August 26, 2014 at 9:23:38 AM, Oliver Ruebenacker ([email protected]) wrote: > >> >> Hello, >> >> I read that Cassandra has had MapReduce integration since early on. There >> are instructions on how to use Hadoop or Spark. However, it appears to me >> that according to these instructions, Hadoop and Spark just submit requests >> to Cassandra just like any other client would. So, I'm not sure what is >> meant by "integration". >> >> Any pointers? Thanks! >> >> Best, >> Oliver >> >> -- >> Oliver Ruebenacker >> Solutions Architect at Altisource Labs >> Be always grateful, but never satisfied.
