Cassandra does not need Hadoop for functionality and is a "standalone" project.
Hadoop is many things. One of those is HDFS, which as you describe is a GFS clone. Hadoop also includes a MapReduce implementation, job tracking, and various other services that a distributed system using it would need. Some people are trying to allow the MapReduce and job components from Hadoop to operate on data from Cassandra. Nobody that I know of is working on any project that would make Hadoop a dependency of Cassandra or that would have Cassandra use HDFS for storage (this would destroy most of Cassandra's advantages for data storage). Michael 2009/7/24 <[email protected]>: > I m trying to understand how cassandra and hadoop fit together > > i thought cassandra was useful as a standalone product > > i cannot figure a situation where cassandra needs hadoop for functionality > > isnt hadoop just a gfs clone, since cassandra can be configured for > redundant storage and availability why do we need hadoop? > > > thanks > > 2009/7/24 李楠 <[email protected]> >> >> what time cassandra has a official releases and more detail user >> development tutorial document just like hadoop project? >> i am interesting in cassandra with distributed, structured key-value >> store,brings Amazon's Dynamo and BigTable and in production use at Facebook. >> >> 2009/7/24 Jonathan Ellis <[email protected]> >>> >>> because nodeprobe is in 0.4, not 0.3 >>> >>> On Fri, Jul 24, 2009 at 2:26 AM, 李楠<[email protected]> wrote: >>> > i deploy in storage-conf.xml <Seeds> add <Seed>192.1.12.149</Seed> >>> > >>> > use bin/nodeprobe -host X ring where X is each machine in your cluster >>> > to >>> > make sure all the nodes see each other >>> > >>> > why do not i find bin/nodeprobe >> > > > > -- > Bidegg worlds best auction site > http://bidegg.com >
