map/reduce on Cassandra

2010-01-25 Thread Phillip Michalak
Multiple people have expressed an interest in 'hadoop integration' and 'map/reduce functionality' within Cassandra. I'd like to get a feel for what that means to different people. As a starting point for discussion, Jeff Hodges undertook a prototype effort last summer which was the subject

Re: map/reduce on Cassandra

2010-01-25 Thread Ryan Daum
let it be part of the InputSplit calculation. 3) Progress information should be calculated and displayed.  -- Jeff On Mon, Jan 25, 2010 at 5:43 AM, Phillip Michalak phil.micha...@digitalreasoning.com wrote: Multiple people have expressed an interest in 'hadoop integration' and 'map/reduce

Re: map/reduce on Cassandra

2010-01-25 Thread Brandon Williams
On Mon, Jan 25, 2010 at 1:13 PM, Ryan Daum r...@thimbleware.com wrote: I agree with what Jeff says here about RandomPartitioner support being key. +1 For my purposes with map/reduce I'd personally be fine with some general all-keys dump utility that wrote contents of one node to a file,

Re: map/reduce on Cassandra

2010-01-25 Thread Ryan Daum
On Mon, Jan 25, 2010 at 2:18 PM, Brandon Williams dri...@gmail.com wrote: bin/sstablekeys will dump just the keys from an sstable without row deserialization overhead, but it can't introspect a commitlog. -Brandon Yes, and will it not also return the keys that are replicas from ranges

Re: map/reduce on Cassandra

2010-01-25 Thread Jonathan Ellis
sstablekeys is really the wrong place to support m/r anyway, it just shows that the index can handle what m/r will need On Mon, Jan 25, 2010 at 1:28 PM, Ryan Daum r...@thimbleware.com wrote: On Mon, Jan 25, 2010 at 2:18 PM, Brandon Williams dri...@gmail.com wrote: bin/sstablekeys will dump

Re: map/reduce on Cassandra

2010-01-25 Thread Vijay
...@digitalreasoning.com wrote: Multiple people have expressed an interest in 'hadoop integration' and 'map/reduce functionality' within Cassandra. I'd like to get a feel for what that means to different people. As a starting point for discussion, Jeff Hodges undertook a prototype effort

Map Reduce on Cassandra Store

2009-12-04 Thread Mark Vigeant
Hello! Has anyone tried to run MapReduce analytics on data stored in Cassandra? I feel like I saw a patch once to get hadoop working on top of Cassandra, but I can't find it now. I know that Hadoop integration is big on people's wishlists for future versions of Cassandra, but I'm just curious

RE: Map Reduce on Cassandra Store

2009-12-04 Thread Mark Vigeant
AH, there we go. Thanks a lot Ryan! -Mark -Original Message- From: Ryan King [mailto:r...@twitter.com] Sent: Friday, December 04, 2009 12:17 PM To: cassandra-user@incubator.apache.org Subject: Re: Map Reduce on Cassandra Store On Fri, Dec 4, 2009 at 8:44 AM, Mark Vigeant mark.vige