I've made a few comments on the differences here:
http://www.quora.com/Apache-Hadoop/What-are-the-differences-between-Crunch-and-Cascading/answer/Chris-K-Wensel
chris

On Oct 31, 2011, at 2:44 PM, Ted Dunning wrote:

> +Chris Wensel
>
> The biggest difference between Cascading and Plume/Crunch/FlumeJava is that
> the latter all do more lazy evaluation and more program restructuring and
> much less large scale scheduling. Certainly the PCFJ group do much more to
> make the results look like a java collection and are better at talking to
> conventional java types.
>
> I think that Cascading could do the more extensive job graph rewrites. It
> would be hard for Cascading to generalize its data structures, though without
> major backward compatibility issues.
>
> In sum, I think that the difference between Cascading and PCFJ is largely a
> matter of taste, not inherent system design.
>
>
> On Mon, Oct 31, 2011 at 2:36 PM, Charles Earl <[email protected]> wrote:
> Thanks. This is an insightful discussion. Having just glanced now at both
> Plume and Crunch these seem similar to Cascading in the sense of being
> dataflow languages. I wonder are you able to comment on if there are
> important distinctions.

--
Chris K Wensel
[email protected]
http://www.concurrentinc.com

-- Concurrent, Inc. offers mentoring, support for Cascading
