+Chris Wensel The biggest difference between Cascading and Plume/Crunch/FlumeJava is that the latter all do more lazy evaluation and more program restructuring and much less large scale scheduling. Certainly the PCFJ group do much more to make the results look like a java collection and are better at talking to conventional java types.
I think that Cascading could do the more extensive job graph rewrites. It would be hard for Cascading to generalize its data structures, though without major backward compatibility issues. In sum, I think that the difference between Cascading and PCFJ is largely a matter of taste, not inherent system design. On Mon, Oct 31, 2011 at 2:36 PM, Charles Earl <[email protected]> wrote: > Thanks. This is an insightful discussion. Having just glanced now at both > Plume and Crunch these seem similar to Cascading in the sense of being > dataflow languages. I wonder are you able to comment on if there are > important distinctions. >
