Cool intro, thanks! One question. On slide 23 it says Standalone (local
mode). That sounds a bit confusing without hearing the talk.
Standalone mode is not local. It just does not depend on a cluster
software. I think it's the best mode for EC2/GCE, because they provide a
distributed filesystem
Thanks for the clarification. I'll fix the slide. I've done a lot of
Scalding/Cascading programming where the two concepts are synonymous, but
clearly I was imposing my prejudices here ;)
dean
On Thu, May 1, 2014 at 8:18 AM, Daniel Darabos
daniel.dara...@lynxanalytics.com wrote:
Cool intro,
Very Useful material. Currently, I am trying to persuade my client choose Spark
instead of Hadoop MapReduce. Your slide give me more evidence to support my
opinion.
--
ZhangYi (张逸)
Developer
tel: 15023157626
blog: agiledon.github.com
weibo: tw张逸
Sent with Sparrow
That's great! Thanks. Let me know if it works ;) or what I could improve to
make it work.
dean
On Thu, May 1, 2014 at 8:45 AM, ZhangYi yizh...@thoughtworks.com wrote:
Very Useful material. Currently, I am trying to persuade my client choose
Spark instead of Hadoop MapReduce. Your slide give
I meant to post this last week, but this is a talk I gave at the Philly ETE
conf. last week:
http://www.slideshare.net/deanwampler/spark-the-next-top-compute-model
Also here:
http://polyglotprogramming.com/papers/Spark-TheNextTopComputeModel.pdf
dean
--
Dean Wampler, Ph.D.
Typesafe