Re: spark 1.1.0 (w/ hadoop 2.4) vs aws java sdk 1.7.2

2014-09-20 Thread Aniket
Looks like the same issue as http://mail-archives.apache.org/mod_mbox/spark-dev/201409.mbox/%3ccajob8btdxks-7-spjj5jmnw0xsnrjwdpcqqtjht1hun6j4z...@mail.gmail.com%3E On Sep 20, 2014 11:09 AM, tian zhang [via Apache Spark Developers List] ml-node+s1001551n8481...@n3.nabble.com wrote: Hi, Spark

Re: A Comparison of Platforms for Implementing and Running Very Large Scale Machine Learning Algorithms

2014-09-20 Thread Seraph
I’m also one of the authors of this paper and I am responsible for the Spark experiments in this paper. Thank you for your guys discussion! (1) Ignacio Zendejas wrote I should rephrase my question as it was poorly phrased: on average, how much faster is Spark v. PySpark (I didn't really mean

Re: guava version conflicts

2014-09-20 Thread Marcelo Vanzin
Hmm, looks like the hack to maintain backwards compatibility in the Java API didn't work that well. I'll take a closer look at this when I get to work on Monday. On Fri, Sep 19, 2014 at 10:30 PM, Cody Koeninger c...@koeninger.org wrote: After the recent spark project changes to guava shading,

Re: A couple questions about shared variables

2014-09-20 Thread Matei Zaharia
Hey Sandy, On September 20, 2014 at 8:50:54 AM, Sandy Ryza (sandy.r...@cloudera.com) wrote: Hey All,  A couple questions came up about shared variables recently, and I wanted to  confirm my understanding and update the doc to be a little more clear.  *Broadcast variables*  Now that tasks data