Re: Joining the spark dev community

2014-10-19 Thread Henry Saputra
Hi Saurabh, A good way to start is to use Spark with your applications, file any issues you find, and perhaps provide patches for those or for existing ones. Please take a look at Spark's how-to-contribute page [1] to help you get started. Hope this helps. - Henry [1]

Re: Breaking the previous large-scale sort record with Spark

2014-10-11 Thread Henry Saputra
Congrats to Reynold et al. for leading this effort! - Henry On Fri, Oct 10, 2014 at 7:54 AM, Matei Zaharia matei.zaha...@gmail.com wrote: Hi folks, I interrupt your regularly scheduled user / dev list to bring you some pretty cool news for the project, which is that we've been able to use Spark

Re: [VOTE] Release Apache Spark 1.1.0 (RC4)

2014-09-04 Thread Henry Saputra
LICENSE and NOTICE files are good. Hash files are good. Signature files are good. No third-party executables. Source compiled. Ran local and standalone tests. Tested persist off-heap with Tachyon; looks good. +1 - Henry On Wed, Sep 3, 2014 at 12:24 AM, Patrick Wendell pwend...@gmail.com wrote: Please

Re: hey spark developers! intro from shane knapp, devops engineer @ AMPLab

2014-09-02 Thread Henry Saputra
Welcome Shane =) - Henry On Tue, Sep 2, 2014 at 10:35 AM, shane knapp skn...@berkeley.edu wrote: so, i had a meeting w/the databricks guys on friday and they recommended i send an email out to the list to say 'hi' and give you guys a quick intro. :) hi! i'm shane knapp, the new AMPLab

Re: [Spark SQL] off-heap columnar store

2014-08-25 Thread Henry Saputra
Hi Michael, This is great news. Any initial proposal or design about the caching to Tachyon that you can share so far? I don't think there is a JIRA ticket open to track this feature yet. - Henry On Mon, Aug 25, 2014 at 1:13 PM, Michael Armbrust mich...@databricks.com wrote: What is the plan

Re: Spark Contribution

2014-08-21 Thread Henry Saputra
The Apache Spark wiki on how to contribute should be a great place to start: https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark - Henry On Thu, Aug 21, 2014 at 3:25 AM, Maisnam Ns maisnam...@gmail.com wrote: Hi, Can someone help me with some links on how to contribute for

Re: [VOTE] Release Apache Spark 1.0.2 (RC1)

2014-07-28 Thread Henry Saputra
NOTICE and LICENSE files look good. Hashes and sigs look good. No executables in the source distribution. Compiled source and ran standalone. +1 - Henry On Fri, Jul 25, 2014 at 4:08 PM, Tathagata Das tathagata.das1...@gmail.com wrote: Please vote on releasing the following candidate as Apache Spark

Re: -1s on pull requests?

2014-07-21 Thread Henry Saputra
There are ASF guidelines about voting, including code review for patches: http://www.apache.org/foundation/voting.html Some ASF projects require three +1 votes (on the issue, e.g. a JIRA or GitHub PR in this case) for a patch unless it is tagged with lazy consensus [1] of, say, 48 hours. For

Re: Announcing Spark 1.0.1

2014-07-11 Thread Henry Saputra
Congrats to the Spark community! On Friday, July 11, 2014, Patrick Wendell pwend...@gmail.com wrote: I am happy to announce the availability of Spark 1.0.1! This release includes contributions from 70 developers. Spark 1.0.1 includes fixes across several areas of Spark, including the core

Re: Run ScalaTest inside Intellij IDEA

2014-06-17 Thread Henry Saputra
I got stuck on this one too after doing a git pull from master. I have not been able to resolve it yet =( - Henry On Wed, Jun 11, 2014 at 6:51 AM, Yijie Shen henry.yijies...@gmail.com wrote: Thx Qiuzhuang, the problems disappeared after I added the assembly jar at the head of the dependencies list in

Re: Emergency maintenance on jenkins

2014-06-10 Thread Henry Saputra
Thanks for letting us know Patrick. - Henry On Monday, June 9, 2014, Patrick Wendell pwend...@gmail.com wrote: Just a heads up - due to an outage at UCB we've lost several of the Jenkins slaves. I'm trying to spin up new slaves on EC2 in order to compensate, but this might fail some ongoing

Removing spark-debugger.md file from master?

2014-06-03 Thread Henry Saputra
Hi All, It seems that spark-debugger.md is no longer accurate (see http://spark.apache.org/docs/latest/spark-debugger.html); Spark has evolved since it was originally written, which makes the doc obsolete. There is already work pending for new replay debugging (I could not find the PR links

Re: Removing spark-debugger.md file from master?

2014-06-03 Thread Henry Saputra
Cool, thanks Ankur, sounds good. PR is coming. - Henry On Tue, Jun 3, 2014 at 11:11 AM, Ankur Dave ankurd...@gmail.com wrote: I agree, let's go ahead and remove it. Ankur http://www.ankurdave.com/

Add my JIRA username (hsaputra) to Spark's contributor's list

2014-06-03 Thread Henry Saputra
Hi, Could someone with the right karma kindly add my username (hsaputra) to Spark's contributor list? I was added before, but somehow I can no longer assign tickets to myself or update tickets I am working on. Thanks, - Henry

Re: [VOTE] Release Apache Spark 1.0.0 (RC11)

2014-05-28 Thread Henry Saputra
NOTICE and LICENSE files look good. Signatures look good. Hashes look good. No external executables in the source distributions. Source compiled with sbt. Local and standalone example runs look good. +1 - Henry On Mon, May 26, 2014 at 7:38 AM, Tathagata Das tathagata.das1...@gmail.com wrote:

Re: [VOTE] Release Apache Spark 1.0.0 (RC10)

2014-05-22 Thread Henry Saputra
Looks like SPARK-1900 is a blocker for YARN, and we might as well add SPARK-1870 while at it. TD or Patrick, could you kindly send out an email with [CANCEL] prefixed in the subject for the RC10 vote, to help people follow the active VOTE threads? The VOTE emails are getting a bit hard to follow. - Henry

Re: [VOTE] Release Apache Spark 1.0.0 (rc5)

2014-05-16 Thread Henry Saputra
Hi Sandy, Just curious whether the vote is for rc5 or rc6? Gmail shows me that you replied to the rc5 thread. Thanks, - Henry On Wed, May 14, 2014 at 1:28 PM, Sandy Ryza sandy.r...@cloudera.com wrote: +1 (non-binding) * Built the release from source. * Compiled Java and Scala apps that interact

Re: minor optimizations to get my feet wet

2014-04-10 Thread Henry Saputra
for your contributions. You can ignore preferred Apache id section for now. Thank you, Henry Saputra [1] https://www.apache.org/licenses/icla.txt [2] http://www.apache.org/licenses/cla-corporate.txt On Thu, Apr 10, 2014 at 1:48 PM, Ignacio Zendejas ignacio.zendejas...@gmail.com wrote: Hi, all

Re: minor optimizations to get my feet wet

2014-04-10 Thread Henry Saputra
You are welcome, thanks again for contributing =) - Henry On Thu, Apr 10, 2014 at 3:17 PM, Ignacio Zendejas ignacio.zendejas...@gmail.com wrote: I don't think there's a noticeable performance hit by the use of reverse in those cases. It was a quick set of changes and it helped understand what

Re: JIRA. github and asf updates

2014-03-29 Thread Henry Saputra
Given the speed of comment updates in JIRA by the Spark dev community, +1 for the issues@ list - Henry On Saturday, March 29, 2014, Patrick Wendell pwend...@gmail.com wrote: Ah sorry I see - Jira updates are going to the dev list. Maybe that's not desirable. I think we should send them to the issues@

Re: Largest input data set observed for Spark.

2014-03-20 Thread Henry Saputra
Reynold, just curious, did you guys run it in AWS? - Henry On Thu, Mar 20, 2014 at 11:08 AM, Reynold Xin r...@databricks.com wrote: Actually we just ran a job with 70TB+ compressed data on 28 worker nodes - I didn't count the size of the uncompressed data, but I am guessing it is somewhere

Re: Announcing the official Spark Job Server repo

2014-03-18 Thread Henry Saputra
W00t! Thanks for releasing this, Evan. - Henry On Tue, Mar 18, 2014 at 1:51 PM, Evan Chan e...@ooyala.com wrote: Dear Spark developers, Ooyala is happy to announce that we have pushed our official, Spark 0.9.0 / Scala 2.10-compatible, job server as a github repo: