[GitHub] spark pull request: fixed spelling errors in documentation

2014-12-14 Thread JoshRosen
Github user JoshRosen commented on the pull request:

https://github.com/apache/spark/pull/3691#issuecomment-66905844
  
LGTM; thanks!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: fixed spelling errors in documentation

2014-12-14 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/3691





[GitHub] spark pull request: fixed spelling errors in documentation

2014-12-14 Thread JoshRosen
Github user JoshRosen commented on the pull request:

https://github.com/apache/spark/pull/3691#issuecomment-66905888
  
Merged into master and all of the maintenance branches.





[GitHub] spark pull request: Spark Core - [SPARK-3620] - Refactor of SparkS...

2014-12-14 Thread tigerquoll
Github user tigerquoll commented on the pull request:

https://github.com/apache/spark/pull/2516#issuecomment-66909268
  
No probs, it was actually a nice way of starting to poke through the code 
to figure out how things are put together.  I'll stick to smaller jobs from now 
on in.
Regards, Dale.

Date: Tue, 9 Dec 2014 19:11:25 -0800
From: notificati...@github.com
To: sp...@noreply.github.com
CC: tigerqu...@outlook.com
Subject: Re: [spark] Spark Core - [SPARK-3620] - Refactor of SparkSubmit 
Argument parsing code (#2516)

Hey @tigerquoll usually for large patches like this we require a design doc 
on the JIRA. Especially because the priority of this is not super important, I 
would recommend that we close this issue for now, and maybe open a new one 
later once there is a consensus on how we should restructure Spark submit. 
Thanks for your work so far.





[GitHub] spark pull request: [SPARK-3640] [Streaming] [Kinesis] Allow users...

2014-12-14 Thread aniketbhatnagar
Github user aniketbhatnagar closed the pull request at:

https://github.com/apache/spark/pull/3092





[GitHub] spark pull request: [SPARK-3640] [Streaming] [Kinesis] Allow users...

2014-12-14 Thread aniketbhatnagar
Github user aniketbhatnagar commented on the pull request:

https://github.com/apache/spark/pull/3092#issuecomment-66912820
  
Tested and Chris's suggestion to use EC2 profiles works.





[GitHub] spark pull request: [SPARK-4527][SQl]Add BroadcastNestedLoopJoin o...

2014-12-14 Thread liancheng
Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/3395#issuecomment-66915681
  
LGTM, thanks!





[GitHub] spark pull request: [SQL] SPARK-4700: Add HTTP protocol spark thri...

2014-12-14 Thread liancheng
Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/3672#issuecomment-66916125
  
retest this please





[GitHub] spark pull request: [SQL] SPARK-4700: Add HTTP protocol spark thri...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3672#issuecomment-66916267
  
  [Test build #24440 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24440/consoleFull)
 for   PR 3672 at commit 
[`31a6520`](https://github.com/apache/spark/commit/31a6520aea59f12d1a6896db6079c5cea62659a3).
 * This patch merges cleanly.





[GitHub] spark pull request: [SQL] SPARK-4700: Add HTTP protocol spark thri...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3672#issuecomment-66919317
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24440/
Test FAILed.





[GitHub] spark pull request: [SQL] SPARK-4700: Add HTTP protocol spark thri...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3672#issuecomment-66919314
  
  [Test build #24440 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24440/consoleFull)
 for   PR 3672 at commit 
[`31a6520`](https://github.com/apache/spark/commit/31a6520aea59f12d1a6896db6079c5cea62659a3).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `class Analyzer(catalog: Catalog, registry: FunctionRegistry, 
caseSensitive: Boolean)`






[GitHub] spark pull request: Add a Note on jsonFile having separate JSON ob...

2014-12-14 Thread petervandenabeele
Github user petervandenabeele commented on the pull request:

https://github.com/apache/spark/pull/3517#issuecomment-66922392
  
Bump ...

I suggest we revert to something close to my original proposal:
* no change in filenames (too complex for now)
* add a small(er) note in the doc about the non-standard format

In our DataScienceBe project, I just got this message from a new Spark user:

to reiterate (and make sure I understand correctly), the 
`jsonFile` function does not read valid JSON files, but rather special files 
containing a valid JSON object on each line.

Just making this clear to the users will already avoid some frustration.

Could you please confirm that I can go ahead with this proposal (or suggest a 
different path to resolve this)?
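
To make the format distinction in the quoted message concrete, here is a minimal sketch in plain Python (standard library only, not the Spark API). The sample data and the helper names `parse_json_lines` and `is_single_json_document` are hypothetical and only illustrate the "one JSON object per line" layout described above.

```python
import json

# Hypothetical sample data: one complete JSON object per line,
# which is the layout the quoted user describes for `jsonFile`.
LINE_DELIMITED = '{"name": "a", "n": 1}\n{"name": "b", "n": 2}\n'


def parse_json_lines(text):
    """Parse text holding one JSON object per line; blank lines are skipped."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def is_single_json_document(text):
    """Return True if the whole text parses as one JSON value."""
    try:
        json.loads(text)
        return True
    except json.JSONDecodeError:
        return False


records = parse_json_lines(LINE_DELIMITED)
```

The same text is rejected when treated as a single JSON document, which is the source of the confusion the proposed documentation note is meant to address.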





[GitHub] spark pull request: Add a Note on jsonFile having separate JSON ob...

2014-12-14 Thread marmbrus
Github user marmbrus commented on the pull request:

https://github.com/apache/spark/pull/3517#issuecomment-66924151
  
Sure, I'm happy with clarifications to the documentation.





[GitHub] spark pull request: [SPARK-4837] NettyBlockTransferService should ...

2014-12-14 Thread aarondav
Github user aarondav commented on the pull request:

https://github.com/apache/spark/pull/3688#issuecomment-66925510
  
Jenkins, retest this please.





[GitHub] spark pull request: [SPARK-4837] NettyBlockTransferService should ...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3688#issuecomment-66925608
  
  [Test build #24441 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24441/consoleFull)
 for   PR 3688 at commit 
[`ebd2007`](https://github.com/apache/spark/commit/ebd2007d82ea909b1d060114148e4abc4f749b25).
 * This patch merges cleanly.





[GitHub] spark pull request: [SPARK-4812][SQL] Fix the initialization issue...

2014-12-14 Thread marmbrus
Github user marmbrus commented on the pull request:

https://github.com/apache/spark/pull/3660#issuecomment-66926121
  
This is a good catch, but perhaps the real bug here is that we are 
overriding the `sqlContext` in `InMemoryTableScan`.  Instead, what do you think 
about just removing that?





[GitHub] spark pull request: [SPARK-4837] NettyBlockTransferService should ...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3688#issuecomment-66928132
  
  [Test build #24441 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24441/consoleFull)
 for   PR 3688 at commit 
[`ebd2007`](https://github.com/apache/spark/commit/ebd2007d82ea909b1d060114148e4abc4f749b25).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-4837] NettyBlockTransferService should ...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3688#issuecomment-66928133
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24441/
Test FAILed.





[GitHub] spark pull request: SPARK-785 [CORE] ClosureCleaner not invoked on...

2014-12-14 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/3690#issuecomment-66928843
  
Jenkins, retest this please.





[GitHub] spark pull request: SPARK-785 [CORE] ClosureCleaner not invoked on...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3690#issuecomment-66928921
  
  [Test build #24442 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24442/consoleFull)
 for   PR 3690 at commit 
[`8df68fe`](https://github.com/apache/spark/commit/8df68fed84cbbcc328d9ccce5df930f1c76c6b07).
 * This patch merges cleanly.





[GitHub] spark pull request: SPARK-4814 [CORE] Enable assertions in SBT, Ma...

2014-12-14 Thread srowen
GitHub user srowen opened a pull request:

https://github.com/apache/spark/pull/3692

SPARK-4814 [CORE] Enable assertions in SBT, Maven tests / AssertionError 
from Hive's LazyBinaryInteger

This enables assertions for the Maven and SBT build, but overrides the Hive 
module to not enable assertions.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/srowen/spark SPARK-4814

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/3692.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3692


commit f71e783b38cf709cb1bf6dcfea45237d820c7bd1
Author: Sean Owen so...@cloudera.com
Date:   2014-12-14T18:29:40Z

Enable assertions for SBT and Maven build

commit caca7047a6bcd672ae5e9657f4b2d5a61ba97cb7
Author: Sean Owen so...@cloudera.com
Date:   2014-12-14T21:27:31Z

Disable assertions just for Hive







[GitHub] spark pull request: SPARK-4814 [CORE] Enable assertions in SBT, Ma...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3692#issuecomment-66930746
  
  [Test build #24443 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24443/consoleFull)
 for   PR 3692 at commit 
[`caca704`](https://github.com/apache/spark/commit/caca7047a6bcd672ae5e9657f4b2d5a61ba97cb7).
 * This patch merges cleanly.





[GitHub] spark pull request: SPARK-785 [CORE] ClosureCleaner not invoked on...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3690#issuecomment-66931556
  
  [Test build #24442 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24442/consoleFull)
 for   PR 3690 at commit 
[`8df68fe`](https://github.com/apache/spark/commit/8df68fed84cbbcc328d9ccce5df930f1c76c6b07).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: SPARK-785 [CORE] ClosureCleaner not invoked on...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3690#issuecomment-66931560
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24442/
Test FAILed.





[GitHub] spark pull request: [SQL] SPARK-4700: Add HTTP protocol spark thri...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3672#issuecomment-66933007
  
  [Test build #2 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/2/consoleFull)
 for   PR 3672 at commit 
[`526315d`](https://github.com/apache/spark/commit/526315d0cd3dbf34f7e6b5a3ea6d5187bf96cccb).
 * This patch merges cleanly.





[GitHub] spark pull request: Added setMinCount to Word2Vec.scala

2014-12-14 Thread ganonp
GitHub user ganonp opened a pull request:

https://github.com/apache/spark/pull/3693

Added setMinCount to Word2Vec.scala

Wanted to customize the private minCount variable in the Word2Vec class. 
Added a method to do so.
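
As a rough illustration of what such a setter enables, the sketch below uses a hypothetical plain-Python class, `Word2VecSketch`. It is not the Spark `Word2Vec` implementation; the default value of 5 and the `filter_vocab` helper are assumptions made only for this example.

```python
class Word2VecSketch:
    """Hypothetical stand-in for a model with a configurable minimum word count."""

    def __init__(self):
        # Assumed default for illustration; words seen fewer times are dropped.
        self._min_count = 5

    def set_min_count(self, min_count):
        """Set the minimum word count and return self so calls can be chained."""
        self._min_count = min_count
        return self

    def filter_vocab(self, word_counts):
        """Keep only the words whose count reaches the configured minimum."""
        return {w: c for w, c in word_counts.items() if c >= self._min_count}


model = Word2VecSketch().set_min_count(2)
vocab = model.filter_vocab({"rare": 1, "common": 2, "frequent": 3})
```

Returning the object from the setter keeps it consistent with a builder-style configuration chain, which is the usual reason for exposing a private field this way.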

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/ganonp/spark my-custom-spark

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/3693.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3693


commit 5eb91000cd74ddd7704c79ca69259ee48c5840f9
Author: ganonp gan...@gmail.com
Date:   2014-12-14T21:56:19Z

Added setMinCount to Word2Vec.scala

Wanted to customize the minCount variable in the Word2Vec class. Added
a method to do so.







[GitHub] spark pull request: Added setMinCount to Word2Vec.scala

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3693#issuecomment-66934430
  
Can one of the admins verify this patch?





[GitHub] spark pull request: SPARK-4814 [CORE] Enable assertions in SBT, Ma...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3692#issuecomment-66934745
  
  [Test build #24443 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24443/consoleFull)
 for   PR 3692 at commit 
[`caca704`](https://github.com/apache/spark/commit/caca7047a6bcd672ae5e9657f4b2d5a61ba97cb7).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: SPARK-4814 [CORE] Enable assertions in SBT, Ma...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3692#issuecomment-66934748
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24443/
Test FAILed.





[GitHub] spark pull request: [SQL] SPARK-4700: Add HTTP protocol spark thri...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3672#issuecomment-66936450
  
  [Test build #2 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/2/consoleFull)
 for   PR 3672 at commit 
[`526315d`](https://github.com/apache/spark/commit/526315d0cd3dbf34f7e6b5a3ea6d5187bf96cccb).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `class Analyzer(catalog: Catalog, registry: FunctionRegistry, 
caseSensitive: Boolean)`






[GitHub] spark pull request: [SQL] SPARK-4700: Add HTTP protocol spark thri...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3672#issuecomment-66936452
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/2/
Test FAILed.





[GitHub] spark pull request: [SPARK-1507][YARN]specify num of cores for AM

2014-12-14 Thread sryza
Github user sryza commented on the pull request:

https://github.com/apache/spark/pull/3686#issuecomment-66937888
  
@tgravescs In all other places we've managed to avoid referring to the 
yarn-client AM as the driver and I think blurring this distinction would be 
pretty confusing.

It's not difficult to imagine someone wanting to set a driver-cores 
property for yarn-cluster AMs in their defaults conf file.  Even if someone did 
want to set the yarn-client AM cores, I can't think of a situation where they'd 
want the former number to apply to it.





[GitHub] spark pull request: [Core] Adding a parallelismRatio to control th...

2014-12-14 Thread scwf
GitHub user scwf opened a pull request:

https://github.com/apache/spark/pull/3694

[Core] Adding a parallelismRatio to control the partitions num of 
shuffledRDD

Adds a parallelismRatio setting to control the number of partitions of a 
shuffledRDD. The rule is:
  
 Math.max(1, parallelismRatio * number of partitions of the largest 
upstream RDD)
  
The ratio defaults to 1.0 so that behavior stays compatible with the old 
version. Once we have more experience with it, we can change the default.
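
The rule above can be sketched in plain Python as follows. The function name `shuffled_partition_count` is hypothetical, and truncating the product to an integer is an assumption: the description does not say how a fractional result is rounded.

```python
def shuffled_partition_count(upstream_partition_counts, parallelism_ratio=1.0):
    """Apply max(1, ratio * partitions of the largest upstream RDD).

    `upstream_partition_counts` holds the partition count of each upstream RDD.
    Truncation via int() is an assumption made for this sketch.
    """
    largest = max(upstream_partition_counts)
    return max(1, int(parallelism_ratio * largest))


default_count = shuffled_partition_count([4, 10])      # ratio 1.0 keeps the largest count
halved_count = shuffled_partition_count([4, 10], 0.5)  # ratio 0.5 halves it
floor_count = shuffled_partition_count([4, 10], 0.01)  # never drops below one partition
```

With the default ratio of 1.0 the result equals the partition count of the largest upstream RDD, which matches the compatibility claim in the description.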

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/scwf/spark parallismRatio

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/3694.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3694


commit e6c43ecdf4e49ea73befea8b87fb2a47eec2fb37
Author: wangfei wangf...@huawei.com
Date:   2014-12-14T09:25:17Z

adding spark.default.parallelismRatio

commit 63826ae63bb1f912a6000f0cd958c44579960c1e
Author: wangfei wangf...@huawei.com
Date:   2014-12-14T09:31:58Z

minor fix

commit a71ce3b92a3f49f8035fa14b4249775087203af5
Author: wangfei wangf...@huawei.com
Date:   2014-12-15T01:01:13Z

minor fix

commit f21bfd4904fa340099d190bd3963fefc79f0faa4
Author: wangfei wangf...@huawei.com
Date:   2014-12-15T01:11:15Z

minor fix







[GitHub] spark pull request: [Core] Adding a parallelismRatio to control th...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3694#issuecomment-66940757
  
  [Test build #24445 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24445/consoleFull)
 for   PR 3694 at commit 
[`f21bfd4`](https://github.com/apache/spark/commit/f21bfd4904fa340099d190bd3963fefc79f0faa4).
 * This patch merges cleanly.





[GitHub] spark pull request: [SPARK-4812][SQL] Fix the initialization issue...

2014-12-14 Thread zsxwing
Github user zsxwing commented on the pull request:

https://github.com/apache/spark/pull/3660#issuecomment-66941524
  
I heard from @liancheng about a plan to use two SchemaRDDs from 
different sqlContexts. Is the overridden sqlContext designed to support that? 
Asking @liancheng since you added it in 
https://github.com/liancheng/spark/commit/378410559a1759b3218be15f1fda3a90240c143d





[GitHub] spark pull request: [Core] Adding a parallelismRatio to control th...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3694#issuecomment-66943835
  
  [Test build #24445 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24445/consoleFull)
 for   PR 3694 at commit 
[`f21bfd4`](https://github.com/apache/spark/commit/f21bfd4904fa340099d190bd3963fefc79f0faa4).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [Core] Adding a parallelismRatio to control th...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3694#issuecomment-66943841
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24445/
Test FAILed.





[GitHub] spark pull request: [SQL] SPARK-4700: Add HTTP protocol spark thri...

2014-12-14 Thread liancheng
Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/3672#issuecomment-66944216
  
The most recent test failures should be caused by other components. Let's 
wait for a moment. This PR now LGTM.





[GitHub] spark pull request: [SPARK-4826][Streaming] - Create unique file n...

2014-12-14 Thread harishreedharan
GitHub user harishreedharan opened a pull request:

https://github.com/apache/spark/pull/3695

[SPARK-4826][Streaming] - Create unique file names for each test in WALS...

...uite to avoid conflicts.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/harishreedharan/spark WALSuite-Fix

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/3695.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3695


commit ae63f35d0289255a5d4b71ffb31a5d54c7069269
Author: Hari Shreedharan hshreedha...@apache.org
Date:   2014-12-15T03:32:26Z

[SPARK-4826][Streaming] - Create unique file names for each test in 
WALSuite to avoid conflicts.







[GitHub] spark pull request: [SPARK-4826][Streaming] - Create unique file n...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3695#issuecomment-66946543
  
  [Test build #24446 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24446/consoleFull)
 for   PR 3695 at commit 
[`ae63f35`](https://github.com/apache/spark/commit/ae63f35d0289255a5d4b71ffb31a5d54c7069269).
 * This patch merges cleanly.





[GitHub] spark pull request: [SPARK-4826][Streaming] - Create unique file n...

2014-12-14 Thread harishreedharan
Github user harishreedharan commented on the pull request:

https://github.com/apache/spark/pull/3695#issuecomment-66946690
  
@tdas @JoshRosen Since the directories are created atomically each time the 
suite is run, the conflicts reported in SPARK-4826 were likely caused by 
multiple tests running at the same time and using the same filename (since the 
writeLogSegments method is not thread-safe).
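
One way to give each test its own file name can be sketched as follows. This is only an illustration under assumptions: the object `UniqueLogFiles` and the helper `uniqueLogFile` are hypothetical names, and appending a random UUID is one possible scheme, not necessarily what the patch does.

```scala
import java.io.File
import java.util.UUID

object UniqueLogFiles {
  // Hypothetical helper: derives a file name from the test name plus a random
  // UUID, so two tests running concurrently never write to the same path.
  def uniqueLogFile(dir: File, testName: String): File = {
    // Replace anything that is awkward in a file name with a dash.
    val safeName = testName.replaceAll("[^A-Za-z0-9]+", "-")
    new File(dir, s"$safeName-${UUID.randomUUID()}.log")
  }

  def main(args: Array[String]): Unit = {
    val dir = new File(System.getProperty("java.io.tmpdir"))
    println(uniqueLogFile(dir, "read all logs"))
    println(uniqueLogFile(dir, "read all logs")) // same test name, different file
  }
}
```

Because the name no longer depends on shared state, the approach works even when the code that writes the log segments is not thread-safe.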





[GitHub] spark pull request: [SPARK-4826][Streaming] - Create unique file n...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3695#issuecomment-66946766
  
  [Test build #24447 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24447/consoleFull)
 for   PR 3695 at commit 
[`2948408`](https://github.com/apache/spark/commit/2948408b79a25afd8131eaa79fa6a7f66130f829).
 * This patch merges cleanly.





[GitHub] spark pull request: [SPARK-4826][Streaming] - Create unique file n...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3695#issuecomment-66946801
  
  [Test build #24447 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24447/consoleFull)
 for   PR 3695 at commit 
[`2948408`](https://github.com/apache/spark/commit/2948408b79a25afd8131eaa79fa6a7f66130f829).
 * This patch **fails Scala style tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-4826][Streaming] - Create unique file n...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3695#issuecomment-66946803
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24447/
Test FAILed.





[GitHub] spark pull request: [SPARK-4826][Streaming] - Create unique file n...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3695#issuecomment-66947024
  
  [Test build #24448 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24448/consoleFull)
 for   PR 3695 at commit 
[`2e2bf28`](https://github.com/apache/spark/commit/2e2bf28754cb93c4caeb882c6d7b115e4a86b39a).
 * This patch merges cleanly.





[GitHub] spark pull request: SPARK-4843 [YARN] Squash ExecutorRunnableUtil ...

2014-12-14 Thread ksakellis
GitHub user ksakellis opened a pull request:

https://github.com/apache/spark/pull/3696

SPARK-4843 [YARN] Squash ExecutorRunnableUtil and ExecutorRunnable

ExecutorRunnableUtil is a parent of ExecutorRunnable because of the 
yarn-alpha and yarn-stable split. Now that yarn-alpha is gone, this commit 
squashes the unnecessary hierarchy. The methods from ExecutorRunnableUtil are 
added as private.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/ksakellis/spark kostas-spark-4843

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/3696.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3696


commit 155f308651bde79ec67091986269e1b13c796fae
Author: Kostas Sakellis kos...@cloudera.com
Date:   2014-12-15T04:44:19Z

SPARK-4843 [YARN] Squash ExecutorRunnableUtil and ExecutorRunnable

ExecutorRunnableUtil is a parent of ExecutorRunnable because of the
yarn-alpha and yarn-stable split. Now that yarn-alpha is gone,
this commit squashes the unnecessary hierarchy.







[GitHub] spark pull request: SPARK-4843 [YARN] Squash ExecutorRunnableUtil ...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3696#issuecomment-66949811
  
  [Test build #24449 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24449/consoleFull)
 for   PR 3696 at commit 
[`155f308`](https://github.com/apache/spark/commit/155f308651bde79ec67091986269e1b13c796fae).
 * This patch merges cleanly.





[GitHub] spark pull request: [SQL] SPARK-4700: Add HTTP protocol spark thri...

2014-12-14 Thread judynash
Github user judynash commented on the pull request:

https://github.com/apache/spark/pull/3672#issuecomment-66950036
  
Thank you Cheng for the update.

From: Cheng Lian [mailto:notificati...@github.com]
Sent: Sunday, December 14, 2014 6:57 PM
To: apache/spark
Cc: Judy Nash
Subject: Re: [spark] [SQL] SPARK-4700: Add HTTP protocol spark thrift 
server (#3672)


The most recent test failures should be caused by other components. Let's 
wait for a moment. This PR now LGTM.

—
Reply to this email directly or view it on GitHub: 
https://github.com/apache/spark/pull/3672#issuecomment-66944216






[GitHub] spark pull request: [SPARK-4826][Streaming] - Create unique file n...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3695#issuecomment-66950408
  
  [Test build #24446 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24446/consoleFull)
 for   PR 3695 at commit 
[`ae63f35`](https://github.com/apache/spark/commit/ae63f35d0289255a5d4b71ffb31a5d54c7069269).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-4826][Streaming] - Create unique file n...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3695#issuecomment-66950410
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24446/
Test PASSed.





[GitHub] spark pull request: [SPARK-4826][Streaming] - Create unique file n...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3695#issuecomment-66950933
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24448/
Test PASSed.





[GitHub] spark pull request: [SPARK-4826][Streaming] - Create unique file n...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3695#issuecomment-66950931
  
  [Test build #24448 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24448/consoleFull)
 for   PR 3695 at commit 
[`2e2bf28`](https://github.com/apache/spark/commit/2e2bf28754cb93c4caeb882c6d7b115e4a86b39a).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [MLLIB]SPARK-4846: When the vocabulary size is...

2014-12-14 Thread jinntrance
GitHub user jinntrance opened a pull request:

https://github.com/apache/spark/pull/3697

[MLLIB]SPARK-4846: When the vocabulary size is large, Word2Vec may yield OOM

Make the variables syn0Global and syn1Global lazy, so that the higher-order 
function passed to RDD.mapPartitionsWithIndex does not yield 
OutOfMemoryError(Requested array size exceeds VM limit) when serialized if 
syn0Global and syn1Global are very large arrays.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/jinntrance/spark branch-1.1-word2vec-fix

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/3697.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3697


commit b33dfe7a5ccb6c4a2f46f0953d21e45fca2c7c1f
Author: Joseph J.C. Tang jinntra...@gmail.com
Date:   2014-12-15T02:59:57Z

Make the variables syn0Global and syn1Global lazy, so that the higher-order 
function passed to RDD.mapPartitionsWithIndex does not yield 
OutOfMemoryError(Requested array size exceeds VM limit) when serialized if 
syn0Global and syn1Global are very large arrays.







[GitHub] spark pull request: [MLLIB]SPARK-4846: When the vocabulary size is...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3697#issuecomment-66952147
  
Can one of the admins verify this patch?





[GitHub] spark pull request: [SPARK-4845][Core] Adding a parallelismRatio t...

2014-12-14 Thread scwf
Github user scwf commented on the pull request:

https://github.com/apache/spark/pull/3694#issuecomment-66952763
  
Jenkins, retest this please.





[GitHub] spark pull request: SPARK-4843 [YARN] Squash ExecutorRunnableUtil ...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3696#issuecomment-66952800
  
  [Test build #24449 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24449/consoleFull)
 for   PR 3696 at commit 
[`155f308`](https://github.com/apache/spark/commit/155f308651bde79ec67091986269e1b13c796fae).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: SPARK-4843 [YARN] Squash ExecutorRunnableUtil ...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3696#issuecomment-66952805
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24449/
Test FAILed.





[GitHub] spark pull request: [SPARK-4845][Core] Adding a parallelismRatio t...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3694#issuecomment-66952871
  
  [Test build #24450 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24450/consoleFull)
 for   PR 3694 at commit 
[`f21bfd4`](https://github.com/apache/spark/commit/f21bfd4904fa340099d190bd3963fefc79f0faa4).
 * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-4812][SQL] Fix the initialization issue...

2014-12-14 Thread liancheng
Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/3660#issuecomment-66954133
  
@zsxwing I guess what you mentioned is the plan to support multiple data 
sources via the newly introduced external data source API? The commit you 
mentioned is irrelevant. In that commit I just want to ensure the `SQLContext` 
we used in `InMemoryColumnarTableScan` is the same one used by its child.





[GitHub] spark pull request: [SPARK-4812][SQL] Fix the initialization issue...

2014-12-14 Thread zsxwing
Github user zsxwing commented on the pull request:

https://github.com/apache/spark/pull/3660#issuecomment-66954374
  
> I guess what you mentioned is the plan to support multiple data sources 
> via the newly introduced external data source API?

Yes.

> In that commit I just want to ensure the SQLContext we used in 
> InMemoryColumnarTableScan is the same one used by its child.

In which case would it not be `protected[spark] val sqlContext = 
SparkPlan.currentContext.get()`?





[GitHub] spark pull request: [SPARK-4812][SQL] Fix the initialization issue...

2014-12-14 Thread liancheng
Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/3660#issuecomment-66955023
  
Ah, I see your point: here we reference a field overridden by a subclass in 
the constructor of the parent class. Then I think it's generally not safe to 
allow `sqlContext` to be overridden. How about marking `SparkPlan.sqlContext` 
as `final` to prevent this?
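
The initialization-order pitfall described here can be reproduced in isolation. The sketch below uses hypothetical classes `Parent` and `Child` (not the actual `SparkPlan` code): the parent constructor reads a `val` that the subclass overrides, and at that moment the subclass field has not been assigned yet, so the parent sees `null`.

```scala
object InitOrderDemo {
  // Hypothetical stand-in for a base class that reads one of its own vals
  // while it is still being constructed.
  class Parent {
    val context: String = "parent-context"
    // Runs inside Parent's constructor. The call to `context` is dispatched
    // to the subclass accessor, whose backing field is still null here.
    val contextLength: Int = if (context == null) -1 else context.length
  }

  // Hypothetical subclass that overrides the val.
  class Child extends Parent {
    override val context: String = "child"
  }

  def main(args: Array[String]): Unit = {
    println(new Parent().contextLength) // length of "parent-context"
    println(new Child().contextLength)  // -1: the override was not initialized yet
  }
}
```

Declaring the field as `final val` in the parent makes the override a compile error, which is the effect of the `final` suggestion above.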





[GitHub] spark pull request: [SPARK-4812][SQL] Fix the initialization issue...

2014-12-14 Thread liancheng
Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/3660#issuecomment-66955060
  
Yes, we should remove it.





[GitHub] spark pull request: [SPARK-4812][SQL] Fix the initialization issue...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3660#issuecomment-66955512
  
  [Test build #24451 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24451/consoleFull)
 for   PR 3660 at commit 
[`1cbb623`](https://github.com/apache/spark/commit/1cbb623d5f20471ee116cd54137f00a30c9592ff).
 * This patch merges cleanly.





[GitHub] spark pull request: [SPARK-4812][SQL] Fix the initialization issue...

2014-12-14 Thread zsxwing
Github user zsxwing commented on the pull request:

https://github.com/apache/spark/pull/3660#issuecomment-66955549
  
Done. Also updated the description of this PR and JIRA.





[GitHub] spark pull request: [SPARK-4847][SQL]Fix extraStrategies cannot t...

2014-12-14 Thread jerryshao
GitHub user jerryshao opened a pull request:

https://github.com/apache/spark/pull/3698

[SPARK-4847][SQL]Fix extraStrategies cannot take effect in SQLContext 
issue



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/jerryshao/apache-spark SPARK-4847

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/3698.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3698


commit 4741130819ca02ad6a426a3aeb0f6ef1f972f36e
Author: jerryshao saisai.s...@intel.com
Date:   2014-12-15T06:52:56Z

Make later-added extraStrategies take effect when calling strategies







[GitHub] spark pull request: [SPARK-4845][Core] Adding a parallelismRatio t...

2014-12-14 Thread scwf
Github user scwf commented on the pull request:

https://github.com/apache/spark/pull/3694#issuecomment-66956061
  
Hmm, it seems there are some problems with 
`org.apache.spark.streaming.rdd.WriteAheadLogBackedBlockRDDSuite`, and I 
noticed that other PRs also failed there.





[GitHub] spark pull request: [SPARK-4845][Core] Adding a parallelismRatio t...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3694#issuecomment-66956286
  
  [Test build #24450 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24450/consoleFull) for PR 3694 at commit [`f21bfd4`](https://github.com/apache/spark/commit/f21bfd4904fa340099d190bd3963fefc79f0faa4).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-4845][Core] Adding a parallelismRatio t...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3694#issuecomment-66956294
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24450/





[GitHub] spark pull request: [SPARK-4847][SQL]Fix extraStrategies cannot t...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3698#issuecomment-66956337
  
  [Test build #24452 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24452/consoleFull) for PR 3698 at commit [`4741130`](https://github.com/apache/spark/commit/4741130819ca02ad6a426a3aeb0f6ef1f972f36e).
 * This patch merges cleanly.





[GitHub] spark pull request: [SPARK-4848] Stand-alone cluster: Allow differ...

2014-12-14 Thread nkronenfeld
GitHub user nkronenfeld opened a pull request:

https://github.com/apache/spark/pull/3699

[SPARK-4848] Stand-alone cluster: Allow differences between workers with 
multiple instances

I've changed the stand-alone cluster run scripts to allow different workers 
to have different numbers of instances and different base web UI ports.

I did this by moving the loop over instances from start-slaves to 
start-slave.

In order to stop things properly, I had to make similar changes in 
stop-slaves (and introduce stop-slave).

While I was at it, I changed SPARK_WORKER_PORT to work the same way as 
SPARK_WORKER_UI_PORT, since the new method works fine for both.
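
To illustrate the idea of each worker host expanding its own instance list, here is a minimal Python sketch. It is not the code from this PR (the actual change is in the shell scripts). The names `WorkerInstance` and `plan_instances`, the default base web UI port of 8081, and the "base port plus instance offset" numbering are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class WorkerInstance:
    """One worker process to launch on a host (hypothetical structure)."""
    number: int            # 1-based instance number on this host
    webui_port: int        # web UI port assigned to this instance
    port: Optional[int]    # worker port, or None to let the worker pick one


def plan_instances(instances: int = 1,
                   base_webui_port: int = 8081,
                   base_port: Optional[int] = None) -> List[WorkerInstance]:
    """Expand one host's local settings into per-instance launch parameters.

    Assumption: instance i (counting from 0) uses base port + i for both the
    web UI port and, when a base is given, the worker port.
    """
    if instances < 1:
        raise ValueError("instances must be at least 1")
    return [
        WorkerInstance(
            number=i + 1,
            webui_port=base_webui_port + i,
            port=None if base_port is None else base_port + i,
        )
        for i in range(instances)
    ]


if __name__ == "__main__":
    # Two hosts with different local settings; each host plans its own
    # instances instead of the master looping with one global count.
    for host, settings in {
        "worker-a": {"instances": 3, "base_webui_port": 8081, "base_port": 7078},
        "worker-b": {"instances": 1, "base_webui_port": 9090},
    }.items():
        for inst in plan_instances(**settings):
            print(host, inst)
```

Because the expansion happens per host, a host configured with one instance and a host configured with three can coexist in the same cluster, which is the behaviour the description above is aiming for.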

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/nkronenfeld/spark-1 startup-scripts

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/3699.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3699


commit 479c31c9d3e580879d76146e2a687b5235c87b33
Author: Nathan Kronenfeld nkronenf...@oculusinfo.com
Date:   2014-12-15T06:58:08Z

Move looping through instances from the master to the workers, so that each 
worker respects its own number of instances and web-ui port.







[GitHub] spark pull request: [SPARK-4848] Stand-alone cluster: Allow differ...

2014-12-14 Thread nkronenfeld
Github user nkronenfeld commented on the pull request:

https://github.com/apache/spark/pull/3699#issuecomment-66957137
  
I'm not sure how Mesos and YARN clusters are started/stopped (nor do I have 
such clusters on which to test), so I'm not sure how this will affect them. I 
think the way I did this should be safe, since it's mostly just moving code 
around, but I could use a knowledgeable set of eyes to be sure.





[GitHub] spark pull request: [SPARK-4848] Stand-alone cluster: Allow differ...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3699#issuecomment-66957276
  
  [Test build #24453 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24453/consoleFull) for PR 3699 at commit [`479c31c`](https://github.com/apache/spark/commit/479c31c9d3e580879d76146e2a687b5235c87b33).
 * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: SPARK-4843 [YARN] Squash ExecutorRunnableUtil ...

2014-12-14 Thread ksakellis
Github user ksakellis commented on the pull request:

https://github.com/apache/spark/pull/3696#issuecomment-66958311
  
Hmm, the tests failed, but I'm not sure the failures are related to this 
change. Am I missing something?





[GitHub] spark pull request: [SPARK-4812][SQL] Fix the initialization issue...

2014-12-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/3660#issuecomment-66959907
  
  [Test build #24451 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24451/consoleFull) for PR 3660 at commit [`1cbb623`](https://github.com/apache/spark/commit/1cbb623d5f20471ee116cd54137f00a30c9592ff).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-4812][SQL] Fix the initialization issue...

2014-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/3660#issuecomment-66959913
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24451/

