[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-31 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/11888


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-31 Thread andrewor14
Github user andrewor14 commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-204083429
  
Merged into master, thanks guys.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-31 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203772958
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-31 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203772960
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/54588/
Test PASSed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-31 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203772472
  
**[Test build #54588 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/54588/consoleFull)**
 for PR 11888 at commit 
[`39dd0ee`](https://github.com/apache/spark/commit/39dd0eed15d9c67604cb3430350c4f0f57c5f7e7).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203744481
  
**[Test build #54588 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/54588/consoleFull)**
 for PR 11888 at commit 
[`39dd0ee`](https://github.com/apache/spark/commit/39dd0eed15d9c67604cb3430350c4f0f57c5f7e7).





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread andrewor14
Github user andrewor14 commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203744003
  
LGTM retest this please





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203739189
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/54577/
Test FAILed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203739188
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203739078
  
**[Test build #54577 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/54577/consoleFull)**
 for PR 11888 at commit 
[`39dd0ee`](https://github.com/apache/spark/commit/39dd0eed15d9c67604cb3430350c4f0f57c5f7e7).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203717852
  
**[Test build #54577 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/54577/consoleFull)**
 for PR 11888 at commit 
[`39dd0ee`](https://github.com/apache/spark/commit/39dd0eed15d9c67604cb3430350c4f0f57c5f7e7).





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread andrewor14
Github user andrewor14 commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203675470
  
OK, @rxin and I discussed this more offline. Our proposal is:
```
def getExecutorInfos: Seq[SparkExecutorInfo]

private class SparkExecutorInfoImpl(
    val host: String,
    val port: Int,
    ...
    val cacheSize: Long,
    val numRunningTasks: Int)
  extends SparkExecutorInfo
```
Then we don't need to tie ourselves down to the very specific `host:port` 
format, and we don't have to expose the executor IDs, which are just integers 
that don't mean much.
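
For illustration only, here is a minimal sketch of how a caller might consume a record-style API like the one proposed above. It does not depend on Spark: `ExecutorInfoSketch`, `StatusTrackerSketch`, and `runningTasksByHost` are hypothetical names, the trait is a local stand-in for `SparkExecutorInfo`, and only the four fields spelled out in the proposal are modelled (the elided ones are left out rather than guessed).

```scala
// Local stand-in for the SparkExecutorInfo type named in the proposal.
// Only the four fields written out in the comment above are modelled.
trait SparkExecutorInfo {
  def host: String
  def port: Int
  def cacheSize: Long
  def numRunningTasks: Int
}

// Hypothetical immutable implementation, used only for this sketch.
final case class ExecutorInfoSketch(
    host: String,
    port: Int,
    cacheSize: Long,
    numRunningTasks: Int)
  extends SparkExecutorInfo

// Hypothetical stub tracker: it just returns the executors it was given.
class StatusTrackerSketch(executors: Seq[SparkExecutorInfo]) {
  def getExecutorInfos: Seq[SparkExecutorInfo] = executors
}

object ExecutorInfoDemo {
  // Sums running tasks per host using the structured fields,
  // so the caller never has to parse a "host:port" string.
  def runningTasksByHost(tracker: StatusTrackerSketch): Map[String, Int] =
    tracker.getExecutorInfos
      .groupBy(_.host)
      .map { case (host, infos) => host -> infos.map(_.numRunningTasks).sum }
}
```

Because host and port are separate fields, callers can group or filter on either one, and new fields can be added to the record later without changing the method signature.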





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread rxin
Github user rxin commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203670885
  
Yeah, that integer ID is completely useless to users who want to figure out 
what to do with their clusters.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread andrewor14
Github user andrewor14 commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203665311
  
[Image: https://cloud.githubusercontent.com/assets/2133137/14159586/68f1d598-f68b-11e5-9e1d-0aa4b34a3910.png]

[Image: https://cloud.githubusercontent.com/assets/2133137/14159589/6ca0057a-f68b-11e5-86b9-3dbf92aa713f.png]

```
scala> df.count()
16/03/30 15:21:58 INFO SparkContext: Starting job: count at :27
...
16/03/30 15:21:58 INFO SparkDeploySchedulerBackend: Launching task 7 on executor id: 3 hostname: 192.168.0.209.
16/03/30 15:21:58 INFO SparkDeploySchedulerBackend: Launching task 8 on executor id: 0 hostname: 192.168.0.209.
16/03/30 15:21:58 INFO SparkDeploySchedulerBackend: Launching task 9 on executor id: 5 hostname: 192.168.0.209.
16/03/30 15:21:58 INFO SparkDeploySchedulerBackend: Launching task 10 on executor id: 1 hostname: 192.168.0.209.
16/03/30 15:21:58 INFO SparkDeploySchedulerBackend: Launching task 11 on executor id: 4 hostname: 192.168.0.209.
16/03/30 15:21:58 INFO SparkDeploySchedulerBackend: Launching task 12 on executor id: 2 hostname: 192.168.0.209.
```





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread rxin
Github user rxin commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203658850
  
Can you paste me what an executor looks like? If you just tell me "executor 
id" as an end user, I have no clue what you are talking about.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread rxin
Github user rxin commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203658485
  
What are executor IDs? Is that even an external concept?





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread andrewor14
Github user andrewor14 commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203559116
  
My proposal:

```
def getExecutorIds(): Array[String]
def getExecutorInfo(executorId: String): Option[SparkExecutorInfo]

private class SparkExecutorInfoImpl(
    val id: String,
    val hostport: String, // host:port
    ...
    val cacheSize: Long,
    val numRunningTasks: Int)
  extends SparkExecutorInfo
```

This is more consistent with the existing status API, where we have things 
like
```
def getActiveJobIds(): Array[Int]
def getJobInfo(jobId: Int): Option[SparkJobInfo]
```
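
As a rough sketch of what the ID-based shape implies for callers (not code from this PR), the example below mirrors the `getActiveJobIds` / `getJobInfo` pattern with a stub. `ExecutorInfoById`, `IdBasedTrackerSketch`, `remove`, and `snapshot` are hypothetical names, and only the fields written out in the proposal are included. The point it shows is that the lookup returns an `Option`, since an executor can disappear between listing the IDs and fetching its info.

```scala
// Hypothetical record with just the fields named in the proposal above.
final case class ExecutorInfoById(
    id: String,
    hostport: String, // "host:port"
    cacheSize: Long,
    numRunningTasks: Int)

// Hypothetical stub tracker mirroring the getActiveJobIds/getJobInfo pattern.
class IdBasedTrackerSketch(initial: Map[String, ExecutorInfoById]) {
  // A var only so the demo can simulate an executor going away.
  private var executors = initial

  def getExecutorIds(): Array[String] = executors.keys.toArray.sorted

  def getExecutorInfo(executorId: String): Option[ExecutorInfoById] =
    executors.get(executorId)

  // Demo helper (not part of the proposal): drops an executor.
  def remove(executorId: String): Unit = executors -= executorId
}

object IdLookupDemo {
  // Resolves each ID, silently skipping executors that no longer exist.
  def snapshot(tracker: IdBasedTrackerSketch, ids: Array[String]): Seq[ExecutorInfoById] =
    ids.toSeq.flatMap(id => tracker.getExecutorInfo(id))
}
```

The two-step lookup costs the caller one extra call per executor, which is the trade-off against returning the full records in a single call.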





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread andrewor14
Github user andrewor14 commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203555770
  
@rxin I don't understand the distinction between `getExecutors` and 
`getExecutorList`. Why not just be more explicit about what the strings 
represent?

The other problem is the keys to the other maps are also expected to be 
`host:port` but I would normally expect them to be executor IDs. Also host can 
be confusing because it can be in one of many different formats (e.g. IP addr, 
all the hostname formats in EC2).





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request:

https://github.com/apache/spark/pull/11888#discussion_r57933650
  
--- Diff: core/src/main/scala/org/apache/spark/SparkStatusTracker.scala ---
@@ -104,4 +106,34 @@ class SparkStatusTracker private[spark] (sc: SparkContext) {
   }
 }
   }
+
+  /**
+   * Returns a list of all known executors, represented by string with format: "host:port"
+   */
+  def getExecutors(): Array[String] = {
--- End diff --

actually let me move this to the main thread so it doesn't get collapsed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request:

https://github.com/apache/spark/pull/11888#discussion_r57933508
  
--- Diff: core/src/main/scala/org/apache/spark/SparkStatusTracker.scala ---
@@ -104,4 +106,34 @@ class SparkStatusTracker private[spark] (sc: SparkContext) {
   }
 }
   }
+
+  /**
+   * Returns a list of all known executors, represented by string with format: "host:port"
+   */
+  def getExecutors(): Array[String] = {
--- End diff --

wait, how is `getExecutorList` different from `getExecutors`? Why not just 
be more specific about what the strings are?





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203538706
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/54526/
Test PASSed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203538700
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203537841
  
**[Test build #54526 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/54526/consoleFull)**
 for PR 11888 at commit 
[`0b9400e`](https://github.com/apache/spark/commit/0b9400e2a62f6a723be79264dfeddf947755f3d0).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-30 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-203484294
  
**[Test build #54526 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/54526/consoleFull)**
 for PR 11888 at commit 
[`0b9400e`](https://github.com/apache/spark/commit/0b9400e2a62f6a723be79264dfeddf947755f3d0).





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-28 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request:

https://github.com/apache/spark/pull/11888#discussion_r57655794
  
--- Diff: core/src/main/scala/org/apache/spark/SparkStatusTracker.scala ---
@@ -104,4 +106,34 @@ class SparkStatusTracker private[spark] (sc: SparkContext) {
   }
 }
   }
+
+  /**
+   * Returns a list of all known executors, represented by string with format: "host:port"
+   */
+  def getExecutors(): Array[String] = {
--- End diff --

`getExecutorList` LGTM, I'll rename it to that.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-28 Thread rxin
Github user rxin commented on a diff in the pull request:

https://github.com/apache/spark/pull/11888#discussion_r57654956
  
--- Diff: core/src/main/scala/org/apache/spark/SparkStatusTracker.scala ---
@@ -104,4 +106,34 @@ class SparkStatusTracker private[spark] (sc: SparkContext) {
   }
 }
   }
+
+  /**
+   * Returns a list of all known executors, represented by string with format: "host:port"
+   */
+  def getExecutors(): Array[String] = {
--- End diff --

getExecutorList





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-28 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request:

https://github.com/apache/spark/pull/11888#discussion_r57654799
  
--- Diff: core/src/main/scala/org/apache/spark/SparkStatusTracker.scala ---
@@ -104,4 +106,34 @@ class SparkStatusTracker private[spark] (sc: SparkContext) {
   }
 }
   }
+
+  /**
+   * Returns a list of all known executors, represented by string with format: "host:port"
+   */
+  def getExecutors(): Array[String] = {
--- End diff --

we can expose both, but I'd rather call it something more explicit like 
`getExecutorHostPort` or something. Elsewhere in Spark I would think 
`getExecutors: Array[String]` returns the executor IDs.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-28 Thread rxin
Github user rxin commented on a diff in the pull request:

https://github.com/apache/spark/pull/11888#discussion_r57654386
  
--- Diff: core/src/main/scala/org/apache/spark/SparkStatusTracker.scala ---
@@ -104,4 +106,34 @@ class SparkStatusTracker private[spark] (sc: SparkContext) {
   }
 }
   }
+
+  /**
+   * Returns a list of all known executors, represented by string with format: "host:port"
+   */
+  def getExecutors(): Array[String] = {
--- End diff --

The other thing is I don't know if we want to query the scheduler every 
time we want a list of executors.






[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-28 Thread rxin
Github user rxin commented on a diff in the pull request:

https://github.com/apache/spark/pull/11888#discussion_r57654356
  
--- Diff: core/src/main/scala/org/apache/spark/SparkStatusTracker.scala ---
@@ -104,4 +106,34 @@ class SparkStatusTracker private[spark] (sc: SparkContext) {
   }
 }
   }
+
+  /**
+   * Returns a list of all known executors, represented by string with format: "host:port"
+   */
+  def getExecutors(): Array[String] = {
--- End diff --

If I just want a list of executors, why shouldn't I be able to get them? I 
think it makes sense to have a more detailed version (maybe replace the 
following 2), but having a simple one that returns just the list of executors 
seems to make sense too.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-28 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request:

https://github.com/apache/spark/pull/11888#discussion_r57653397
  
--- Diff: core/src/main/scala/org/apache/spark/SparkStatusTracker.scala ---
@@ -104,4 +106,34 @@ class SparkStatusTracker private[spark] (sc: SparkContext) {
   }
 }
   }
+
+  /**
+   * Returns a list of all known executors, represented by string with format: "host:port"
+   */
+  def getExecutors(): Array[String] = {
--- End diff --

It seems kind of arbitrary that `getExecutors` returns `host:port` but not 
IDs. I think it's better that we make a `SparkExecutorInfo` or something and 
expose the `host:port` there, along with other things like cache size, 
`numRunningTasks`, etc. Then in the future we can add more things we want to 
expose without tying ourselves to the `host:port` identifier.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200324144
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/53925/
Test PASSed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200324143
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200323936
  
**[Test build #53925 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/53925/consoleFull)**
 for PR 11888 at commit 
[`fe80390`](https://github.com/apache/spark/commit/fe8039013efd2e4504168dff264f725fb56665a9).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200276711
  
**[Test build #53925 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/53925/consoleFull)** for PR 11888 at commit [`fe80390`](https://github.com/apache/spark/commit/fe8039013efd2e4504168dff264f725fb56665a9).





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread cloud-fan
Github user cloud-fan commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200276074
  
retest this please





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200273101
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200273071
  
**[Test build #53921 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/53921/consoleFull)** for PR 11888 at commit [`fe80390`](https://github.com/apache/spark/commit/fe8039013efd2e4504168dff264f725fb56665a9).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200273102
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/53921/
Test FAILed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200269875
  
**[Test build #53921 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/53921/consoleFull)** for PR 11888 at commit [`fe80390`](https://github.com/apache/spark/commit/fe8039013efd2e4504168dff264f725fb56665a9).





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread cloud-fan
Github user cloud-fan commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200268796
  
retest this please.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200243417
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/53904/
Test FAILed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200243412
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200242940
  
**[Test build #53904 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/53904/consoleFull)** for PR 11888 at commit [`fe80390`](https://github.com/apache/spark/commit/fe8039013efd2e4504168dff264f725fb56665a9).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200207584
  
**[Test build #53904 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/53904/consoleFull)** for PR 11888 at commit [`fe80390`](https://github.com/apache/spark/commit/fe8039013efd2e4504168dff264f725fb56665a9).





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread cloud-fan
Github user cloud-fan commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200206025
  
retest this please.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200196539
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/53884/
Test FAILed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200196538
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-22 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200196073
  
**[Test build #53884 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/53884/consoleFull)** for PR 11888 at commit [`fe80390`](https://github.com/apache/spark/commit/fe8039013efd2e4504168dff264f725fb56665a9).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-22 Thread rxin
Github user rxin commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200162987
  
It's probably ok.






[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-22 Thread cloud-fan
Github user cloud-fan commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200160851
  
I found it difficult to write tests for this. Since it just collects
information that is already exposed by the system, is it worth testing it
again? cc @rxin





[GitHub] spark pull request: [SPARK-14069][SQL] Improve SparkStatusTracker ...

2016-03-22 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/11888#issuecomment-200159464
  
**[Test build #53884 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/53884/consoleFull)** for PR 11888 at commit [`fe80390`](https://github.com/apache/spark/commit/fe8039013efd2e4504168dff264f725fb56665a9).

