[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55304894
  
  [QA tests have 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/69/consoleFull)
 for   PR 2338 at commit 
[`754b5b8`](https://github.com/apache/spark/commit/754b5b8cefd953e10c78519ba4c6d925c505bc6c).
 * This patch **passes** unit tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread andrewor14
Github user andrewor14 commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55333047
  
LGTM


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request:

https://github.com/apache/spark/pull/2338#discussion_r17451124
  
--- Diff: core/src/main/scala/org/apache/spark/executor/Executor.scala ---
@@ -360,7 +360,13 @@ private[spark] class Executor(
 if (!taskRunner.attemptedTask.isEmpty) {
   Option(taskRunner.task).flatMap(_.metrics).foreach { metrics 
=
 metrics.updateShuffleReadMetrics
-tasksMetrics += ((taskRunner.taskId, metrics))
+if (isLocal) {
+  // make a deep copy of it
--- End diff --

Could you elaborate on why we need to do this in the comment?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread sryza
Github user sryza commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55334340
  
I don't have any great ideas for how to write a test for it, but this looks 
good to me as well.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread davies
Github user davies commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55345239
  
@andrewor14 @sryza done


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55345563
  
  [QA tests have 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20182/consoleFull)
 for   PR 2338 at commit 
[`7c879e0`](https://github.com/apache/spark/commit/7c879e0b3deb7c3d5788cae5dc7e459cce6f84d6).
 * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55349817
  
  [QA tests have 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20182/consoleFull)
 for   PR 2338 at commit 
[`7c879e0`](https://github.com/apache/spark/commit/7c879e0b3deb7c3d5788cae5dc7e459cce6f84d6).
 * This patch **passes** unit tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request:

https://github.com/apache/spark/pull/2338#discussion_r17460026
  
--- Diff: core/src/main/scala/org/apache/spark/executor/Executor.scala ---
@@ -360,7 +360,16 @@ private[spark] class Executor(
 if (!taskRunner.attemptedTask.isEmpty) {
   Option(taskRunner.task).flatMap(_.metrics).foreach { metrics 
=
 metrics.updateShuffleReadMetrics
-tasksMetrics += ((taskRunner.taskId, metrics))
+if (isLocal) {
+  // JobProgressListener will hold an reference of it 
during
--- End diff --

a reference


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request:

https://github.com/apache/spark/pull/2338#discussion_r17460029
  
--- Diff: core/src/main/scala/org/apache/spark/executor/Executor.scala ---
@@ -360,7 +360,16 @@ private[spark] class Executor(
 if (!taskRunner.attemptedTask.isEmpty) {
   Option(taskRunner.task).flatMap(_.metrics).foreach { metrics 
=
 metrics.updateShuffleReadMetrics
-tasksMetrics += ((taskRunner.taskId, metrics))
+if (isLocal) {
+  // JobProgressListener will hold an reference of it 
during
+  // onExecutorMetricsUpdate(), then JobProgressListener 
can not see
--- End diff --

cannot


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread andrewor14
Github user andrewor14 commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55352765
  
Actually I'll just merge them myself. Thanks @davies. This is going into 
master and 1.1.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread andrewor14
Github user andrewor14 commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55352717
  
LGTM pending very minor wording suggestions


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/2338


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-11 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55296407
  
  [QA tests have 
started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/69/consoleFull)
 for   PR 2338 at commit 
[`754b5b8`](https://github.com/apache/spark/commit/754b5b8cefd953e10c78519ba4c6d925c505bc6c).
 * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-10 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55082144
  
**[Tests timed 
out](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20081/consoleFull)**
 after a configured wait of `120m`.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-10 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55185697
  
  [QA tests have 
started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/29/consoleFull)
 for   PR 2338 at commit 
[`754b5b8`](https://github.com/apache/spark/commit/754b5b8cefd953e10c78519ba4c6d925c505bc6c).
 * This patch **does not** merge cleanly!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-10 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55198726
  
  [QA tests have 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/29/consoleFull)
 for   PR 2338 at commit 
[`754b5b8`](https://github.com/apache/spark/commit/754b5b8cefd953e10c78519ba4c6d925c505bc6c).
 * This patch **passes** unit tests.
 * This patch **does not** merge cleanly!



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-09 Thread davies
GitHub user davies opened a pull request:

https://github.com/apache/spark/pull/2338

[SPARK-3465] fix task metrics aggregation in local mode

Before overwrite t.taskMetrics, take a deepcopy of it.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/davies/spark fix_metric

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/2338.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2338


commit 5ca26dcd7e007d0cfb11ab920e98fb46cc9343d7
Author: Davies Liu davies@gmail.com
Date:   2014-09-09T22:36:04Z

fix task metrics aggregation in local mode




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-09 Thread davies
Github user davies commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55047574
  
cc @sryza, is there a better way to fix it?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-09 Thread sryza
Github user sryza commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55066419
  
Hi @davies , sorry for causing this bug and thanks for picking it up.  To 
avoid making the deep copy unnecessarily when running in non-local mode, we 
could instead make it on the executor side, and only do so if isLocal = true.  
Any issues you can see with that?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-09 Thread davies
Github user davies commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55070918
  
@sryza I had changed it to do the copy in Executor, then it's hard to write 
a test now. Any idea?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3465] fix task metrics aggregation in l...

2014-09-09 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/2338#issuecomment-55073847
  
  [QA tests have 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20081/consoleFull)
 for   PR 2338 at commit 
[`754b5b8`](https://github.com/apache/spark/commit/754b5b8cefd953e10c78519ba4c6d925c505bc6c).
 * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org