[GitHub] spark issue #17582: [SPARK-20239][Core] Improve HistoryServer's ACL mechanis...

2017-04-24 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17582 As @vanzin said I think this is fine for now to get this fixed quickly, but filing a follow up jira makes sense. Actually this might be good to get into the 2.1.1 release if they are going to

[GitHub] spark issue #17658: [SPARK-20355] Add per application spark version on the h...

2017-04-24 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17658 +1. @vanzin any further comments? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17582: [SPARK-20239][Core] Improve HistoryServer's ACL mechanis...

2017-04-24 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17582 changes lgtm. Did you file a jira to track changing to not use withSparkUI? If user is downloading because the file is huge and takes a long time to render or causes history server to have issue

[GitHub] spark issue #17445: [SPARK-20115] [CORE] Fix DAGScheduler to recompute all t...

2017-04-21 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17445 there is a large discussion about how to handle fetch failures going on in https://issues.apache.org/jira/browse/SPARK-20178. The fact that you got a fetch failure does not mean that all blocks

[GitHub] spark issue #17700: [SPARK-20391][Core] Rename memory related fields in Exec...

2017-04-20 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17700 Yes that is what I was thinking from the conversation in the jira. We should do that now so as not to cause more compatibility issues.

[GitHub] spark issue #17658: [SPARK-20355] Add per application spark version on the h...

2017-04-18 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17658 the idea here was mine. I agree that it could be confusing but it's also confusing as is and it's hard to find the version that was run. I was figuring this way would be consistent with the live

[GitHub] spark issue #17582: [SPARK-20239][Core] Improve HistoryServer's ACL mechanis...

2017-04-18 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17582 so we should definitely fix the /api/v1/applications//logs to go through the acls. It looks like it should be protected in ApiRootResource.java. You have the app id so it needs to do something

[GitHub] spark issue #17658: [SPARK-20355] Add per application spark version on the h...

2017-04-17 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17658 Jenkins, test this please

[GitHub] spark issue #17582: [SPARK-20239][Core] Improve HistoryServer's ACL mechanis...

2017-04-14 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17582 Sorry again the wording above and all the different configs are a bit confusing to me as to what the real issues are here. >Here actually has two list of acls, one is controlled

[GitHub] spark issue #17625: [SPARK-9103][WIP] Add Memory Tracking UI and track Netty...

2017-04-13 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17625 I haven't looked through the code at all, but I definitely like the idea of tracking the netty memory usage. Breaking into 2 pieces makes sense. If we end up creating any new UI

[GitHub] spark issue #17625: [SPARK-9103][WIP] Add Memory Tracking UI and track Netty...

2017-04-13 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17625 we also just exposed more memory information for storage memory in the executors page in SPARK-17019. If we now have a memory tab it could be confusing to the users where to go to see

[GitHub] spark issue #17582: [SPARK-20239][Core] Improve HistoryServer's ACL mechanis...

2017-04-10 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17582 Sorry but I'm confused by the explanation in the description. I didn't completely follow what problems you are seeing that aren't intended and I don't understand how you

[GitHub] spark issue #17495: [SPARK-20172][Core] Add file permission check when listi...

2017-04-07 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17495 Sorry @jerryshao I know you have a few up but I'm swamped and probably won't get to them until next week.

[GitHub] spark issue #17500: [SPARK-20181] [CORE] tries to bind the port to avoid jet...

2017-04-06 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17500 Jenkins, test this please

[GitHub] spark issue #17485: [SPARK-20163] Kill all running tasks in a stage in case ...

2017-03-30 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17485 see the discussion on the mailing list. We now have 4 different jira for handling fetch failures. I think we should get a design for the entire thing first. personally I don't wa

[GitHub] spark issue #17113: [SPARK-13669][Core] Improve the blacklist mechanism to h...

2017-03-30 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17113 sorry for the delay on this; we have been having some discussion about scheduler changes and the fetch failure handling in the scheduler. Since this is related, I'm holding off on this.

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-29 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r108801550 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -735,7 +749,12 @@ object SparkSubmit extends CommandLineUtils

[GitHub] spark issue #17297: [SPARK-14649][CORE] DagScheduler should not run duplicat...

2017-03-28 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17297 Sounds good to me.

[GitHub] spark issue #17387: [SPARK-20060][Deploy][Kerberos]Support Standalone visiti...

2017-03-27 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17387 Yeah if you plan on adding support for secure hdfs access in standalone mode, it needs a feature jira, probably go through SPIP and make sure everything truly works and is documented. I remember

[GitHub] spark issue #15009: [SPARK-17443][SPARK-11035] Stop Spark Application if lau...

2017-03-23 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/15009 Jenkins, test this please

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-22 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r107508739 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -735,7 +749,12 @@ object SparkSubmit extends CommandLineUtils

[GitHub] spark issue #15009: [SPARK-17443][SPARK-11035] Stop Spark Application if lau...

2017-03-22 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/15009 test this please

[GitHub] spark issue #17387: [SPARK-20060][Deploy][Kerberos][Spark Shell] Obtain cred...

2017-03-22 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17387 Here is a jira from a long time back: https://issues.apache.org/jira/browse/SPARK-2541

[GitHub] spark issue #17387: [SPARK-20060][Deploy][Kerberos][Spark Shell] Obtain cred...

2017-03-22 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17387 I didn't think Spark officially supported kerberos in standalone mode. I'm pretty sure it doesn't work at all even if kinit'd due to a change that went in a long time ba

[GitHub] spark issue #17088: [SPARK-19753][CORE] Un-register all shuffle output on a ...

2017-03-22 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17088 > (a) even the existing behavior will make you do unnecessary work for transient failures and (b) this just slightly increases the amount of work that has to be repeated for those transi

[GitHub] spark issue #14617: [SPARK-17019][Core] Expose on-heap and off-heap memory u...

2017-03-21 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/14617 Checkbox sounds good to me.

[GitHub] spark issue #14617: [SPARK-17019][Core] Expose on-heap and off-heap memory u...

2017-03-20 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/14617 while I kind of like the hover because it doesn't clutter the page, it does bring up a couple concerns: - user can't sort by them - user might not know to hover (none of the o

[GitHub] spark issue #17238: getRackForHost returns None if host is unknown by driver

2017-03-16 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17238 > Actually, to play devil's advocate, the problem @morenn520 is describing is a little more involved. You have a driver running, which has its own view of what the cluster topology is,

[GitHub] spark issue #17113: [SPARK-13669][Core] Improve the blacklist mechanism to h...

2017-03-15 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17113 > Another thing I thought about as I was reviewing this -- spark currently assumes that a fetchfailure is always the fault of the source, never the destination. I almost wonder if we should co

[GitHub] spark pull request #17113: [SPARK-13669][Core] Improve the blacklist mechani...

2017-03-15 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/17113#discussion_r106287979 --- Diff: docs/configuration.md --- @@ -1411,6 +1411,15 @@ Apart from these, the following properties are also available, and may be useful

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-15 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r106258149 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -735,7 +749,12 @@ object SparkSubmit extends CommandLineUtils

[GitHub] spark issue #15009: [SPARK-17443][SPARK-11035] Stop Spark Application if lau...

2017-03-15 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/15009 +1

[GitHub] spark issue #17238: getRackForHost returns None if host is unknown by driver

2017-03-15 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17238 Ok checked tez and mr and they don't do this. Actually in a couple of the input formats it actually adds DEFAULT_RACK if there wasn't any topology information so you would end u

[GitHub] spark issue #17238: getRackForHost returns None if host is unknown by driver

2017-03-15 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17238 If you aren't adding in machines to rack and configuring yarn properly before adding it to your cluster that is a process issue you should fix on your end. I would assume a unracking/rack

[GitHub] spark issue #17303: [SPARK-19112][CORE] add codec for ZStandard

2017-03-15 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17303 this should not be needed just to use to write to hdfs. The regular hadoop input/output type formats have support for it if you are using the right version (I think hadoop 2.8). This

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r106049395 --- Diff: resource-managers/yarn/src/test/scala/org/apache/spark/deploy/yarn/YarnClusterSuite.scala --- @@ -252,20 +307,55 @@ class YarnClusterSuite

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r106049046 --- Diff: resource-managers/yarn/src/test/scala/org/apache/spark/deploy/yarn/YarnClusterSuite.scala --- @@ -226,6 +243,44 @@ class YarnClusterSuite

[GitHub] spark issue #17238: getRackForHost returns None if host is unknown by driver

2017-03-14 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17238 Sorry if I'm missing something here but I don't see why this is a problem? If you have YARN misconfigured or not configured everything is going to default to DEFAULT_RACK. If you wa

[GitHub] spark issue #15009: [SPARK-17443][SPARK-11035] Stop Spark Application if lau...

2017-03-13 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/15009 @kishorvpatil please resolve the conflicts.

[GitHub] spark issue #15009: [SPARK-17443][SPARK-11035] Stop Spark Application if lau...

2017-03-10 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/15009 test this please

[GitHub] spark issue #15009: [SPARK-17443][SPARK-11035] Stop Spark Application if lau...

2017-03-10 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/15009 Test this please

[GitHub] spark issue #17113: [SPARK-13669][Core] Improve the blacklist mechanism to h...

2017-03-09 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17113 sorry haven't had a chance to get to this to do full review, hopefully tomorrow.

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r105261723 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -719,7 +716,24 @@ object SparkSubmit extends CommandLineUtils

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r105259983 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -719,7 +716,24 @@ object SparkSubmit extends CommandLineUtils

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r105267721 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/config.scala --- @@ -349,4 +350,8 @@ package object config

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r105260515 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -719,7 +716,24 @@ object SparkSubmit extends CommandLineUtils

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r105261284 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -735,7 +749,12 @@ object SparkSubmit extends CommandLineUtils

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r105273069 --- Diff: resource-managers/yarn/src/test/scala/org/apache/spark/deploy/yarn/YarnClusterSuite.scala --- @@ -193,6 +193,74 @@ class YarnClusterSuite

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r105282085 --- Diff: resource-managers/yarn/src/test/scala/org/apache/spark/deploy/yarn/YarnClusterSuite.scala --- @@ -193,6 +193,74 @@ class YarnClusterSuite

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r105266591 --- Diff: launcher/src/main/java/org/apache/spark/launcher/SparkAppHandle.java --- @@ -100,6 +100,8 @@ public boolean isFinal

[GitHub] spark issue #15009: [SPARK-17443][SPARK-11035] Stop Spark Application if lau...

2017-03-09 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/15009 test this please

[GitHub] spark issue #15009: [SPARK-17443][SPARK-11035] Stop Spark Application if lau...

2017-03-08 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/15009 Jenkins, test this please

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-07 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r104743802 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/ApplicationMaster.scala --- @@ -174,6 +174,11 @@ private[spark] class

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-07 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r104673881 --- Diff: examples/src/main/java/org/apache/spark/examples/JavaWordCount.java --- @@ -36,17 +36,13 @@ public static void main(String[] args

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-07 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r104674280 --- Diff: launcher/src/main/java/org/apache/spark/launcher/SparkLauncher.java --- @@ -78,9 +78,9 @@ public static final String

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-03-07 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r104673708 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -719,13 +716,17 @@ object SparkSubmit extends CommandLineUtils

[GitHub] spark issue #17088: [SPARK-19753][CORE] Un-register all shuffle output on a ...

2017-03-06 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17088 Note alternatively we could change it to not fail on fetch failure. This would seem better to me since there is no reason to throw away all the work you have done but I'm sure that is a

[GitHub] spark issue #17088: [SPARK-19753][CORE] Un-register all shuffle output on a ...

2017-03-06 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17088 In this particular case are your map tasks fast or slow? If they are really fast rerunning everything now makes sense, if each of those took 1 hour+ to run, failing all when they don't

[GitHub] spark issue #17113: [SPARK-13669][Core] Improve the blacklist mechanism to h...

2017-03-06 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17113 I was not talking about designing this around the killing task part of this, other than in reference to being able to count the # of fetch failures before triggering the blacklisting, but I think

[GitHub] spark issue #17113: [SPARK-13669][Core] Improve the blacklist mechanism to h...

2017-03-06 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17113 > Whether running tasks are interrupted on stage abort or not depends on the state of a config boolean -- and ideally we'd like to get to the point where we can confidently set that c

[GitHub] spark issue #17113: [SPARK-13669][Core] Improve the blacklist mechanism to h...

2017-03-06 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17113 So I looked at this a little more. I'm more ok with this since Spark doesn't actually invalidate the shuffle output. You are basically just trying to stop new tasks from running on the

[GitHub] spark issue #17088: [SPARK-19753][CORE] Un-register all shuffle output on a ...

2017-03-02 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17088 fyi, this is somewhat related to https://github.com/apache/spark/pull/17113 I mention it because I think both depend on how we handle failures and retries. This and that together could cause

[GitHub] spark issue #16291: [SPARK-18838][CORE] Use separate executor service for ea...

2017-03-02 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/16291 @sitalkedia are you still working on this?

[GitHub] spark issue #17113: [SPARK-13669][Core] Improve the blacklist mechanism to h...

2017-03-02 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17113 @jerryshao are you actually seeing issues with this on real customer/production jobs? How often? NM failure for us is very rare. I'm not familiar with how mesos would fail differently

[GitHub] spark issue #17113: [SPARK-13669][Core] Improve the blacklist mechanism to h...

2017-03-01 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/17113 can you clarify the situations in which you are seeing issues? What happened to the NM in this case? If you have work preserving restart I would think this would actually cause you more problems. The NM

[GitHub] spark issue #16819: [SPARK-16441][YARN] Set maxNumExecutor depends on yarn c...

2017-02-27 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/16819 I agree with others, this is not the way to do this. There are different schedulers in yarn, each with different configs that could affect the actual resources you get. If you want to

[GitHub] spark issue #16946: [SPARK-19554][UI,YARN] Allow SHS URL to be used for trac...

2017-02-21 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/16946 On vacation back next Monday and will review.

[GitHub] spark pull request #16923: [SPARK-19038][Hive][YARN] Correctly figure out ke...

2017-02-15 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/16923#discussion_r101304225 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala --- @@ -106,21 +106,31 @@ private[hive] class HiveClientImpl

[GitHub] spark pull request #16923: [SPARK-19038][Hive][YARN] Correctly figure out ke...

2017-02-15 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/16923#discussion_r101302275 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala --- @@ -106,21 +106,31 @@ private[hive] class HiveClientImpl

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r101085286 --- Diff: launcher/src/main/java/org/apache/spark/launcher/SparkLauncher.java --- @@ -528,13 +582,41 @@ public SparkAppHandle startApplication

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r101095913 --- Diff: core/src/main/scala/org/apache/spark/launcher/LauncherBackend.scala --- @@ -71,6 +100,9 @@ private[spark] abstract class LauncherBackend

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r101084346 --- Diff: launcher/src/main/java/org/apache/spark/launcher/SparkLauncher.java --- @@ -528,13 +582,41 @@ public SparkAppHandle startApplication

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r101061331 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -685,9 +686,8 @@ object SparkSubmit extends CommandLineUtils

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r101087592 --- Diff: launcher/src/main/java/org/apache/spark/launcher/package-info.java --- @@ -49,6 +49,39 @@ * * * + * Currently, while

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r101092206 --- Diff: core/src/main/scala/org/apache/spark/launcher/LauncherBackend.scala --- @@ -71,6 +100,9 @@ private[spark] abstract class LauncherBackend

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r101085524 --- Diff: launcher/src/main/java/org/apache/spark/launcher/SparkSubmitRunner.java --- @@ -0,0 +1,65 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r101062900 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -719,7 +719,23 @@ object SparkSubmit extends CommandLineUtils

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r101083260 --- Diff: launcher/src/main/java/org/apache/spark/launcher/SparkLauncher.java --- @@ -107,6 +121,30 @@ public static void setConfig(String name, String

[GitHub] spark issue #16916: [SPARK-19501][YARN] Reduce the number of HDFS RPCs durin...

2017-02-13 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/16916 we should not remove symlink resolution.

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r100320712 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala --- @@ -1149,13 +1179,23 @@ private object Client extends

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r100313659 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -719,7 +719,20 @@ object SparkSubmit extends CommandLineUtils

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r100312242 --- Diff: core/src/main/scala/org/apache/spark/SparkApp.scala --- @@ -0,0 +1,40 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r100318015 --- Diff: launcher/src/main/java/org/apache/spark/launcher/package-info.java --- @@ -49,6 +49,38 @@ * * * + * Currently, for

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r100318779 --- Diff: launcher/src/main/java/org/apache/spark/launcher/package-info.java --- @@ -49,6 +49,38 @@ * * * + * Currently, for

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r100312144 --- Diff: core/src/main/scala/org/apache/spark/SparkApp.scala --- @@ -0,0 +1,40 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r100313512 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -719,7 +719,20 @@ object SparkSubmit extends CommandLineUtils

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r100318547 --- Diff: launcher/src/main/java/org/apache/spark/launcher/package-info.java --- @@ -49,6 +49,38 @@ * * * + * Currently, for

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r100316402 --- Diff: launcher/src/main/java/org/apache/spark/launcher/SparkLauncher.java --- @@ -94,6 +103,13 @@ static final Map launcherConfig = new

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r100316424 --- Diff: launcher/src/main/java/org/apache/spark/launcher/SparkLauncher.java --- @@ -94,6 +103,13 @@ static final Map launcherConfig = new

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r100317631 --- Diff: launcher/src/main/java/org/apache/spark/launcher/SparkSubmitRunner.java --- @@ -0,0 +1,65 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-02-09 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/15009#discussion_r100316057 --- Diff: launcher/src/main/java/org/apache/spark/launcher/LauncherServer.java --- @@ -89,11 +89,32 @@ private static volatile LauncherServer

[GitHub] spark pull request #16650: [SPARK-16554][CORE] Automatically Kill Executors ...

2017-01-31 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/16650#discussion_r98782219 --- Diff: core/src/main/scala/org/apache/spark/scheduler/BlacklistTracker.scala --- @@ -187,6 +198,19 @@ private[scheduler] class BlacklistTracker

[GitHub] spark pull request #16650: [SPARK-16554][CORE] Automatically Kill Executors ...

2017-01-31 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/16650#discussion_r98781265 --- Diff: core/src/main/scala/org/apache/spark/scheduler/BlacklistTracker.scala --- @@ -187,6 +198,19 @@ private[scheduler] class BlacklistTracker

[GitHub] spark pull request #16650: [SPARK-16554][CORE] Automatically Kill Executors ...

2017-01-31 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/16650#discussion_r98781173 --- Diff: core/src/main/scala/org/apache/spark/scheduler/BlacklistTracker.scala --- @@ -173,6 +174,16 @@ private[scheduler] class BlacklistTracker

[GitHub] spark issue #16695: [SPARK-19277][yarn] Localize topology scripts inside Had...

2017-01-26 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/16695 So I'm just curious, in the specific case you saw this issue, what were the configs? The configs on NM had the correct path or the ones on the gateways were only pointing to gateway. I

[GitHub] spark issue #16695: [SPARK-19277][yarn] Localize topology scripts inside Had...

2017-01-26 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/16695 this seems really really specific to the scripts being in the hadoop conf directory and the user using default mapping. I assume the hadoop confs on the nodemanagers have a different config than

[GitHub] spark issue #16704: [SPARK-18750][yarn] Follow up: move test to correct dire...

2017-01-25 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/16704 thanks for fixing, forgot we moved those. +1, go ahead and merge once jenkins passes.

[GitHub] spark issue #16667: [SPARK-18750][yarn] Avoid using "mapValues" when allocat...

2017-01-25 Thread tgravescs
Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/16667 +1

[GitHub] spark pull request #16650: [SPARK-16554][CORE] Automatically Kill Executors ...

2017-01-23 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/16650#discussion_r97342524 --- Diff: core/src/main/scala/org/apache/spark/scheduler/BlacklistTracker.scala --- @@ -50,10 +50,11 @@ import org.apache.spark.util.{Clock, SystemClock

[GitHub] spark pull request #16650: [SPARK-16554][CORE] Automatically Kill Executors ...

2017-01-23 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/16650#discussion_r97341837 --- Diff: core/src/main/scala/org/apache/spark/scheduler/BlacklistTracker.scala --- @@ -168,6 +169,21 @@ private[scheduler] class BlacklistTracker
