[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-04 Thread trystanleftwich
Github user trystanleftwich commented on the pull request: https://github.com/apache/spark/pull/4881#issuecomment-77208611 So to confirm, I think this function needs to be able to handle 5 states: Path is a dir which has subdirs (structure is hdfs://foo/foo1/foo2.jar)

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-04 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/4881#issuecomment-77209251 I tested with this version of `fetchHcfsFile` and my tests pass: /** * Fetch a file or directory from a Hadoop-compatible filesystem. * *

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-04 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/4881#issuecomment-77223185 Hi @trystanleftwich , just tested my code again with more strict checks, and files show up as files, directories show up as directories. --- If your project is set up

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-04 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/4881#discussion_r25805285 --- Diff: core/src/test/scala/org/apache/spark/util/UtilsSuite.scala --- @@ -389,16 +389,30 @@ class UtilsSuite extends FunSuite with ResetSystemProperties {

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-04 Thread andrewor14
Github user andrewor14 commented on the pull request: https://github.com/apache/spark/pull/4881#issuecomment-77224961 Leaving a link to an alternate fix in #4894

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-04 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/4881#issuecomment-77221581 "The code will create a directory local_dir/foo/foo.jar and not a file" Hmm. Let me check that.

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-04 Thread andrewor14
Github user andrewor14 commented on the pull request: https://github.com/apache/spark/pull/4881#issuecomment-77245736 Let's close this PR in favor of #4894, which I just merged. Thanks for reporting this blocker.

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-04 Thread trystanleftwich
Github user trystanleftwich closed the pull request at: https://github.com/apache/spark/pull/4881

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-04 Thread andrewor14
Github user andrewor14 commented on the pull request: https://github.com/apache/spark/pull/4881#issuecomment-77216017 @trystanleftwich I believe that's correct. To summarize: - Before this patch, adding `hdfs://single/file.jar` doesn't work (a regression from Spark 1.2) -

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-04 Thread trystanleftwich
Github user trystanleftwich commented on the pull request: https://github.com/apache/spark/pull/4881#issuecomment-77221033 OK, I've pushed my changes and added tests that should cover all the states. I was getting errors with @vanzin's code snippet if you pass in a dir, i.e. path =

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-04 Thread trystanleftwich
Github user trystanleftwich commented on a diff in the pull request: https://github.com/apache/spark/pull/4881#discussion_r25806270 --- Diff: core/src/test/scala/org/apache/spark/util/UtilsSuite.scala --- @@ -389,16 +389,30 @@ class UtilsSuite extends FunSuite with

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread trystanleftwich
GitHub user trystanleftwich opened a pull request: https://github.com/apache/spark/pull/4880 [SPARK-6144]When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail. While in cluster mode, if you use ADD JAR with an HDFS-sourced jar, it will fail trying to source

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/4880#discussion_r25737673 --- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala --- @@ -632,8 +633,9 @@ private[spark] object Utils extends Logging { fs:

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/4880#discussion_r25741212 --- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala --- @@ -643,7 +645,12 @@ private[spark] object Utils extends Logging {

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread andrewor14
Github user andrewor14 commented on the pull request: https://github.com/apache/spark/pull/4881#issuecomment-77079868 Jenkins, this is ok to test. By the way, in the future it would be good to open this against the master branch. In this case it's fine because the 1.3 branch

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4880#issuecomment-77056801 [Test build #28246 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28246/consoleFull) for PR 4880 at commit

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request: https://github.com/apache/spark/pull/4880#discussion_r25740726 --- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala --- @@ -643,7 +645,12 @@ private[spark] object Utils extends Logging {

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4880#issuecomment-77054200 Can one of the admins verify this patch?

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request: https://github.com/apache/spark/pull/4880#discussion_r25738862 --- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala --- @@ -632,8 +633,9 @@ private[spark] object Utils extends Logging { fs:

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread andrewor14
Github user andrewor14 commented on the pull request: https://github.com/apache/spark/pull/4880#issuecomment-77068164 @trystanleftwich thanks for fixing this. I believe given the current way we call `fetchHcfsFile` your existing patch is sufficient in fixing the problem. However, at

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/4880#issuecomment-77071494 I tried this patch locally and while it works for `addFile(String)`, it seems to not work for `addFile(String, boolean)` (i.e. the directory version). Here's the error I

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/4880#discussion_r25743304 --- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala --- @@ -643,7 +645,12 @@ private[spark] object Utils extends Logging {

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/4880#issuecomment-77062488 @pwendell adding to your radar.

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread andrewor14
Github user andrewor14 commented on the pull request: https://github.com/apache/spark/pull/4880#issuecomment-77072245 Ah, I didn't realize `addFile` also supports directories for Hadoop file systems. Then this does seem to be a correctness problem.

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread trystanleftwich
Github user trystanleftwich commented on the pull request: https://github.com/apache/spark/pull/4880#issuecomment-77079677 I fat-fingered and accidentally closed this ticket, and for some reason it's not picking up that the branch has changes in it. I reopened here:

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4881#issuecomment-77079619 Can one of the admins verify this patch?

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread andrewor14
Github user andrewor14 commented on a diff in the pull request: https://github.com/apache/spark/pull/4880#discussion_r25743237 --- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala --- @@ -643,7 +645,12 @@ private[spark] object Utils extends Logging {

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread trystanleftwich
GitHub user trystanleftwich opened a pull request: https://github.com/apache/spark/pull/4881 [SPARK-6144]When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail. You can merge this pull request into a Git repository by running: $ git pull

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread trystanleftwich
GitHub user trystanleftwich reopened a pull request: https://github.com/apache/spark/pull/4881 [SPARK-6144]When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail. You can merge this pull request into a Git repository by running: $ git pull

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread trystanleftwich
Github user trystanleftwich closed the pull request at: https://github.com/apache/spark/pull/4881

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/4880#issuecomment-77056435 ok to test

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/4880#issuecomment-77060524 LGTM aside from minor style issue. I also think this should really go into 1.3...

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4880#issuecomment-77073312 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4880#issuecomment-77073304 [Test build #28246 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28246/consoleFull) for PR 4880 at commit

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread trystanleftwich
Github user trystanleftwich closed the pull request at: https://github.com/apache/spark/pull/4880

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/4881#discussion_r25748314 --- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala --- @@ -632,8 +633,9 @@ private[spark] object Utils extends Logging { fs:

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4881#issuecomment-77085167 [Test build #625 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/625/consoleFull) for PR 4881 at commit

[GitHub] spark pull request: [SPARK-6144]When in cluster mode using ADD JAR...

2015-03-03 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4881#issuecomment-77093891 [Test build #625 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/625/consoleFull) for PR 4881 at commit