[jira] [Comment Edited] (SPARK-17566) "--master yarn --deploy-mode cluster" gives "Launching Python applications through spark-submit is currently only supported for local files"

2016-09-18 Thread Zhenhua Xu (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-17566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15500502#comment-15500502
 ] 

Zhenhua Xu edited comment on SPARK-17566 at 9/18/16 7:55 AM:
-

[~saisai_shao] I did some debugging, the issue seems to be on Line 634 of 
SparkSubmit.scala. If I change it to

  if(!isYarnCluster) {
sysProps.get("spark.submit.pyFiles").foreach { pyFiles =>
  val resolvedPyFiles = Utils.resolveURIs(pyFiles)
  val formattedPyFiles = 
PythonRunner.formatPaths(resolvedPyFiles).mkString(",")
  sysProps("spark.submit.pyFiles") = formattedPyFiles
}
  }

Things start to work. Not sure how this affect other submit flow though.


was (Author: zhenhua.xu):
[~saisai_shao] I did some debugging, the issue seems to be on Line 634 of 
SparkSubmit.scala. If I change it to

  if(isYarnCluster) {
sysProps.get("spark.submit.pyFiles").foreach { pyFiles =>
  val resolvedPyFiles = Utils.resolveURIs(pyFiles)
  val formattedPyFiles = 
PythonRunner.formatPaths(resolvedPyFiles).mkString(",")
  sysProps("spark.submit.pyFiles") = formattedPyFiles
}
  }

Things start to work. Not sure how this affect other submit flow though.

> "--master yarn --deploy-mode cluster" gives "Launching Python applications 
> through spark-submit is currently only supported for local files"
> 
>
> Key: SPARK-17566
> URL: https://issues.apache.org/jira/browse/SPARK-17566
> Project: Spark
>  Issue Type: Bug
>  Components: Spark Submit
>Affects Versions: 2.0.0
>Reporter: Zhenhua Xu
>
> In Spark 1.6, the following command runs fine with both primary and 
> additional python files in hdfs.
> /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn-cluster 
> hdfs:///tmp/pi.py
> In Spark 2.0.0, the following command fails:
> /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn --deploy-mode 
> cluster hdfs:///tmp/pi.py
> Error:
> Launching Python applications through spark-submit is currently only 
> supported for local files: hdfs:///tmp/base.py



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-17566) "--master yarn --deploy-mode cluster" gives "Launching Python applications through spark-submit is currently only supported for local files"

2016-09-18 Thread Zhenhua Xu (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-17566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15500516#comment-15500516
 ] 

Zhenhua Xu commented on SPARK-17566:


Right, I was retyping the code in the comment editor on my laptop. I corrected 
it now. Thanks.

> "--master yarn --deploy-mode cluster" gives "Launching Python applications 
> through spark-submit is currently only supported for local files"
> 
>
> Key: SPARK-17566
> URL: https://issues.apache.org/jira/browse/SPARK-17566
> Project: Spark
>  Issue Type: Bug
>  Components: Spark Submit
>Affects Versions: 2.0.0
>Reporter: Zhenhua Xu
>
> In Spark 1.6, the following command runs fine with both primary and 
> additional python files in hdfs.
> /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn-cluster 
> hdfs:///tmp/pi.py
> In Spark 2.0.0, the following command fails:
> /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn --deploy-mode 
> cluster hdfs:///tmp/pi.py
> Error:
> Launching Python applications through spark-submit is currently only 
> supported for local files: hdfs:///tmp/base.py



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Comment Edited] (SPARK-17566) "--master yarn --deploy-mode cluster" gives "Launching Python applications through spark-submit is currently only supported for local files"

2016-09-18 Thread Zhenhua Xu (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-17566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15500502#comment-15500502
 ] 

Zhenhua Xu edited comment on SPARK-17566 at 9/18/16 7:46 AM:
-

[~saisai_shao] I did some debugging, the issue seems to be on Line 634 of 
SparkSubmit.scala. If I change it to

  if(isYarnCluster) {
sysProps.get("spark.submit.pyFiles").foreach { pyFiles =>
  val resolvedPyFiles = Utils.resolveURIs(pyFiles)
  val formattedPyFiles = 
PythonRunner.formatPaths(resolvedPyFiles).mkString(",")
  sysProps("spark.submit.pyFiles") = formattedPyFiles
}
  }

Things start to work. Not sure how this affect other submit flow though.


was (Author: zhenhua.xu):
[~saisai_shao] I did some debugging, the issue seems to be on Line 634 of 
SparkSubmit.scala. If I change it to

  if(isYarnCluster) {
sysProps.get("spark.submit.pyFiles").foreach { pyFiles =>
  val resolvedPyFiles = Utils.resolveURIs(pyFiles)
  val formattedPyFiles = 
PythonRunner.formatPaths(resolvedPyFiles).mkString(",")
  sysProps("spark.submit.pyFiles") = formattedPyFiles
}
  }

Things start to work.

> "--master yarn --deploy-mode cluster" gives "Launching Python applications 
> through spark-submit is currently only supported for local files"
> 
>
> Key: SPARK-17566
> URL: https://issues.apache.org/jira/browse/SPARK-17566
> Project: Spark
>  Issue Type: Bug
>  Components: Spark Submit
>Affects Versions: 2.0.0
>Reporter: Zhenhua Xu
>
> In Spark 1.6, the following command runs fine with both primary and 
> additional python files in hdfs.
> /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn-cluster 
> hdfs:///tmp/pi.py
> In Spark 2.0.0, the following command fails:
> /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn --deploy-mode 
> cluster hdfs:///tmp/pi.py
> Error:
> Launching Python applications through spark-submit is currently only 
> supported for local files: hdfs:///tmp/base.py



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-17566) "--master yarn --deploy-mode cluster" gives "Launching Python applications through spark-submit is currently only supported for local files"

2016-09-18 Thread Zhenhua Xu (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-17566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15500502#comment-15500502
 ] 

Zhenhua Xu commented on SPARK-17566:


[~saisai_shao] I did some debugging, the issue seems to be on Line 634 of 
SparkSubmit.scala. If I change it to

  if(isYarnCluster) {
sysProps.get("spark.submit.pyFiles").foreach { pyFiles =>
  val resolvedPyFiles = Utils.resolveURIs(pyFiles)
  val formattedPyFiles = 
PythonRunner.formatPaths(resolvedPyFiles).mkString(",")
  sysProps("spark.submit.pyFiles") = formattedPyFiles
}
  }

Things start to work.

> "--master yarn --deploy-mode cluster" gives "Launching Python applications 
> through spark-submit is currently only supported for local files"
> 
>
> Key: SPARK-17566
> URL: https://issues.apache.org/jira/browse/SPARK-17566
> Project: Spark
>  Issue Type: Bug
>  Components: Spark Submit
>Affects Versions: 2.0.0
>Reporter: Zhenhua Xu
>
> In Spark 1.6, the following command runs fine with both primary and 
> additional python files in hdfs.
> /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn-cluster 
> hdfs:///tmp/pi.py
> In Spark 2.0.0, the following command fails:
> /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn --deploy-mode 
> cluster hdfs:///tmp/pi.py
> Error:
> Launching Python applications through spark-submit is currently only 
> supported for local files: hdfs:///tmp/base.py



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-17566) "--master yarn --deploy-mode cluster" gives "Launching Python applications through spark-submit is currently only supported for local files"

2016-09-16 Thread Zhenhua Xu (JIRA)
Zhenhua Xu created SPARK-17566:
--

 Summary: "--master yarn --deploy-mode cluster" gives "Launching 
Python applications through spark-submit is currently only supported for local 
files"
 Key: SPARK-17566
 URL: https://issues.apache.org/jira/browse/SPARK-17566
 Project: Spark
  Issue Type: Bug
  Components: Spark Submit
Affects Versions: 2.0.0
Reporter: Zhenhua Xu


In Spark 1.6, the following command runs fine with both primary and additional 
python files in hdfs.
/bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn-cluster 
hdfs:///tmp/pi.py

In Spark 2.0.0, the following command fails:
/bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn --deploy-mode 
cluster hdfs:///tmp/pi.py

Error:
Launching Python applications through spark-submit is currently only supported 
for local files: hdfs:///tmp/base.py





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org