[jira] [Comment Edited] (SPARK-17566) "--master yarn --deploy-mode cluster" gives "Launching Python applications through spark-submit is currently only supported for local files"
[ https://issues.apache.org/jira/browse/SPARK-17566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15500502#comment-15500502 ] Zhenhua Xu edited comment on SPARK-17566 at 9/18/16 7:55 AM: - [~saisai_shao] I did some debugging, the issue seems to be on Line 634 of SparkSubmit.scala. If I change it to if(!isYarnCluster) { sysProps.get("spark.submit.pyFiles").foreach { pyFiles => val resolvedPyFiles = Utils.resolveURIs(pyFiles) val formattedPyFiles = PythonRunner.formatPaths(resolvedPyFiles).mkString(",") sysProps("spark.submit.pyFiles") = formattedPyFiles } } Things start to work. Not sure how this affect other submit flow though. was (Author: zhenhua.xu): [~saisai_shao] I did some debugging, the issue seems to be on Line 634 of SparkSubmit.scala. If I change it to if(isYarnCluster) { sysProps.get("spark.submit.pyFiles").foreach { pyFiles => val resolvedPyFiles = Utils.resolveURIs(pyFiles) val formattedPyFiles = PythonRunner.formatPaths(resolvedPyFiles).mkString(",") sysProps("spark.submit.pyFiles") = formattedPyFiles } } Things start to work. Not sure how this affect other submit flow though. > "--master yarn --deploy-mode cluster" gives "Launching Python applications > through spark-submit is currently only supported for local files" > > > Key: SPARK-17566 > URL: https://issues.apache.org/jira/browse/SPARK-17566 > Project: Spark > Issue Type: Bug > Components: Spark Submit >Affects Versions: 2.0.0 >Reporter: Zhenhua Xu > > In Spark 1.6, the following command runs fine with both primary and > additional python files in hdfs. > /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn-cluster > hdfs:///tmp/pi.py > In Spark 2.0.0, the following command fails: > /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn --deploy-mode > cluster hdfs:///tmp/pi.py > Error: > Launching Python applications through spark-submit is currently only > supported for local files: hdfs:///tmp/base.py -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-17566) "--master yarn --deploy-mode cluster" gives "Launching Python applications through spark-submit is currently only supported for local files"
[ https://issues.apache.org/jira/browse/SPARK-17566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15500516#comment-15500516 ] Zhenhua Xu commented on SPARK-17566: Right, I was retyping the code in the comment editor on my laptop. I corrected it now. Thanks. > "--master yarn --deploy-mode cluster" gives "Launching Python applications > through spark-submit is currently only supported for local files" > > > Key: SPARK-17566 > URL: https://issues.apache.org/jira/browse/SPARK-17566 > Project: Spark > Issue Type: Bug > Components: Spark Submit >Affects Versions: 2.0.0 >Reporter: Zhenhua Xu > > In Spark 1.6, the following command runs fine with both primary and > additional python files in hdfs. > /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn-cluster > hdfs:///tmp/pi.py > In Spark 2.0.0, the following command fails: > /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn --deploy-mode > cluster hdfs:///tmp/pi.py > Error: > Launching Python applications through spark-submit is currently only > supported for local files: hdfs:///tmp/base.py -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Comment Edited] (SPARK-17566) "--master yarn --deploy-mode cluster" gives "Launching Python applications through spark-submit is currently only supported for local files"
[ https://issues.apache.org/jira/browse/SPARK-17566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15500502#comment-15500502 ] Zhenhua Xu edited comment on SPARK-17566 at 9/18/16 7:46 AM: - [~saisai_shao] I did some debugging, the issue seems to be on Line 634 of SparkSubmit.scala. If I change it to if(isYarnCluster) { sysProps.get("spark.submit.pyFiles").foreach { pyFiles => val resolvedPyFiles = Utils.resolveURIs(pyFiles) val formattedPyFiles = PythonRunner.formatPaths(resolvedPyFiles).mkString(",") sysProps("spark.submit.pyFiles") = formattedPyFiles } } Things start to work. Not sure how this affect other submit flow though. was (Author: zhenhua.xu): [~saisai_shao] I did some debugging, the issue seems to be on Line 634 of SparkSubmit.scala. If I change it to if(isYarnCluster) { sysProps.get("spark.submit.pyFiles").foreach { pyFiles => val resolvedPyFiles = Utils.resolveURIs(pyFiles) val formattedPyFiles = PythonRunner.formatPaths(resolvedPyFiles).mkString(",") sysProps("spark.submit.pyFiles") = formattedPyFiles } } Things start to work. > "--master yarn --deploy-mode cluster" gives "Launching Python applications > through spark-submit is currently only supported for local files" > > > Key: SPARK-17566 > URL: https://issues.apache.org/jira/browse/SPARK-17566 > Project: Spark > Issue Type: Bug > Components: Spark Submit >Affects Versions: 2.0.0 >Reporter: Zhenhua Xu > > In Spark 1.6, the following command runs fine with both primary and > additional python files in hdfs. > /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn-cluster > hdfs:///tmp/pi.py > In Spark 2.0.0, the following command fails: > /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn --deploy-mode > cluster hdfs:///tmp/pi.py > Error: > Launching Python applications through spark-submit is currently only > supported for local files: hdfs:///tmp/base.py -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-17566) "--master yarn --deploy-mode cluster" gives "Launching Python applications through spark-submit is currently only supported for local files"
[ https://issues.apache.org/jira/browse/SPARK-17566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15500502#comment-15500502 ] Zhenhua Xu commented on SPARK-17566: [~saisai_shao] I did some debugging, the issue seems to be on Line 634 of SparkSubmit.scala. If I change it to if(isYarnCluster) { sysProps.get("spark.submit.pyFiles").foreach { pyFiles => val resolvedPyFiles = Utils.resolveURIs(pyFiles) val formattedPyFiles = PythonRunner.formatPaths(resolvedPyFiles).mkString(",") sysProps("spark.submit.pyFiles") = formattedPyFiles } } Things start to work. > "--master yarn --deploy-mode cluster" gives "Launching Python applications > through spark-submit is currently only supported for local files" > > > Key: SPARK-17566 > URL: https://issues.apache.org/jira/browse/SPARK-17566 > Project: Spark > Issue Type: Bug > Components: Spark Submit >Affects Versions: 2.0.0 >Reporter: Zhenhua Xu > > In Spark 1.6, the following command runs fine with both primary and > additional python files in hdfs. > /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn-cluster > hdfs:///tmp/pi.py > In Spark 2.0.0, the following command fails: > /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn --deploy-mode > cluster hdfs:///tmp/pi.py > Error: > Launching Python applications through spark-submit is currently only > supported for local files: hdfs:///tmp/base.py -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Created] (SPARK-17566) "--master yarn --deploy-mode cluster" gives "Launching Python applications through spark-submit is currently only supported for local files"
Zhenhua Xu created SPARK-17566: -- Summary: "--master yarn --deploy-mode cluster" gives "Launching Python applications through spark-submit is currently only supported for local files" Key: SPARK-17566 URL: https://issues.apache.org/jira/browse/SPARK-17566 Project: Spark Issue Type: Bug Components: Spark Submit Affects Versions: 2.0.0 Reporter: Zhenhua Xu In Spark 1.6, the following command runs fine with both primary and additional python files in hdfs. /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn-cluster hdfs:///tmp/pi.py In Spark 2.0.0, the following command fails: /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn --deploy-mode cluster hdfs:///tmp/pi.py Error: Launching Python applications through spark-submit is currently only supported for local files: hdfs:///tmp/base.py -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org