[ https://issues.apache.org/jira/browse/SPARK-24623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hyukjin Kwon resolved SPARK-24623.
----------------------------------
    Resolution: Incomplete

> Hadoop - Spark Cluster - Python XGBoost - Not working in distributed mode
> -------------------------------------------------------------------------
>
>                 Key: SPARK-24623
>                 URL: https://issues.apache.org/jira/browse/SPARK-24623
>             Project: Spark
>          Issue Type: Bug
>          Components: Deploy
>    Affects Versions: 2.1.1
>         Environment: Hadoop - Hortonworks Cluster
>                      Total Nodes - 18
>                      Worker Nodes - 13
>            Reporter: Abhishek Reddy Chamakura
>            Priority: Major
>              Labels: bulk-closed
>
> Hi,
> We recently installed Python on the Hadoop cluster with a lot of data science
> Python modules, including xgboost, scipy, scikit-learn, and pandas.
> Using PySpark, the data scientists are able to test their scoring models in
> distributed mode on the Hadoop cluster. But with Python xgboost, the
> PySpark job is not distributed and runs on only one instance.
> We are trying to achieve distributed mode when using Python xgboost via
> PySpark.
> It would be a great help if you could direct me on how to achieve this.
> Thanks,
> Abhishek