[ 
https://issues.apache.org/jira/browse/SPARK-13115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15127615#comment-15127615
 ] 

Rahul Tanwani commented on SPARK-13115:
---------------------------------------

Recomputing the same stage over and over is different from progressing at a 
slower rate. I ran the same dataset with 40 cores and 100 GB of memory in 
cluster mode, but the issue remains. Furthermore, even bigger and wider 
datasets than this one have completed in a finite amount of time.

Could you suggest what additional information would help us find and dig into the issue?

> RandomForest is stuck at computing same stage over and over
> -----------------------------------------------------------
>
>                 Key: SPARK-13115
>                 URL: https://issues.apache.org/jira/browse/SPARK-13115
>             Project: Spark
>          Issue Type: Bug
>          Components: ML, MLlib, Spark Core
>    Affects Versions: 1.5.2
>            Reporter: Rahul Tanwani
>         Attachments: Stage details.png, Stages.png, Task details.png
>
>
> While running RandomForest regression, the algorithm keeps computing the 
> same stage and does not proceed any further. I have observed the same stage 
> being computed for more than 11 hours. Attached are some captures from the 
> Spark web UI.
> The Spark event logs for this model run can be fetched from Spark 
> Event Logs 
> (https://s3.amazonaws.com/com.tookitaki.public.logs/spark-event-logs). I am 
> running spark-1.5.2 in the standalone local mode. Also, I would like to know 
> why a stage would be marked as skipped.
> Let me know if you need more information.



