[ 
https://issues.apache.org/jira/browse/SPARK-18750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15726837#comment-15726837
 ] 

Sean Owen commented on SPARK-18750:
-----------------------------------

Hm, I am not immediately sure how these are related. Where does the overflow 
occur -- is it even in Spark? and I wonder why it would be caused by a lot of 
executor requests?

And then why is it asking for so many executors? that seems wrong but is that 
the bug you are reporting?  does the app ask for that many? is dynamic 
allocation on, etc?

This isn't enough info.

> spark should be able to control the number of executor and should not throw 
> stack overslow
> ------------------------------------------------------------------------------------------
>
>                 Key: SPARK-18750
>                 URL: https://issues.apache.org/jira/browse/SPARK-18750
>             Project: Spark
>          Issue Type: Bug
>            Reporter: Neerja Khattar
>
> When running Sql queries on large datasets. Job fails with stack overflow 
> warning and it shows it is requesting lots of executors.
> Looks like there is no limit to number of executors or not even having an 
> upperbound based on yarn available resources.
> 16/11/29 15:47:47 INFO impl.ContainerManagementProtocolProxy: Opening proxy : 
> bdtcstr61n5.svr.us.jpmchase.net:8041 
> 16/11/29 15:47:47 INFO impl.ContainerManagementProtocolProxy: Opening proxy : 
> bdtcstr61n8.svr.us.jpmchase.net:8041 
> 16/11/29 15:47:47 INFO impl.ContainerManagementProtocolProxy: Opening proxy : 
> bdtcstr61n2.svr.us.jpmchase.net:8041 
> 16/11/29 15:47:47 INFO yarn.YarnAllocator: Driver requested a total number of 
> 32770 executor(s). 
> 16/11/29 15:47:47 INFO yarn.YarnAllocator: Will request 24576 executor 
> containers, each with 1 cores and 6758 MB memory including 614 MB overhead 
> 16/11/29 15:49:11 INFO yarn.YarnAllocator: Driver requested a total number of 
> 52902 executor(s). 
> 16/11/29 15:49:11 WARN yarn.ApplicationMaster: Reporter thread fails 1 
> time(s) in a row. 
> java.lang.StackOverflowError 
> at scala.collection.immutable.HashMap.$plus(HashMap.scala:57) 
> at scala.collection.immutable.HashMap.$plus(HashMap.scala:36) 
> at scala.collection.mutable.MapBuilder.$plus$eq(MapBuilder.scala:28) 
> If you notice in the error above, YARN is trying to request 24576 executor 
> containers, whereas the available cores are 1719. The Driver is requesting 
> for 52902 executor(s), which too high. 
> This exception should be fixed



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to