[
https://issues.apache.org/jira/browse/SPARK-4838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14247662#comment-14247662
]
Sean Owen commented on SPARK-4838:
----------------------------------
[~shenhong] This stack trace is very large but does not show any new
information. What I meant was, is there anything different at the root? or was
it not present in your logs? Obviously the problem is a serialization graph
that is way too deeply nested, so more copies of these 5 lines aren't needed,
but it might help to show where the call originated.
> StackOverflowError when serialization task
> ------------------------------------------
>
> Key: SPARK-4838
> URL: https://issues.apache.org/jira/browse/SPARK-4838
> Project: Spark
> Issue Type: Bug
> Components: Spark Core, SQL
> Affects Versions: 1.1.0
> Reporter: Hong Shen
>
> When run a sql with more than 2000 partitions, each partition a HadoopRDD,
> it will cause java.lang.StackOverflowError at serialize task.
> Error message from spark is:Job aborted due to stage failure: Task
> serialization failed: java.lang.StackOverflowError
> java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1508)
> java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1431)
> java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1177)
> java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1547)
> java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1508)
> java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1431)
> java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1177)
> ......
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]