[
https://issues.apache.org/jira/browse/SPARK-11767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15008145#comment-15008145
]
Apache Spark commented on SPARK-11767:
--------------------------------------
User 'davies' has created a pull request for this issue:
https://github.com/apache/spark/pull/9760
> Easy to OOM when cache large column
> -----------------------------------
>
> Key: SPARK-11767
> URL: https://issues.apache.org/jira/browse/SPARK-11767
> Project: Spark
> Issue Type: Improvement
> Reporter: Davies Liu
> Assignee: Davies Liu
>
> The default batch size (10000) does not work well the large column (with
> serialized size about 100k), it's easy to OOM when unrolling the rows.
> We should limit the serialized size of batch.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]