[ 
https://issues.apache.org/jira/browse/SPARK-11767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen updated SPARK-11767:
------------------------------
    Component/s: SQL

> Easy to OOM when cache large column
> -----------------------------------
>
>                 Key: SPARK-11767
>                 URL: https://issues.apache.org/jira/browse/SPARK-11767
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>            Reporter: Davies Liu
>            Assignee: Davies Liu
>
> The default batch size (10000 rows) does not work well with large columns
> (serialized size of about 100k each); it is easy to OOM when unrolling the rows.
> We should limit the serialized size of each batch.
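
For scale: assuming "100k" means roughly 100 KB per serialized value, a
10000-row batch holds about 10000 x 100 KB = 1 GB, all of which is unrolled
in memory at once. The sketch below illustrates the proposed idea of
capping a batch by serialized bytes as well as by row count. It is not
Spark's implementation; the function name batch_by_size, the max_bytes
parameter, and its 4 MB default are hypothetical and chosen only for
illustration.

```python
from typing import Iterable, Iterator, List


def batch_by_size(
    rows: Iterable[bytes],
    max_rows: int = 10000,              # row-count cap, as in the issue
    max_bytes: int = 4 * 1024 * 1024,   # hypothetical byte budget per batch
) -> Iterator[List[bytes]]:
    """Group serialized rows into batches bounded by row count and total bytes."""
    batch: List[bytes] = []
    batch_bytes = 0
    for row in rows:
        # Close the current batch if it is full by row count, or if adding
        # this row would push it past the byte budget.
        if batch and (len(batch) >= max_rows or batch_bytes + len(row) > max_bytes):
            yield batch
            batch, batch_bytes = [], 0
        # A single row larger than max_bytes still gets a batch of its own.
        batch.append(row)
        batch_bytes += len(row)
    if batch:
        yield batch
```

With rows of about 100 KB and a byte budget of 1 MB, each batch holds
around ten rows instead of 10000, so the memory needed to unroll one batch
stays bounded regardless of column width.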



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
