[ 
https://issues.apache.org/jira/browse/SPARK-52582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xiaonan Yang updated SPARK-52582:
---------------------------------
    Description: Today, the XML parser is not memory efficient. It will load 
each XML record into memory first before parsing, which will OOM if the input 
XML record is pretty large. We want to improve the memory usage of the XML 
parser.

> Improve the memory usage of XML parser
> --------------------------------------
>
>                 Key: SPARK-52582
>                 URL: https://issues.apache.org/jira/browse/SPARK-52582
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 4.1
>            Reporter: Xiaonan Yang
>            Priority: Major
>
> Today, the XML parser is not memory efficient. It will load each XML record 
> into memory first before parsing, which will OOM if the input XML record is 
> pretty large. We want to improve the memory usage of the XML parser.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to