[
https://issues.apache.org/jira/browse/SPARK-52582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiaonan Yang updated SPARK-52582:
---------------------------------
Description: Today, the XML parser is not memory efficient. It loads each
XML record fully into memory before parsing it, which can cause an
out-of-memory (OOM) error when a single input record is very large. We want
to reduce the memory usage of the XML parser.
> Improve the memory usage of XML parser
> --------------------------------------
>
> Key: SPARK-52582
> URL: https://issues.apache.org/jira/browse/SPARK-52582
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 4.1
> Reporter: Xiaonan Yang
> Priority: Major
>
> Today, the XML parser is not memory efficient. It loads each XML record
> fully into memory before parsing it, which can cause an out-of-memory (OOM)
> error when a single input record is very large. We want to reduce the
> memory usage of the XML parser.
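
For illustration only, here is a minimal sketch of the general idea of
parsing a record as a stream instead of materializing it first. It is not
Spark's implementation and does not reflect the eventual fix. It uses
Python's standard xml.sax module, and the names RecordStats and stream_stats
are hypothetical. The handler keeps only running counters, so memory use
does not grow with the size of the record's content.

```python
import io
import xml.sax


class RecordStats(xml.sax.ContentHandler):
    """Hypothetical handler: tallies elements and text size without building a tree."""

    def __init__(self):
        super().__init__()
        self.elements = 0
        self.text_chars = 0

    def startElement(self, name, attrs):
        # Called once per opening tag as the parser encounters it.
        self.elements += 1

    def characters(self, content):
        # Text may arrive in several pieces; only its length is retained.
        self.text_chars += len(content)


def stream_stats(source):
    """Parse a file-like XML source incrementally and return (elements, text_chars)."""
    handler = RecordStats()
    xml.sax.parse(source, handler)
    return handler.elements, handler.text_chars


# A single record with one large field, standing in for a large input record.
record = io.BytesIO(b"<row><id>1</id><body>" + b"x" * 100_000 + b"</body></row>")
elements, text_chars = stream_stats(record)
print(elements, text_chars)
```

A real reader would also have to turn these events into rows against a
schema, which this sketch does not attempt.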