[
https://issues.apache.org/jira/browse/SPARK-54220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wenchen Fan resolved SPARK-54220.
---------------------------------
Fix Version/s: 4.1.0
Resolution: Fixed
Issue resolved by pull request 52922
[https://github.com/apache/spark/pull/52922]
> NullType/VOID/UNKNOWN Type Support in Parquet
> ---------------------------------------------
>
> Key: SPARK-54220
> URL: https://issues.apache.org/jira/browse/SPARK-54220
> Project: Spark
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 4.0.1
> Reporter: Ziya Mukhtarov
> Assignee: Ziya Mukhtarov
> Priority: Major
> Labels: parquet, parquetReader, parquetWriter,
> pull-request-available
> Fix For: 4.1.0
>
>
> Currently, Spark does not support NullType columns in Parquet files. This is
> blocked at the ParquetFileFormat level, which throws an exception if there's
> an attempt to read or write NullType column. Parquet has [UNKNOWN logical
> type
> annotation|https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#unknown-always-null],
> which can be used for columns whose values are always null. This can be used
> to write NullType columns into Parquet files, and when reading from Parquet
> files that already has this annotation.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]