Github user dongjoon-hyun commented on the issue:
https://github.com/apache/spark/pull/16281
For this issue, it was initially PARQUET-363, but all maintenance fix (with
new features) are welcome.
For me, the followings?
- PARQUET-99: Large rows cause unnecessary OOM exceptions
- PARQUET-353: Compressors not getting recycled while writing parquet
files, causing memory leak
- PARQUET-363: Cannot construct empty MessageType for
ReadContext.requestedSchema
- PARQUET-511: Integer overflow on counting values in column
- PARQUET-569: ParquetMetadataConverter offset filter is broken
- PARQUET-571: Fix potential leak in ParquetFileReader.close()
- PARQUET-623: DeltaByteArrayReader has incorrect skip behaviour
- PARQUET-645: DictionaryFilter incorrectly handles null
@gatorsmile . Do you have more?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]