Andrei Lopukhov created PARQUET-2216:
----------------------------------------
Summary: Parquet writer classes don't close underlying output
stream in case of errors.
Key: PARQUET-2216
URL: https://issues.apache.org/jira/browse/PARQUET-2216
Project: Parquet
Issue Type: Bug
Components: parquet-mr
Affects Versions: 1.12.3
Reporter: Andrei Lopukhov
Attachments: TestExample.java
org.apache.parquet.io.OutputFile interface does not implement Closeable.
In my opinion it implies that created streams are fully managed by parquet-mr
classes.
Unfortunately opened stream will not be closed in case of IO or other failure.
There are two places I can find for this problem:
* During writer creation
(org.apache.parquet.hadoop.ParquetWriter.Builder#build()) - created stream
should be closed if writer creation fails.
* During writer close(org.apache.parquet.hadoop.ParquetWriter#close) -
underlying stream should be closed regardless of any faced failures.
Although I didn't examine ParquetReaded that much.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)