[
https://issues.apache.org/jira/browse/PARQUET-781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Steve Loughran updated PARQUET-781:
-----------------------------------
Summary: ParquetOutputFormat should support custom OutputCommitter (was:
ParquetOuptputFormat should support custom OutputCommitter)
> ParquetOutputFormat should support custom OutputCommitter
> ---------------------------------------------------------
>
> Key: PARQUET-781
> URL: https://issues.apache.org/jira/browse/PARQUET-781
> Project: Parquet
> Issue Type: Improvement
> Components: parquet-mr
> Reporter: Mikko Kupsu
> Priority: Major
>
> ParquetOutputFormat should support custom OutputCommitter.
> There is a need to bypass current Hadoop functionality of writing output data
> under *_temporary* folder. Especially with AWS S3, there can be huge overhead
> of moving the files from *_temporary* folder to output folder.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]