[
https://issues.apache.org/jira/browse/FLINK-13852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Flink Jira Bot updated FLINK-13852:
-----------------------------------
Labels: auto-deprioritized-major (was: stale-major)
> Support storing in-progress/pending files in different directories
> (StreamingFileSink)
> --------------------------------------------------------------------------------------
>
> Key: FLINK-13852
> URL: https://issues.apache.org/jira/browse/FLINK-13852
> Project: Flink
> Issue Type: New Feature
> Components: Connectors / FileSystem
> Reporter: Gyula Fora
> Priority: Major
> Labels: auto-deprioritized-major
>
> Currently in-progress and pending files are stored in the same directory as
> the final output file. This can be problematic depending on the usage of the
> final output files. One example would be loading the data to hive where we
> can only load all files in a certain directory.
> I suggest we allow specifying a Pending/Inprogress base path where we create
> the same bucketing structure as the final files to store only the non-final
> files.
> To support this we need to extend the RecoverableWriter interface with a new
> open method for example:
> RecoverableFsDataOutputStream open(Path path, Path tmpPath) throws
> IOException;
--
This message was sent by Atlassian Jira
(v8.3.4#803005)