[
https://issues.apache.org/jira/browse/FLINK-13852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Flink Jira Bot updated FLINK-13852:
-----------------------------------
Labels: auto-deprioritized-major auto-deprioritized-minor (was:
auto-deprioritized-major stale-minor)
Priority: Not a Priority (was: Minor)
This issue was labeled "stale-minor" 7 days ago and has not received any
updates, so it is being deprioritized. If this ticket is actually Minor, please
raise the priority and either ask a committer to assign you the issue or revive
the public discussion.
> Support storing in-progress/pending files in different directories
> (StreamingFileSink)
> --------------------------------------------------------------------------------------
>
> Key: FLINK-13852
> URL: https://issues.apache.org/jira/browse/FLINK-13852
> Project: Flink
> Issue Type: New Feature
> Components: Connectors / FileSystem
> Reporter: Gyula Fora
> Priority: Not a Priority
> Labels: auto-deprioritized-major, auto-deprioritized-minor
>
> Currently, in-progress and pending files are stored in the same directory as
> the final output files. This can be problematic depending on how the final
> output files are consumed. One example is loading the data into Hive, where
> we can only load all files in a given directory.
> I suggest we allow specifying a pending/in-progress base path, under which we
> create the same bucketing structure as for the final files, to store only the
> non-final files.
> To support this, we need to extend the RecoverableWriter interface with a new
> open method, for example:
> RecoverableFsDataOutputStream open(Path path, Path tmpPath) throws IOException;
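
The layout proposed above could look roughly like the following. This is only a minimal sketch in plain java.nio, not Flink code: it does not implement RecoverableWriter, and the class name TmpPathSketch, the helper methods, and the bucket and part file names are all hypothetical, chosen for illustration. It shows the non-final file being written under a separate base path that mirrors the bucket structure, then moved to the final location on commit.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Hypothetical illustration only; none of these names exist in Flink.
public class TmpPathSketch {

    // Maps a final file path to the matching path under tmpBase,
    // keeping the same relative bucket structure.
    static Path inProgressPathFor(Path finalBase, Path tmpBase, Path finalFile) {
        Path relative = finalBase.relativize(finalFile);
        return tmpBase.resolve(relative);
    }

    // Writes the non-final file under the temporary base path.
    static Path writeInProgress(Path tmpFile, byte[] data) throws IOException {
        Files.createDirectories(tmpFile.getParent());
        return Files.write(tmpFile, data);
    }

    // "Commits" the file by moving it to its final location.
    static void commit(Path tmpFile, Path finalFile) throws IOException {
        Files.createDirectories(finalFile.getParent());
        Files.move(tmpFile, finalFile, StandardCopyOption.REPLACE_EXISTING);
    }

    public static void main(String[] args) throws IOException {
        Path finalBase = Files.createTempDirectory("final-output");
        Path tmpBase = Files.createTempDirectory("in-progress");

        Path finalFile = finalBase.resolve("bucket-a").resolve("part-0-0");
        Path tmpFile = inProgressPathFor(finalBase, tmpBase, finalFile);

        writeInProgress(tmpFile, "record".getBytes(StandardCharsets.UTF_8));
        System.out.println("final exists before commit: " + Files.exists(finalFile));

        commit(tmpFile, finalFile);
        System.out.println("final exists after commit: " + Files.exists(finalFile));
    }
}
```

With this layout, a consumer that loads every file in a final bucket directory never sees non-final files. One caveat of the sketch: a move between two base paths is only cheap when both are on the same file system, which a real implementation would have to take into account.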