[ 
https://issues.apache.org/jira/browse/FLINK-1572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348807#comment-14348807
 ] 

Robert Metzger commented on FLINK-1572:
---------------------------------------

How about removing the files of a failed job by default (seems to be Hadoop's 
behavior as well: 
http://hadoop.apache.org/docs/r1.2.1/mapred_tutorial.html#OutputCommitter) and 
adding a configuration option to disable behavior?

> Output directories are created before input paths are checked
> -------------------------------------------------------------
>
>                 Key: FLINK-1572
>                 URL: https://issues.apache.org/jira/browse/FLINK-1572
>             Project: Flink
>          Issue Type: Improvement
>          Components: JobManager
>    Affects Versions: 0.9
>            Reporter: Robert Metzger
>            Priority: Minor
>
> Flink is first creating the output directories for a job before creating the 
> input splits.
> If a job's input directories are wrong, the system will have created output 
> directories for a failed job.
> It would be much better if the system is creating the output directories "on 
> demand" before data is actually written.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to