[ 
https://issues.apache.org/jira/browse/FLINK-18962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Piotr Nowojski closed FLINK-18962.
----------------------------------
    Fix Version/s: 1.12.0
       Resolution: Fixed

Merged to master as f8ce30a50b^^..f8ce30a50b.

Thanks for submitting the idea [~NicoK] and addressing the issue 
[~roman_khachatryan] :)

> Improve error message if checkpoint directory is not writable
> -------------------------------------------------------------
>
>                 Key: FLINK-18962
>                 URL: https://issues.apache.org/jira/browse/FLINK-18962
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Checkpointing
>    Affects Versions: 1.11.1
>            Reporter: Nico Kruber
>            Assignee: Roman Khachatryan
>            Priority: Major
>              Labels: pull-request-available, usability
>             Fix For: 1.12.0
>
>
> If the checkpoint directory from {{state.checkpoints.dir}} is not writable by 
> the user that Flink is running with, checkpoints will be declined, but the 
> real cause is not mentioned anywhere:
> * the Web UI says: "Cause: The job has failed" (the Flink job is running 
> though)
> * the JM log says:
> {code}
> 2020-08-14 12:13:18,820 INFO  
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator    [] - Triggering 
> checkpoint 2 (type=CHECKPOINT) @ 1597399998819 for job 
> 2c567b14e8d0833404931ef47dfec266.
> 2020-08-14 12:13:18,921 INFO  
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator    [] - Decline 
> checkpoint 2 by task 0d4fd75374ad16c8d963679e3c2171ec of job 
> 2c567b14e8d0833404931ef47dfec266 at a184deea621e3923fbfcb1d899348448 @ 
> Nico-PC.lan (dataPort=35531).
> {code}
> * the TM log says:
> {code}
> 2020-08-14 12:13:14,102 INFO  
> org.apache.flink.streaming.runtime.tasks.SubtaskCheckpointCoordinatorImpl [] 
> - Checkpoint 1 has been notified as aborted, would not trigger any checkpoint.
> {code}
> And that's it. It should have a real error message indicating that the 
> checkpoint (sub)-directory could not be created.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to