[ 
https://issues.apache.org/jira/browse/GOBBLIN-2152?focusedWorklogId=934316&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-934316
 ]

ASF GitHub Bot logged work on GOBBLIN-2152:
-------------------------------------------

                Author: ASF GitHub Bot
            Created on: 11/Sep/24 17:52
            Start Date: 11/Sep/24 17:52
    Worklog Time Spent: 10m 
      Work Description: Will-Lo merged PR #4047:
URL: https://github.com/apache/gobblin/pull/4047




Issue Time Tracking
-------------------

    Worklog Id:     (was: 934316)
    Time Spent: 1h 20m  (was: 1h 10m)

> Gobblin Temporal Jobs should be properly cleaning up writer staging and 
> output dirs
> -----------------------------------------------------------------------------------
>
>                 Key: GOBBLIN-2152
>                 URL: https://issues.apache.org/jira/browse/GOBBLIN-2152
>             Project: Apache Gobblin
>          Issue Type: Bug
>            Reporter: William Lo
>            Priority: Major
>          Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> Gobblin HDFS writers write to a staging directory, which renames to an output 
> directory, before committing data to the final destination. This is so that 
> commits are atomic and not partially done (unless configured).
> Gobblin needs to clean up these directories in order to manage HDFS space 
> properly so that failed jobs do not maintain their files around. It is 
> critical to ensure that deletion is only done to folders that are configured 
> at a job level, so that any folders that may be shared between jobs or 
> concurrently are not deleted mid-execution.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to