[jira] [Commented] (HDFS-16127) Improper pipeline close recovery causes a permanent write failure or data loss.

Daryn Sharp (Jira) Fri, 16 Jul 2021 12:05:07 -0700


    [ https://issues.apache.org/jira/browse/HDFS-16127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17382288#comment-17382288 ]


Daryn Sharp commented on HDFS-16127:
------------------------------------

+1. This has become a source of very nasty problems.

> Improper pipeline close recovery causes a permanent write failure or data 
> loss.
> -------------------------------------------------------------------------------
>
>                 Key: HDFS-16127
>                 URL: https://issues.apache.org/jira/browse/HDFS-16127
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: Kihwal Lee
>            Assignee: Kihwal Lee
>            Priority: Major
>         Attachments: HDFS-16127.patch
>
>
> When a block is being closed, the data streamer in the client waits for the 
> final ACK to be delivered. If an exception is received during this wait, the 
> close is retried. The assumption behind this retry was invalidated by 
> HDFS-15813, resulting in permanent write failures in some close error cases 
> involving slow nodes. There are also less frequent cases of data loss.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
