[jira] [Comment Edited] (HDFS-5728) [Diskfull] Block recovery will fail if the metafile not having crc for all chunks of the block

Uma Maheswara Rao G (JIRA) Tue, 14 Jan 2014 19:29:54 -0800

    [ 
https://issues.apache.org/jira/browse/HDFS-5728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13871170#comment-13871170
 ]


Uma Maheswara Rao G edited comment on HDFS-5728 at 1/15/14 3:27 AM:
--------------------------------------------------------------------

{quote}
That will be a implicit truncation without recovery being called.
{quote}
Logically we already truncated in memory by having the integrity check. There 
is no use of considering data more than crc bytes covered.  And this truncation 
will not make recovery of block.  This is just making crc and blockFile having 
same length (as data integrity expects). Recovery will make actual block file 
truncation upto where new length proposed for block recovery.


was (Author: umamaheswararao):
{code}
That will be a implicit truncation without recovery being called.
{code}
Logically we already truncated in memory by having the integrity check. There 
is no use of considering data more than crc bytes covered.  And this truncation 
will not make recovery of block.  This is just making crc and blockFile having 
same length (as data integrity expects). Recovery will make actual block file 
truncation upto where new length proposed for block recovery.

> [Diskfull] Block recovery will fail if the metafile not having crc for all 
> chunks of the block
> ----------------------------------------------------------------------------------------------
>
>                 Key: HDFS-5728
>                 URL: https://issues.apache.org/jira/browse/HDFS-5728
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: datanode
>    Affects Versions: 2.2.0
>            Reporter: Vinay
>            Assignee: Vinay
>         Attachments: HDFS-5728.patch
>
>
> 1. Client (regionsever) has opened stream to write its WAL to HDFS. This is 
> not one time upload, data will be written slowly.
> 2. One of the DataNode got diskfull ( due to some other data filled up disks)
> 3. Unfortunately block was being written to only this datanode in cluster, so 
> client write has also failed.
> 4. After some time disk is made free and all processes are restarted.
> 5. Now HMaster try to recover the file by calling recoverLease. 
> At this time recovery was failing saying file length mismatch.
> When checked,
>  actual block file length: 62484480
>  Calculated block length: 62455808
> This was because, metafile was having crc for only 62455808 bytes, and it 
> considered 62455808 as the block size.
> No matter how many times, recovery was continously failing.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Comment Edited] (HDFS-5728) [Diskfull] Block recovery will fail if the metafile not having crc for all chunks of the block

Reply via email to