[ 
https://issues.apache.org/jira/browse/HDFS-1111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12895117#action_12895117
 ] 

Konstantin Shvachko commented on HDFS-1111:
-------------------------------------------

The patch looks good. A couple of nits.

# I don't think we should have new a configuration variable for the number of 
corrupt blocks to return. 
We should just use the constant you introduced. I like the name 
{{MAX_CORRUPT_FILE_BLOCKS_RETURNED}}.
The value is set to 100, which is fine with me. Please speak up if somebody has 
other values in mind.
So variable {{maxListCorruptFilesBlocksReturned}} will not be necessary.
# Consulted with Rob about the name {{listCorruptFilesBlocks()}}. Plural Files 
in the name doesn't sound right, 
as it is not clear whether we return files or blocks. Would be good to change 
it to {{listCorruptFileBlocks()}} throughout the code.
# In {{NamenodeFsck.listCorruptFilesBlocks()}} the printout is not correct. It 
will read one of the following:
{code}
"The filesystem under path '/tmp' has 57 is CORRUPT files"
"The filesystem under path '/tmp' has no is CORRUPT files"
{code}
This should be rephrased. Also the last message can be confusing if you already 
returned some files before. 
We should probably distinguish and say "has no CORRUPT files" if 
startBlockAfter == null, and 
 "has no more CORRUPT files" otherwise.

> getCorruptFiles() should give some hint that the list is not complete
> ---------------------------------------------------------------------
>
>                 Key: HDFS-1111
>                 URL: https://issues.apache.org/jira/browse/HDFS-1111
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: Rodrigo Schmidt
>            Assignee: Rodrigo Schmidt
>         Attachments: HADFS-1111.0.patch, HDFS-1111-y20.1.patch
>
>
> If the list of corruptfiles returned by the namenode doesn't say anything if 
> the number of corrupted files is larger than the call output limit (which 
> means the list is not complete). There should be a way to hint incompleteness 
> to clients.
> A simple hack would be to add an extra entry to the array returned with the 
> value null. Clients could interpret this as a sign that there are other 
> corrupt files in the system.
> We should also do some rephrasing of the fsck output to make it more 
> confident when the list is not complete and less confident when the list is 
> known to be incomplete.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to