RAID blockfixer can use a heuristic to find unfixable files
------------------------------------------------------------
Key: MAPREDUCE-2352
URL: https://issues.apache.org/jira/browse/MAPREDUCE-2352
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: contrib/raid
Reporter: Ramkumar Vadali
Assignee: Ramkumar Vadali
Priority: Minor
It is possible to have corrupt files that were never RAIDed. In that case,
there is no point in submitting a block fixer job for the file. The RAID
code has the function filterUnfixableSourceFiles(), which checks for the
presence of a parity file for each source file. That check is too expensive,
since many of the parity files may be HARed (packed into Hadoop Archives).
Instead, we can use a heuristic that only checks for the presence of the
source file's parent directory in the parity space. If the parent directory is
absent, the parity file cannot be present, and the source file is unfixable.
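
The heuristic described above could be sketched roughly as follows. This is
only an illustration, not the actual contrib/raid code: the class name
ParityDirHeuristic, the methods parityParent() and dropUnfixable(), and the
idea of passing the directory-existence check as a predicate are all
hypothetical, and the sketch assumes the parity space mirrors the source
directory tree under a parity root. Note that the check is one-sided: a
missing parity directory means the file is unfixable, but a present one only
means the file might be fixable.

```java
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Predicate;

// Hypothetical sketch; names and layout are assumptions, not the RAID API.
class ParityDirHeuristic {

    // Maps a source file to the directory that would hold its parity file,
    // assuming the parity space mirrors the source tree under parityRoot.
    static Path parityParent(Path sourceFile, Path sourceRoot, Path parityRoot) {
        Path relativeParent = sourceRoot.relativize(sourceFile.getParent());
        return parityRoot.resolve(relativeParent);
    }

    // Returns the corrupt files whose parity parent directory exists.
    // Files that are dropped were never RAIDed, so they cannot be fixed.
    static List<Path> dropUnfixable(List<Path> corruptFiles,
                                    Path sourceRoot,
                                    Path parityRoot,
                                    Predicate<Path> dirExists) {
        // Cache one lookup per directory: many corrupt files share a parent.
        Map<Path, Boolean> seen = new HashMap<>();
        List<Path> candidates = new ArrayList<>();
        for (Path file : corruptFiles) {
            Path dir = parityParent(file, sourceRoot, parityRoot);
            if (seen.computeIfAbsent(dir, dirExists::test)) {
                candidates.add(file);
            }
        }
        return candidates;
    }
}
```

The existence check is passed in as a predicate so the sketch stays
independent of any particular file system. In Hadoop it would presumably be
backed by a call such as FileSystem.exists() on the parity directory, which
costs one lookup per directory instead of one per parity file.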