[
https://issues.apache.org/jira/browse/HDFS-17812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18040095#comment-18040095
]
ASF GitHub Bot commented on HDFS-17812:
---------------------------------------
github-actions[bot] closed pull request #7826: HDFS-17812. Reset curEntry when
BlockIteratorImpl checks a new subdir
URL: https://github.com/apache/hadoop/pull/7826
> BlockIteratorImpl can miss blocks under certain conditions
> ----------------------------------------------------------
>
> Key: HDFS-17812
> URL: https://issues.apache.org/jira/browse/HDFS-17812
> Project: Hadoop HDFS
> Issue Type: Bug
> Reporter: Felix N
> Assignee: Felix N
> Priority: Minor
> Labels: pull-request-available
>
> # BlockIteratorImpl#getSubdirEntries sorts by strings, meaning subdir20 comes
> before subdir4, which can result in a bigger block id being listed before a
> smaller block id
> # If block cleanup happens before state.curEntry is reset to null, stale
> data can carry over into the next loop
> # Search for a bigger ID than the range in nextSorted(entries,
> state.curEntry) -> null -> subdir is skipped.
> Usually happens during DiskBalancer operations.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]