[ https://issues.apache.org/jira/browse/HDFS-898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12829897#action_12829897 ]
Todd Lipcon commented on HDFS-898:
----------------------------------

You're completely right, now that I think more about it. However, it's a good sanity check since we know the probability of duplicates to be relatively small. Using the correct numbers, the birthday problem approximation is 0.0307 which lines up with yours very closely.

Thanks, and apologies for the junk on the jira.

> Sequential generation of block ids
> ----------------------------------
>
> Key: HDFS-898
> URL: https://issues.apache.org/jira/browse/HDFS-898
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: name-node
> Affects Versions: 0.20.1
> Reporter: Konstantin Shvachko
> Assignee: Konstantin Shvachko
> Fix For: 0.22.0
>
> Attachments: DuplicateBlockIds.patch, HighBitProjection.pdf
>
>
> This is a proposal to replace random generation of block ids with a
> sequential generator in order to avoid block id reuse in the future.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
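
For readers who want to redo the sanity check mentioned in the comment, here is a minimal sketch of the usual birthday problem approximation, p ~ 1 - exp(-n(n-1)/(2N)), for n ids drawn uniformly at random from a space of N values. This message does not say which inputs were used, so the values below (2^30 blocks, a 64-bit id space) and the function name are assumptions chosen only for illustration. With those assumed inputs the result is about 0.0308, in the same range as the 0.0307 quoted above.

```python
import math


def birthday_collision_probability(num_ids: int, id_space: int) -> float:
    """Approximate the chance that at least two of num_ids uniformly random
    ids drawn from id_space possible values are equal."""
    # Standard approximation: p ~ 1 - exp(-n(n-1) / (2N)).
    exponent = num_ids * (num_ids - 1) / (2 * id_space)
    # -expm1(-x) equals 1 - exp(-x) but stays accurate when x is tiny.
    return -math.expm1(-exponent)


if __name__ == "__main__":
    # Hypothetical inputs, not taken from the thread.
    assumed_blocks = 2 ** 30
    assumed_id_space = 2 ** 64
    p = birthday_collision_probability(assumed_blocks, assumed_id_space)
    print(f"approximate collision probability: {p:.4f}")
```

The approximation assumes every id is drawn independently and uniformly, which is the random-generation scheme the proposal wants to replace. A sequential generator makes the question moot because ids are never reused.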