[
https://issues.apache.org/jira/browse/HDFS-12809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16267600#comment-16267600
]
Íñigo Goiri commented on HDFS-12809:
------------------------------------
I think is fine not checking for a random distribution (overdoing it).
Do you mind clearing the checkstyle?
> [READ] Fix the randomized selection of locations in {{ProvidedBlocksBuilder}}.
> ------------------------------------------------------------------------------
>
> Key: HDFS-12809
> URL: https://issues.apache.org/jira/browse/HDFS-12809
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Reporter: Virajith Jalaparti
> Assignee: Virajith Jalaparti
> Attachments: HDFS-12809-HDFS-9806.001.patch,
> HDFS-12809-HDFS-9806.002.patch
>
>
> Calling {{getBlockLocations}} on files that have a PROVIDED replica, results
> in the datanode locations being selected at random. Currently, this
> randomization uses the datanode uuids to pick a node at random
> ({{ProvidedDescriptor#choose}}, {{ProvidedDescriptor#chooseRandom}}).
> Depending on the distribution of the datanode UUIDs, this can lead to large
> number of iterations (which may not terminate) before a location is chosen.
> This JIRA aims to replace this with a more efficient randomization strategy.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]