[ 
https://issues.apache.org/jira/browse/YARN-5686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15610978#comment-15610978
 ] 

Varun Saxena commented on YARN-5686:
------------------------------------

+1. Will commit this pending Jenkins.

> DefaultContainerExecutor random working dir algorigthm skews results
> --------------------------------------------------------------------
>
>                 Key: YARN-5686
>                 URL: https://issues.apache.org/jira/browse/YARN-5686
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Miklos Szegedi
>            Assignee: Vrushali C
>            Priority: Minor
>         Attachments: YARN-5686.001.patch, YARN-5686.002.patch
>
>
> {code}
> long randomPosition = RandomUtils.nextLong() % totalAvailable;
>     ...
>     while (randomPosition > availableOnDisk[dir]) {
>       randomPosition -= availableOnDisk[dir++];
>     }
> {code}
> The code above selects a disk based on the random number weighted by the free 
> space on each disk respectively. For example, if I have two disks with 100 
> bytes each, totalAvailable is 200. The value of randomPosition will be 
> 0..199. 0..99 should select the first disk, 100..199 should select the second 
> disk inclusively. Random number 100 should select the second disk to be fair 
> but this is not the case right now.
> We need to use 
> {code}
> while (randomPosition >= availableOnDisk[dir])
> {code}
> instead of
> {code}
> while (randomPosition > availableOnDisk[dir])
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to