[ 
https://issues.apache.org/jira/browse/HBASE-55?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12568377#action_12568377
 ] 

Bryan Duxbury commented on HBASE-55:
------------------------------------

I think what we actually need to do is better define what "server load" is. 
After all, we're seeking to make the set of region assignments that will cause 
all region servers to have the same average load. 

So to reformulate the name of this issue a little, we need a better way to 
calculate total server load. I'm thinking this should be a function of the 
total size of all regions. The rationale behind this is that the bigger the 
region (ie, the underlying map files), the more time it will take to do gets, 
puts, compacts, etc. In the long run, machines with bigger regions will be more 
utilized than systems with smaller regions. 

So, to balance region assignment, we should sum up all the sizes of all the 
regions currently assigned per server, calculate an average, and then reduce 
the load of overloaded servers by deallocating regions from them, and increase 
the load of underloaded servers by adding newly unassigned regions. 

To incorporate the daughter regions on different machines concept, we can just 
add an additional check to skip the same server as was just assigned to.

> [hbase] Improve Master region assignment function
> -------------------------------------------------
>
>                 Key: HBASE-55
>                 URL: https://issues.apache.org/jira/browse/HBASE-55
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: regionserver
>            Reporter: Bryan Duxbury
>             Fix For: 0.2.0
>
>
> We would like the master's region assignment function to take into account 
> more factors when choosing where to assign regions.
>  
> - More advanced accounting of load on regionserver - memory, # requests, etc
> - Don't deploy both daughter regions to the same regionserver
> - Assign regions where the underlying DFS blocks are hosted if possible
> Please add additional ideas in comments as they come up.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to