[ 
https://issues.apache.org/jira/browse/HBASE-675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12613533#action_12613533
 ] 

Naama Kraus commented on HBASE-675:
-----------------------------------

Returning the region server is one option. Are there other options to consider?
Would it make sense to return the nodes hosting the actual data files that make up
the table split (the HStoreFile blocks), since those are the nodes accessed when
reading the table split's content?

> Report correct server hosting a table split for assignment to for MR Jobs
> -------------------------------------------------------------------------
>
>                 Key: HBASE-675
>                 URL: https://issues.apache.org/jira/browse/HBASE-675
>             Project: Hadoop HBase
>          Issue Type: Improvement
>            Reporter: Billy Pearson
>            Priority: Minor
>             Fix For: 0.3.0
>
>
> Currently we return a null String array to the MR framework, so it picks a 
> random node for MR job assignment.
> class: org.apache.hadoop.hbase.mapred.TableSplit
> function getLocations()
> We should now be able to query the meta table for the current host name of the 
> server hosting the region in question.
> This will help with scaling, as there will be less cross-server communication, 
> removing bandwidth as a bottleneck.
> As a side effect, fixing this will help avoid overloading region servers with 
> lots of MR clients all pulling from the same region server while there's work 
> local for them to do.
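
For illustration only, here is a minimal sketch of the behaviour the issue asks for. It does not use the real HBase or Hadoop classes: RegionSplitSketch and its regionServerHost field are hypothetical stand-ins, and the host name is assumed to have been looked up from the meta table before the split is built. The only part taken from the issue is that getLocations() should report the hosting server instead of nothing.

```java
// Hypothetical stand-in for a table split; NOT the actual HBase TableSplit class.
final class RegionSplitSketch {
    private final String tableName;
    private final String startRow;
    private final String endRow;
    // Assumption: host name of the region server, resolved elsewhere
    // (for example from the meta table). May be null if it is unknown.
    private final String regionServerHost;

    RegionSplitSketch(String tableName, String startRow, String endRow,
                      String regionServerHost) {
        this.tableName = tableName;
        this.startRow = startRow;
        this.endRow = endRow;
        this.regionServerHost = regionServerHost;
    }

    String getTableName() { return tableName; }
    String getStartRow() { return startRow; }
    String getEndRow() { return endRow; }

    // Returns the preferred host(s) for the task that reads this split.
    // An empty array means "no preference", leaving the choice to the scheduler.
    String[] getLocations() {
        if (regionServerHost == null || regionServerHost.isEmpty()) {
            return new String[0];
        }
        return new String[] { regionServerHost };
    }
}
```

With a known host, getLocations() returns a one-element array holding that host; with an unknown host it returns an empty array, so the scheduler is free to pick any node.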

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
