[
https://issues.apache.org/jira/browse/HBASE-675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12613533#action_12613533
]
Naama Kraus commented on HBASE-675:
-----------------------------------
Returning the region server is one option. Are there other options to consider ?
Would it make sense to return the nodes hosting the actual data files composing
the table split (HStoreFiles blocks) ? As they are the ones accessed when
reading the table split content ?
> Report correct server hosting a table split for assignment to for MR Jobs
> -------------------------------------------------------------------------
>
> Key: HBASE-675
> URL: https://issues.apache.org/jira/browse/HBASE-675
> Project: Hadoop HBase
> Issue Type: Improvement
> Reporter: Billy Pearson
> Priority: Minor
> Fix For: 0.3.0
>
>
> Currently we return a null String array to the MR framework to use a random
> node for MR job assignment.
> class: org.apache.hadoop.hbase.mapred.tableSplit
> function getLocations()
> We should be able to query the meta now for the current host name of the
> server hosting the region in question.
> This will help with scaling as there will be less cross server communication
> removing bandwidth as a bottleneck.
> The side effect of fixing this will help from overloading region servers with
> lots of MR clients all pulling from the same region server while theres work
> local for them to do.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.