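Consider running a major compaction, which restores data locality: it rewrites the region's hfiles from the region server currently hosting the region, and HDFS places the first replica of each newly written block on the writing node when that node is also a DataNode.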
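
As a rough illustration of why this helps, here is a minimal sketch of a locality calculation. It is not HBase code: the function name, host names, and block layouts are all made up for the example, and it counts every block equally, whereas the locality figure HBase reports is weighted by block size.

```python
def locality_index(block_replica_hosts, region_server_host):
    """Fraction of hfile blocks that have at least one replica on the
    region server's own host (simplified: every block counts equally)."""
    if not block_replica_hosts:
        return 0.0
    local = sum(1 for hosts in block_replica_hosts if region_server_host in hosts)
    return local / len(block_replica_hosts)


# Hypothetical layout after the region moved to node1: most blocks were
# written by a previous region server, so few replicas live on node1.
before = [
    {"node2", "node3", "node4"},
    {"node1", "node2", "node3"},
    {"node3", "node4", "node5"},
    {"node2", "node4", "node5"},
]

# Hypothetical layout after a major compaction run by the region server on
# node1: the rewritten blocks each get their first replica on node1.
after = [
    {"node1", "node2", "node3"},
    {"node1", "node4", "node5"},
    {"node1", "node2", "node5"},
    {"node1", "node3", "node4"},
]

print(locality_index(before, "node1"))  # 0.25
print(locality_index(after, "node1"))   # 1.0
```

In the HBase shell, a major compaction of a table can be requested with major_compact 'your_table' (the table name here is a placeholder).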

Thanks

> On May 26, 2017, at 6:08 AM, Rajeshkumar J <[email protected]> wrote:
> 
> Thanks Ted. If the data blocks of an hfile may not be on the same node as the
> region server, then how is data locality achieved when MapReduce is run over
> HBase tables?
> 
> 
> 
>> On Fri, May 26, 2017 at 6:15 PM, Ted Yu <[email protected]> wrote:
>> 
>> The hfiles of a region are stored on HDFS. By default, HDFS has a replication
>> factor of 3.
>> If you're not using the read replica feature, any single region is served by
>> one region server (however, the data blocks of an hfile may not be on the
>> same node as the region server).
>> 
>> Cheers
>> 
>> On Thu, May 25, 2017 at 11:45 PM, Rajeshkumar J <[email protected]> wrote:
>> 
>>> Hi,
>>> 
>>>   We have the region max file size set to 10 GB. Do all the hfiles of a
>>> region reside on the same region server, or are they distributed?
>>> 
>>> Thanks
>> 
