Hi,
we have a number of Reducer task each writing a bunch of rows into the latest HBase via Puts. What is working is that each Reducer only creates Puts for one single Region by using HRegionPartionioner. However, we are seeing that the Region flush itself is not local, but going to some other node in the cluster. This puts load on the network. We'd like to see that instead the Reducer would be run on the same node where the region is served. Is that possible? Any ideas or suggestions? Sven
