I am building a Hadoop cluster of approximately 400 nodes. We will run HBase to store structured data and will make heavy use of row versioning. The HBase data will be consumed primarily by MapReduce jobs. Should I install a region server on every data node?
Steve