> > 4. Is there any advantage to put rows close in HBase, if these rows > have a high chance to be queried together? > > Yes.. rows are stored contiguously, sorted by the RowID+ColFam+ColQual+Timestamp. So, your reads are faster if you access contiguous rows (and avoid disk seeks).. You can scan a set of rows and retrieve them..
> 5. Any general rule for row size? > If a row is bigger than the max region size you have given, the region wont split. In other words, rows don't span regions. > > 6. Where does the HReigion host the row keys in HFile or other files? > > It is in HFile..
