Hi all, I am in a hurry to finish a report about whether or not we should host our data in HBase. After many readings and diggings, there still are some questions I cannot find answers. Sorry for brining them up again if you have seen them before. :) If you could answer any of these following questions, I would greatly grateful for that.
1. For cell size, why it should not be larger than 20m in general? 2. What is the block size if the cell is 20m? Can a cell covers multiple blocks? 3. For single cell column family (it has only one cell), does it share the same size limit as cell? In other words, does single column family should be smaller than 20m? 4. Is there any advantage to put rows close in HBase, if these rows have a high chance to be queried together? 5. Any general rule for row size? 6. Where does the HReigion host the row keys in HFile or other files? Many thanks! Your answers would be highly appreciated. William
