Hi, we are running HBase on very low-cost hardware :) Occasionally a random node goes down, and HBase needs some time to move the regions off the failed RegionServer (RS).
What are the best practices for the following?
1. How can we minimize MTTR?
2. Is there any way to gracefully handle the situation where a region is not accessible for reads/writes? I can afford to just drop the data, but I can't wait seconds or minutes for a response. I would like to abort the request after 100 ms and return an empty result.
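
To make the second point concrete, here is a minimal sketch of the behavior I am after, using only the JDK. The class `TimeBoundedRead` and the method `getOrFallback` are hypothetical names made up for illustration, and the `Supplier` stands in for whatever HBase client call does the actual read. It is not an HBase API, just a generic wrapper that gives up after a deadline.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.function.Supplier;

// Hypothetical helper (not part of HBase): runs a read on a worker thread
// and returns a fallback value if no answer arrives within the deadline.
final class TimeBoundedRead {
    // Daemon threads, so a read that is stuck does not keep the JVM alive.
    private static final ExecutorService POOL = Executors.newCachedThreadPool(r -> {
        Thread t = new Thread(r, "time-bounded-read");
        t.setDaemon(true);
        return t;
    });

    private TimeBoundedRead() {}

    // 'read' is an assumption: it stands in for the real (blocking) HBase call.
    static <T> T getOrFallback(Supplier<T> read, long timeoutMs, T fallback) {
        Callable<T> task = read::get;
        Future<T> future = POOL.submit(task);
        try {
            return future.get(timeoutMs, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            // Deadline passed: interrupt the worker and drop the result.
            future.cancel(true);
            return fallback;
        } catch (ExecutionException e) {
            // The read itself failed; treat it the same as "no data".
            return fallback;
        } catch (InterruptedException e) {
            // Restore the interrupt flag for the caller, then give up.
            Thread.currentThread().interrupt();
            future.cancel(true);
            return fallback;
        }
    }
}
```

With this, a read would look something like `getOrFallback(() -> doRead(), 100, Collections.emptyList())`, where `doRead()` is a placeholder for the real call. One caveat: `cancel(true)` only interrupts the worker thread, so the underlying request may keep running in the background. That is why I suspect client-side settings such as `hbase.rpc.timeout` and `hbase.client.operation.timeout` are the better place to enforce this, but I do not know which values are safe.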