HA master on EMR

Austin Heyne Thu, 30 Aug 2018 08:30:40 -0700

HBase on EMR is fairly reliable but is still subject to hardware failures (which have happened to me before). Is there a best practice for adding backup masters to an EMR cluster?

I know this isn't technically a supported feature from AWS, but we're already heavily invested in HBase on EMR and would like to investigate options for mitigating the risk of a master failure. In EMR, if the master dies the entire cluster is terminated, so we need failover for HBase, Hadoop/HDFS and ZooKeeper. The one idea that I've had is to create a second (or third) EMR cluster with its HBase, ZooKeeper and Hadoop/HDFS configuration pointed to the primary cluster. This would in effect add the RegionServers and DataNodes to the primary cluster. I know that losing 1/3 to 1/2 of your DataNodes would most likely mean you would lose some WALs, but re-ingesting the last day's worth of data is an acceptable trade-off for us in exchange for not having downtime.
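
As a rough illustration of the idea above, the sketch below builds the kind of EMR configuration classifications a secondary cluster might be launched with so that its daemons look to the primary cluster. It is only a sketch under assumptions: the function name and the hostname are hypothetical, the NameNode port (8020) and the HBase root directory (/user/hbase) are assumed defaults that should be checked against the primary cluster, and nothing here has been confirmed to work on EMR.

```python
import json


def secondary_cluster_configurations(primary_master_host, namenode_port=8020):
    """Return EMR-style configuration classifications (hypothetical helper).

    The properties point HDFS, HBase and the ZooKeeper quorum of a
    secondary cluster at the primary cluster's master node.
    Port and root directory are assumptions, not verified values.
    """
    hdfs_uri = f"hdfs://{primary_master_host}:{namenode_port}"
    return [
        {
            # DataNodes and HDFS clients use the primary NameNode.
            "Classification": "core-site",
            "Properties": {"fs.defaultFS": hdfs_uri},
        },
        {
            # RegionServers join the primary's ZooKeeper quorum and
            # share its HBase root directory.
            "Classification": "hbase-site",
            "Properties": {
                "hbase.zookeeper.quorum": primary_master_host,
                "hbase.rootdir": f"{hdfs_uri}/user/hbase",
            },
        },
    ]


if __name__ == "__main__":
    # Hypothetical hostname of the primary cluster's master node.
    configs = secondary_cluster_configurations("primary-master.example.internal")
    print(json.dumps(configs, indent=2))
```

The resulting JSON has the shape that EMR accepts as a configurations file at cluster creation; whether the EMR-managed daemons on the secondary cluster would actually honor these overrides is exactly the open question.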

I realize this is a slightly crazy idea and that using something like Kubernetes is the 'correct' solution, but I have to work with what we have and mitigate possible issues. My question is: are there any big issues that anyone would foresee us having with this idea?


Thanks for the feedback,
Austin
