Hello, Sorry if this topic has already been discussed, but I am new to this mailing list and didn't find a way to check for past messages.
Let me introduce myself. My name is Augusto Souza and I am a MSc student in Distributed Systems in University of Campinas (Brazil). One of the possibilities I have been thinking for developing my research is the problem of MapReduce High Availability. There are some open issues in Jira for this topic for quite a long time: https://issues.apache.org/jira/browse/MAPREDUCE-2288 https://issues.apache.org/jira/browse/MAPREDUCE-225 I also found some blog posts about this topic (eg: http://hortonworks.com/blog/high-availability-and-hadoop-1-0-perfect-together/), but I didn't find one global and official solution from the community, is there one? Is there a way I could contribute with this? Are there some resources you guys recommend me to read about this topic? Thanks in advance. Best regards, Augusto Souza