We had a lot of datanodes lock up nearly simaltanuously in our Accumulo
instance. Many more of the tservers also went offline. After about two
hours we were able to get all the datanodes and tservers back online with
no HDFS blocks lost. However we have two tservers throwing about 70
exceptions caused by:
java.IO.EOFException: ..../accumulo/recovery/.../part-r-00000/index not a
For all the exceptions all the
"..../accumulo/recovery/.../part-r-00000/index" files are empty but their
associated ..../accumulo/recovery/.../part-r-00000/data file is not.
Any suggestions on how we can best recover from these exceptions?