Hi,
      I have a 20 node cluster on ec2(small instance).... i have a set of
tables which store huge amount of data (tried wid 10,000 rows... more to be
added).... but during my map reduce jobs, some of the region servers shut
down thereby causing data loss, stop in my program execution and infact one
of my tables got damaged. when ever i scan the table, i get the could not
obtain block error.

1. i want to make the cluster more robust. since it contains a lot of data.
and its really important that they remain stable.
2. if one of my tables gets damaged (even after restarting dfs n hbase), how
do i go about recovering it?

my ec2 cluster mostly has the default configuration.
with hadoop-site n hbase-site have some entries pertaining to map-reduce
(for example. num of map tasks, mapred.task.timeout etc).

Your help will be greatly appreciated.
Thanks,
Raakhi Khatwani

Reply via email to