Hi,
I have been running application on my YARN cluster since around 20 days,
about 5000 applications a day. I am getting the following error today.
Please let me know how can I avoid this, is this happening because of a bug?
What is your replication factor on hdfs?
Did any of your datanode go down recently and is not back in rotation?
Did you delete any hdfs blocks directly from datanodes?
On May 6, 2013 2:28 PM, Krishna Kishore Bonagiri write2kish...@gmail.com
wrote:
Hi,
I have been running application on my
Hi Kishore,
It should not be a bug. After restarting HDFS, namenode will enter safe
mode until all needed data is collected. During safe mode, all update
operations will fail.
In some cases, as Nitin mentioned, namenode will never leave safe mode
because it can't get enough data. In that case
Hi Nithin Ted,
Thanks for the replies.
I don't know what my replication factor is, I don't seem to have set
anything in my configuration files. I run on a single node cluster. My data
node has gone down and came back, and also I didn't delete any of the hdfs
blocks.
I know that name node
Hi Nithin Ted,
Thanks for the replies.
I don't know what my replication factor is, I don't seem to have set
anything in my configuration files. I run on a single node cluster. My data
node has gone down and came back, and also I didn't delete any of the hdfs
blocks.
I know that name node
If you are running a single node and you have not changed the hdfs
configuration for repliaction then he default is set to 3 and you will just
have one available.
As long as this is not a production environment you can force namenode to
leave safe mode. Only issue in this will be in case any of