TransferFSImage should timeout
------------------------------
Key: HDFS-1490
URL: https://issues.apache.org/jira/browse/HDFS-1490
Project: Hadoop HDFS
Issue Type: Bug
Components: name-node
Reporter: Dmytro Molkov
Assignee: Dmytro Molkov
Priority: Minor
Sometimes when primary crashes during image transfer secondary namenode would
hang trying to read the image from HTTP connection forever.
It would be great to set timeouts on the connection so if something like that
happens there is no need to restart the secondary itself.
In our case restarting components is handled by the set of scripts and since
the Secondary as the process is running it would just stay hung until we get an
alarm saying the checkpointing doesn't happen.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.