Collector left in a bad state after temprorary NN outage
--------------------------------------------------------
Key: CHUKWA-487
URL: https://issues.apache.org/jira/browse/CHUKWA-487
Project: Hadoop Chukwa
Issue Type: Bug
Components: data collection
Affects Versions: 0.4.0
Reporter: Bill Graham
When the name node returns errors to the collector, at some point the collector
dies half way. This behavior should be changed to either resemble the agents
and keep trying, or to completely shutdown. Instead, what I'm seeing is that
the collector logs that it's shutting down, and the var/pidDir/Collector.pid
file gets removed, but the collector continues to run, albeit not handling new
data. Instead, this log entry is repeated ad infinitum:
2010-05-06 17:35:06,375 INFO Timer-1 root -
stats:ServletCollector,numberHTTPConnection:0,numberchunks:0
2010-05-06 17:36:06,379 INFO Timer-1 root -
stats:ServletCollector,numberHTTPConnection:0,numberchunks:0
2010-05-06 17:37:06,384 INFO Timer-1 root -
stats:ServletCollector,numberHTTPConnection:0,numberchunks:0
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.