Carol Pearson created TRAFODION-2245:
----------------------------------------
Summary: Multiple sqcheck and jps processes running when monitor
is downed and up as dcsserver checks if trafodion is up
Key: TRAFODION-2245
URL: https://issues.apache.org/jira/browse/TRAFODION-2245
Project: Apache Trafodion
Issue Type: Bug
Components: dcs
Affects Versions: 2.1-incubating
Environment: Testing trafodion when failures occurred. HDP 2.4 distro
contents and a standard installation on CentOS 6
Reporter: Carol Pearson
Dcsserver checks if Trafodion is running by using sqcheck. That can hang in
some circumstances
In this case we had a DTM failure and recovery took a while. The node went to a
SoftDown state as the DTM recovered. Meanwhile, dcsserver was looking for
trafodion to come up so that it could start the mxosrvrs on that node. That
resulted in many hung sqchecks - the notable symptom is that they all had the
same ppid.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)