[
https://issues.apache.org/jira/browse/TRAFODION-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
David Wayne Birdsall updated TRAFODION-2142:
--------------------------------------------
Description: In development environments, developers often use local_hadoop
for unit and developer regression testing. Often these test environments are on
workstations shared between many developers. When running regressions
overnight, quite frequently the HMaster process will die due to timeouts if the
workstation is particularly busy. This sometimes causes HBase errors during the
tests but more often causes hangs. It would be nice to have a tool that will
monitor HMaster and if it goes away, try to restart it. It has been observed
that restarting it often resolves the hangs, allowing the regression run to
continue. (was: In development environments, developers often use local_hadoop
for unit and developer regression testing. Often these test environments are on
workstations shared between many developers. When running regressions
overnight, quite frequently the HMaster process will die due to timeouts if the
workstation is particularly busy. This sometimes causes HBase errors during the
tests but more often causes hangs. It would be nice to have a tool that will
monitor HMaster and if it goes away, try to restart it.)
> Test script to restart HBase automatically in local_hadoop test settings
> ------------------------------------------------------------------------
>
> Key: TRAFODION-2142
> URL: https://issues.apache.org/jira/browse/TRAFODION-2142
> Project: Apache Trafodion
> Issue Type: Improvement
> Components: foundation
> Affects Versions: any
> Reporter: David Wayne Birdsall
> Assignee: David Wayne Birdsall
> Priority: Minor
> Fix For: 2.1-incubating
>
>
> In development environments, developers often use local_hadoop for unit and
> developer regression testing. Often these test environments are on
> workstations shared between many developers. When running regressions
> overnight, quite frequently the HMaster process will die due to timeouts if
> the workstation is particularly busy. This sometimes causes HBase errors
> during the tests but more often causes hangs. It would be nice to have a tool
> that will monitor HMaster and if it goes away, try to restart it. It has been
> observed that restarting it often resolves the hangs, allowing the regression
> run to continue.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)