[ 
https://issues.apache.org/jira/browse/TRAFODION-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

David Wayne Birdsall updated TRAFODION-2142:
--------------------------------------------
    Description: In development environments, developers often use local_hadoop 
for unit and developer regression testing. Often these test environments are on 
workstations shared between many developers. When running regressions 
overnight, quite frequently the HMaster process will die due to timeouts if the 
workstation is particularly busy. This sometimes causes HBase errors during the 
tests but more often causes hangs. It would be nice to have a tool that will 
monitor HMaster and if it goes away, try to restart it. It has been observed 
that restarting it often resolves the hangs, allowing the regression run to 
continue.  (was: In development environments, developers often use local_hadoop 
for unit and developer regression testing. Often these test environments are on 
workstations shared between many developers. When running regressions 
overnight, quite frequently the HMaster process will die due to timeouts if the 
workstation is particularly busy. This sometimes causes HBase errors during the 
tests but more often causes hangs. It would be nice to have a tool that will 
monitor HMaster and if it goes away, try to restart it.)

> Test script to restart HBase automatically in local_hadoop test settings
> ------------------------------------------------------------------------
>
>                 Key: TRAFODION-2142
>                 URL: https://issues.apache.org/jira/browse/TRAFODION-2142
>             Project: Apache Trafodion
>          Issue Type: Improvement
>          Components: foundation
>    Affects Versions: any
>            Reporter: David Wayne Birdsall
>            Assignee: David Wayne Birdsall
>            Priority: Minor
>             Fix For: 2.1-incubating
>
>
> In development environments, developers often use local_hadoop for unit and 
> developer regression testing. Often these test environments are on 
> workstations shared between many developers. When running regressions 
> overnight, quite frequently the HMaster process will die due to timeouts if 
> the workstation is particularly busy. This sometimes causes HBase errors 
> during the tests but more often causes hangs. It would be nice to have a tool 
> that will monitor HMaster and if it goes away, try to restart it. It has been 
> observed that restarting it often resolves the hangs, allowing the regression 
> run to continue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to