[jira] [Updated] (TRAFODION-3318) Change process management of DTM to improve HA behavior

2019-07-24 Thread Gonzalo E Correa (JIRA)


 [ 
https://issues.apache.org/jira/browse/TRAFODION-3318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gonzalo E Correa updated TRAFODION-3318:

Affects Version/s: 2.4
Fix Version/s: 2.4

> Change process management of DTM to improve HA behavior
> ---
>
> Key: TRAFODION-3318
> URL: https://issues.apache.org/jira/browse/TRAFODION-3318
> Project: Apache Trafodion
>  Issue Type: Improvement
>  Components: dtm, foundation
>Affects Versions: 2.4
>Reporter: Gonzalo E Correa
>Priority: Major
> Fix For: 2.4
>
>   Original Estimate: 120h
>  Remaining Estimate: 120h
>
> Current process management model for process type DTM enforces and soft node 
> down behavior which kills all processes in a node where a DTM process 
> terminates abnormally. The DTM process is recreated by the monitor along with 
> all persistent processes hosted in that node.
> To reduce the fault zone impact, this change removes the soft node down/up 
> functionality so that the DTM process is recreated without killing all other 
> processes in the node. The rule where the persistent DTM process cannot be 
> restarted within the configured retries in the specified time window will 
> cause a node down will still be enforced.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Updated] (TRAFODION-3318) Change process management of DTM to improve HA behavior

2019-07-24 Thread Gonzalo E Correa (JIRA)


 [ 
https://issues.apache.org/jira/browse/TRAFODION-3318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gonzalo E Correa updated TRAFODION-3318:

Summary: Change process management of DTM to improve HA behavior  (was: 
Change process management of DTM improve HA behavior)

> Change process management of DTM to improve HA behavior
> ---
>
> Key: TRAFODION-3318
> URL: https://issues.apache.org/jira/browse/TRAFODION-3318
> Project: Apache Trafodion
>  Issue Type: Improvement
>  Components: dtm, foundation
>Reporter: Gonzalo E Correa
>Priority: Major
>   Original Estimate: 120h
>  Remaining Estimate: 120h
>
> Current process management model for process type DTM enforces and soft node 
> down behavior which kills all processes in a node where a DTM process 
> terminates abnormally. The DTM process is recreated by the monitor along with 
> all persistent processes hosted in that node.
> To reduce the fault zone impact, this change removes the soft node down/up 
> functionality so that the DTM process is recreated without killing all other 
> processes in the node. The rule where the persistent DTM process cannot be 
> restarted within the configured retries in the specified time window will 
> cause a node down will still be enforced.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)