[jira] [Updated] (TRAFODION-3318) Change process management of DTM to improve HA behavior

2019-07-24 Thread Gonzalo E Correa (JIRA)


 [ 
https://issues.apache.org/jira/browse/TRAFODION-3318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gonzalo E Correa updated TRAFODION-3318:

Affects Version/s: 2.4
Fix Version/s: 2.4

> Change process management of DTM to improve HA behavior
> ---
>
> Key: TRAFODION-3318
> URL: https://issues.apache.org/jira/browse/TRAFODION-3318
> Project: Apache Trafodion
>  Issue Type: Improvement
>  Components: dtm, foundation
>Affects Versions: 2.4
>Reporter: Gonzalo E Correa
>Priority: Major
> Fix For: 2.4
>
>   Original Estimate: 120h
>  Remaining Estimate: 120h
>
> Current process management model for process type DTM enforces and soft node 
> down behavior which kills all processes in a node where a DTM process 
> terminates abnormally. The DTM process is recreated by the monitor along with 
> all persistent processes hosted in that node.
> To reduce the fault zone impact, this change removes the soft node down/up 
> functionality so that the DTM process is recreated without killing all other 
> processes in the node. The rule where the persistent DTM process cannot be 
> restarted within the configured retries in the specified time window will 
> cause a node down will still be enforced.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Updated] (TRAFODION-3318) Change process management of DTM to improve HA behavior

2019-07-24 Thread Gonzalo E Correa (JIRA)


 [ 
https://issues.apache.org/jira/browse/TRAFODION-3318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gonzalo E Correa updated TRAFODION-3318:

Summary: Change process management of DTM to improve HA behavior  (was: 
Change process management of DTM improve HA behavior)

> Change process management of DTM to improve HA behavior
> ---
>
> Key: TRAFODION-3318
> URL: https://issues.apache.org/jira/browse/TRAFODION-3318
> Project: Apache Trafodion
>  Issue Type: Improvement
>  Components: dtm, foundation
>Reporter: Gonzalo E Correa
>Priority: Major
>   Original Estimate: 120h
>  Remaining Estimate: 120h
>
> Current process management model for process type DTM enforces and soft node 
> down behavior which kills all processes in a node where a DTM process 
> terminates abnormally. The DTM process is recreated by the monitor along with 
> all persistent processes hosted in that node.
> To reduce the fault zone impact, this change removes the soft node down/up 
> functionality so that the DTM process is recreated without killing all other 
> processes in the node. The rule where the persistent DTM process cannot be 
> restarted within the configured retries in the specified time window will 
> cause a node down will still be enforced.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Created] (TRAFODION-3318) Change process management of DTM improve HA behavior

2019-07-24 Thread Gonzalo E Correa (JIRA)
Gonzalo E Correa created TRAFODION-3318:
---

 Summary: Change process management of DTM improve HA behavior
 Key: TRAFODION-3318
 URL: https://issues.apache.org/jira/browse/TRAFODION-3318
 Project: Apache Trafodion
  Issue Type: Improvement
  Components: dtm, foundation
Reporter: Gonzalo E Correa


Current process management model for process type DTM enforces and soft node 
down behavior which kills all processes in a node where a DTM process 
terminates abnormally. The DTM process is recreated by the monitor along with 
all persistent processes hosted in that node.

To reduce the fault zone impact, this change removes the soft node down/up 
functionality so that the DTM process is recreated without killing all other 
processes in the node. The rule where the persistent DTM process cannot be 
restarted within the configured retries in the specified time window will cause 
a node down will still be enforced.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)