[jira] [Commented] (CAMEL-17157) AggregateProcessor, TimeoutMap Restoration and Cluster

Claus Ibsen (Jira) Tue, 02 Nov 2021 12:13:09 -0700


    [ 
https://issues.apache.org/jira/browse/CAMEL-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17437546#comment-17437546
 ]


Claus Ibsen commented on CAMEL-17157:
-------------------------------------

Its not cluster safe, so its not a bug

> AggregateProcessor, TimeoutMap Restoration and Cluster
> ------------------------------------------------------
>
>                 Key: CAMEL-17157
>                 URL: https://issues.apache.org/jira/browse/CAMEL-17157
>             Project: Camel
>          Issue Type: Improvement
>          Components: camel-core
>    Affects Versions: 3.11.3, 3.12.0
>            Reporter: Benjamin BONNET
>            Priority: Major
>
> Hi,
> Consider an aggregate having completion timeout and backed by a persistent 
> repository (e.g. JBCAggregationRepository). When route starts, there is an 
> invocation to     restoreTimeoutMapFromAggregationRepositonry()  
> (AggregatorProcessor, line 877). That method consists in :
> # getting all keys of pending aggregations (i.e. aggregation that were not 
> yet completed when route stopped)
> # iterate on each key to get each row and put row timeout into timeoutmap.
> That works fine when there is only one instance, but if you deploy on a 
> cluster, things may go wrong.
> As a matter of fact, if one instance is warming-up while another is modifying 
> repository, warm-up may fail (NullPointerException) : that occurs when a row 
> has been deleted (because aggregation was completed by a running instance) 
> between 1. and 2. 
> One can imagine another less noisy failure : a row is created by a running 
> instance between 1. and 2. . Then warming-up does not complain, but the new 
> row will not be included in timeout map, which may be an issue if the 
> instance that inserted that row into the repo is stopped before completion 
> (timeout will not be detected).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (CAMEL-17157) AggregateProcessor, TimeoutMap Restoration and Cluster

Reply via email to