[ https://issues.apache.org/jira/browse/NIFI-11945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17760636#comment-17760636 ]
ASF subversion and git services commented on NIFI-11945:
--------------------------------------------------------
Commit a9ac8fb7073ef89e1f1105775de2fdb56b8da0b3 in nifi's branch
refs/heads/main from p-kimberley
[ https://gitbox.apache.org/repos/asf?p=nifi.git;h=a9ac8fb707 ]
NIFI-11945 Add Cache Entry Identifier property to DeduplicateRecord
This closes #7603
Signed-off-by: David Handermann <[email protected]>
> DeduplicateRecord does not add keys to distributed map cache
> ------------------------------------------------------------
>
> Key: NIFI-11945
> URL: https://issues.apache.org/jira/browse/NIFI-11945
> Project: Apache NiFi
> Issue Type: Bug
> Components: Core Framework
> Affects Versions: 1.23.0
> Environment: Docker
> Reporter: Peter Kimberley
> Priority: Major
> Time Spent: 2h 20m
> Remaining Estimate: 0h
>
> The `DeduplicateRecord` processor supports the use of a distributed map cache
> (DMC).
> After generating the record key, it checks for the existence of that key in
> the cache. It then calls `DistributedMapCacheClientWrapper::put()`, which, in
> this case, is a no-op. Therefore, a cache entry is never written and records
> are always routed to the `non-duplicate` relationship.
> The correct behaviour would be for
> `DistributedMapCacheClientWrapper::contains()` to call
> `DistributedMapCacheClient::putIfAbsent()`, which would atomically check/set
> the key in the target cache.
> An additional problem is an NPE when a DMC is used and the
> `DeduplicateRecord` property `Record Hashing Algorithm` is set to `NONE`.
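
The atomic check-and-set described in the report can be illustrated with a minimal sketch. This is not the NiFi implementation: `SimpleCache`, `InMemoryCache`, and `isDuplicate` are hypothetical names introduced here, and an in-memory `ConcurrentHashMap` stands in for the distributed cache. The real client interface may differ, for example by requiring serializers or throwing I/O exceptions.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

// Hypothetical stand-in for a distributed map cache; not NiFi's actual API.
final class DedupSketch {

    /** Minimal cache abstraction, assumed for illustration only. */
    interface SimpleCache {
        /** Stores value under key only if no entry exists; returns true if the entry was added. */
        boolean putIfAbsent(String key, String value);
    }

    /** In-memory implementation backed by ConcurrentHashMap, whose putIfAbsent is atomic. */
    static final class InMemoryCache implements SimpleCache {
        private final ConcurrentMap<String, String> entries = new ConcurrentHashMap<>();

        @Override
        public boolean putIfAbsent(String key, String value) {
            // ConcurrentMap.putIfAbsent returns null when no previous mapping existed.
            return entries.putIfAbsent(key, value) == null;
        }
    }

    /**
     * Returns true when recordKey was already cached, i.e. the record is a duplicate.
     * The check and the write happen in one call, so a key can never be
     * "seen but not stored", which is the failure mode described in the report.
     */
    static boolean isDuplicate(SimpleCache cache, String recordKey, String entryId) {
        return !cache.putIfAbsent(recordKey, entryId);
    }

    public static void main(String[] args) {
        SimpleCache cache = new InMemoryCache();
        System.out.println(isDuplicate(cache, "key-a", "entry-1")); // first sighting
        System.out.println(isDuplicate(cache, "key-a", "entry-2")); // same key again
        System.out.println(isDuplicate(cache, "key-b", "entry-3")); // different key
    }
}
```

Running `main` prints `false`, `true`, `false`: the first record with a given key is treated as non-duplicate and stored, and any later record with the same key is flagged as a duplicate.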