[
https://issues.apache.org/jira/browse/BEAM-9827?focusedWorklogId=427637&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-427637
]
ASF GitHub Bot logged work on BEAM-9827:
----------------------------------------
Author: ASF GitHub Bot
Created on: 27/Apr/20 13:28
Start Date: 27/Apr/20 13:28
Worklog Time Spent: 10m
Work Description: dmvk commented on a change in pull request #11533:
URL: https://github.com/apache/beam/pull/11533#discussion_r415812145
##########
File path:
runners/flink/src/main/java/org/apache/beam/runners/flink/translation/wrappers/streaming/state/FlinkStateInternals.java
##########
@@ -76,8 +76,8 @@
private final KeyedStateBackend<ByteBuffer> flinkStateBackend;
private Coder<K> keyCoder;
- // Combined watermark holds for all keys of this partition
- private final Map<String, Instant> watermarkHolds = new HashMap<>();
+ // Watermark holds for all keys/windows of this partition
+ private final PriorityQueue<Long> watermarkHolds = new PriorityQueue<>();
Review comment:
👍 that makes sense
It would be nice if we could get rid of the `pq.remove(...)` calls, as these
are O(n) and the number of keys may be fairly large. How about using
`TreeMap<Long, Integer>` instead, where the value would be the number of
references to that particular offset?
This should have O(log n) characteristics and may hopefully deduplicate some
of the entries.
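A minimal sketch of the reference-counting idea, to make the suggestion concrete. The class name `WatermarkHoldCounter` and its methods are hypothetical and are not part of Beam or of this pull request; the sketch only assumes that holds are tracked as `long` millisecond timestamps, as in the `PriorityQueue<Long>` above.

```java
import java.util.OptionalLong;
import java.util.TreeMap;

/** Hypothetical helper (not Beam code): a reference-counted set of watermark holds. */
final class WatermarkHoldCounter {
  // Key: hold timestamp in millis. Value: number of keys/windows sharing that hold.
  private final TreeMap<Long, Integer> counts = new TreeMap<>();

  /** Registers one more reference to the given hold; O(log n). */
  void add(long holdMillis) {
    counts.merge(holdMillis, 1, Integer::sum);
  }

  /**
   * Drops one reference to the given hold; O(log n), unlike the linear-time
   * PriorityQueue#remove(Object). Returns false if the hold was not tracked.
   */
  boolean remove(long holdMillis) {
    Integer current = counts.get(holdMillis);
    if (current == null) {
      return false;
    }
    if (current == 1) {
      counts.remove(holdMillis); // last reference gone, drop the entry
    } else {
      counts.put(holdMillis, current - 1);
    }
    return true;
  }

  /** Returns the smallest hold still referenced, or empty if there is none. */
  OptionalLong min() {
    return counts.isEmpty() ? OptionalLong.empty() : OptionalLong.of(counts.firstKey());
  }

  /** Number of distinct hold timestamps currently tracked. */
  int distinctHolds() {
    return counts.size();
  }
}
```

Identical holds collapse into a single entry with a higher count, which is where the hoped-for deduplication would come from. The trade-off is that reading the minimum goes through `firstKey()` on the tree instead of a constant-time `peek()` on the queue.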
----------------------------------------------------------------
Issue Time Tracking
-------------------
Worklog Id: (was: 427637)
Time Spent: 1.5h (was: 1h 20m)
> Test SplittableDoFnTest#testPairWithIndexBasicBounded is flaky
> --------------------------------------------------------------
>
> Key: BEAM-9827
> URL: https://issues.apache.org/jira/browse/BEAM-9827
> Project: Beam
> Issue Type: Test
> Components: runner-flink
> Reporter: Maximilian Michels
> Assignee: Maximilian Michels
> Priority: Major
> Fix For: 2.21.0
>
> Time Spent: 1.5h
> Remaining Estimate: 0h
>
> Both {{testPairWithIndexBasicUnbounded}} and
> {{testPairWithIndexBasicBounded}} from {{SplittableDoFnTest}} are flaky every
> other run. We need to investigate the cause of this.