[
https://issues.apache.org/jira/browse/BEAM-11833?focusedWorklogId=557796&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-557796
]
ASF GitHub Bot logged work on BEAM-11833:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 25/Feb/21 08:32
Start Date: 25/Feb/21 08:32
Worklog Time Spent: 10m
Work Description: je-ik commented on a change in pull request #14013:
URL: https://github.com/apache/beam/pull/14013#discussion_r582635836
##########
File path: sdks/java/core/src/main/java/org/apache/beam/sdk/io/Read.java
##########
@@ -789,6 +799,7 @@ public CheckpointMark getCheckpointMark() {
private boolean readerHasBeenStarted;
private Cache<Object, UnboundedReader<OutputT>> cachedReaders;
private Coder<UnboundedSourceRestriction<OutputT, CheckpointT>>
restrictionCoder;
+ private Instant lastPreSplitWatermark;
Review comment:
Yes, you are right that under the current implementation it probably plays
no role. On the other hand, I'd like to avoid updating the watermark estimator
to `BoundedWindow.TIMESTAMP_MAX_VALUE` (if that is not what actually came from
the reader). Therefore, I'd propose adding a method `isSplit` to
`UnboundedSourceRestriction` and not updating the watermark estimator if that
is the case (which should then result in returning
`ProcessContinuation.done()`, which should be OK).
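
A minimal sketch of the proposal above, using simplified stand-in types rather than the actual Beam classes. `Restriction`, `WatermarkEstimator`, `Continuation`, and `afterFailedClaim` are all hypothetical names chosen for illustration, timestamps are plain `long` millis, and the non-split branch is only an assumption about what the surrounding code might do:

```java
/** Illustrative sketch only: hypothetical stand-ins, not the real Beam API. */
final class SplitAwareWatermarkSketch {

  /** Stand-in for BoundedWindow.TIMESTAMP_MAX_VALUE (the value is illustrative). */
  static final long TIMESTAMP_MAX_VALUE = Long.MAX_VALUE;

  /** Simplified restriction that remembers whether it was produced by a split. */
  static final class Restriction {
    private final boolean split;
    private final long watermarkMillis;

    Restriction(boolean split, long watermarkMillis) {
      this.split = split;
      this.watermarkMillis = watermarkMillis;
    }

    /** The proposed accessor: true if this restriction is the result of a split. */
    boolean isSplit() {
      return split;
    }

    long getWatermark() {
      return watermarkMillis;
    }
  }

  /** Simplified manual watermark estimator. */
  static final class WatermarkEstimator {
    private long watermarkMillis = Long.MIN_VALUE;

    void setWatermark(long watermarkMillis) {
      this.watermarkMillis = watermarkMillis;
    }

    long currentWatermark() {
      return watermarkMillis;
    }
  }

  /** Stand-in for the continuation signal returned by the processing method. */
  enum Continuation {
    RESUME,
    DONE
  }

  /**
   * Decides what to do once a claim has failed. For a split restriction the
   * estimator is left untouched, so a synthetic TIMESTAMP_MAX_VALUE never
   * reaches it, and processing finishes.
   */
  static Continuation afterFailedClaim(Restriction restriction, WatermarkEstimator estimator) {
    if (restriction.isSplit()) {
      return Continuation.DONE;
    }
    // Assumed behavior for the non-split case: propagate the reader's watermark.
    estimator.setWatermark(restriction.getWatermark());
    return Continuation.RESUME;
  }
}
```

With this shape, a restriction created as `new Restriction(true, TIMESTAMP_MAX_VALUE)` leaves the estimator at its previous value, while a non-split restriction still advances it to the watermark that came from the reader.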
Issue Time Tracking
-------------------
Worklog Id: (was: 557796)
Time Spent: 4h 20m (was: 4h 10m)
> UnboundedSourceAsSDFRestrictionTracker reports incorrect watermark after
> failed claim
> -------------------------------------------------------------------------------------
>
> Key: BEAM-11833
> URL: https://issues.apache.org/jira/browse/BEAM-11833
> Project: Beam
> Issue Type: Bug
> Components: sdk-java-core
> Affects Versions: 2.28.0
> Reporter: Jan Lukavský
> Assignee: Jan Lukavský
> Priority: P1
> Time Spent: 4h 20m
> Remaining Estimate: 0h
>
> After being split by a call to {{trySplit}}, the watermark reported by
> {{UnboundedSourceAsSDFRestrictionTracker.currentRestriction().getWatermark()}}
> is {{BoundedWindow.TIMESTAMP_MAX_VALUE}}, which is incorrect.