[ 
https://issues.apache.org/jira/browse/BEAM-9979?focusedWorklogId=473025&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-473025
 ]

ASF GitHub Bot logged work on BEAM-9979:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 20/Aug/20 18:25
            Start Date: 20/Aug/20 18:25
    Worklog Time Spent: 10m 
      Work Description: lukecwik commented on a change in pull request #12639:
URL: https://github.com/apache/beam/pull/12639#discussion_r474185716



##########
File path: sdks/java/harness/src/test/java/org/apache/beam/fn/harness/BeamFnDataReadRunnerTest.java
##########
@@ -352,6 +354,35 @@ public void testRegistration() {
       fail("Expected registrar not found.");
     }
 
+    @Test
+    public void testSplittingBeforeStartBundle() throws Exception {
+      List<WindowedValue<String>> outputValues = new ArrayList<>();
+      BeamFnDataReadRunner<String> readRunner =
+          createReadRunner(outputValues::add, PTRANSFORM_ID, mockBeamFnDataClient);
+      // The split should happen at 5 since the allowedSplitPoints is empty.
+      assertEquals(

Review comment:
       We always did before as well, since `registerInputLocation` doesn't mean 
we have seen any elements yet, just that we are eligible to receive them. This 
slightly widens the window before the first element is received.




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 473025)
    Time Spent: 1h 50m  (was: 1h 40m)

> Fix race condition where the read index may be reported from the last executed 
> bundle
> ------------------------------------------------------------------------------------
>
>                 Key: BEAM-9979
>                 URL: https://issues.apache.org/jira/browse/BEAM-9979
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-java-harness
>    Affects Versions: 2.21.0, 2.22.0, 2.23.0, 2.24.0
>            Reporter: Luke Cwik
>            Assignee: Luke Cwik
>            Priority: P3
>             Fix For: 2.25.0
>
>          Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> When the BeamFnDataReadRunner is reused, there is a short period of time when 
> a progress request could happen before the start function is called, which 
> is what resets the read index to -1.
> I believe there should be a separate way to *reset* an operator before it gets 
> added to the set of cached bundle processors, instead of placing clean-up in 
> the *start* functions that those operators rely on. This would prevent 
> exposing stale details of those operators before *start* has been invoked.
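
The reset-before-caching idea described above can be illustrated with a minimal sketch. This is not the actual BeamFnDataReadRunner code: `CachedReader`, `onElement`, `progress`, and `reset` are hypothetical names chosen for illustration, and the only detail taken from the issue is that -1 denotes "no element read yet".

```java
/** Hypothetical stand-in for a reusable read operator; not the real BeamFnDataReadRunner. */
final class CachedReader {
  // Index of the last element read in the current bundle; -1 means none yet.
  private long index = -1;

  /** Begins a bundle. Deliberately does no clean-up, so nothing depends on it having run. */
  synchronized void start() {}

  /** Records that one more element was read. */
  synchronized void onElement() {
    index++;
  }

  /** May be called from another thread at any time, including before start(). */
  synchronized long progress() {
    return index;
  }

  /** Clears per-bundle state; intended to run before the reader goes back into the cache. */
  synchronized void reset() {
    index = -1;
  }
}

final class ReadIndexResetSketch {
  public static void main(String[] args) {
    CachedReader reader = new CachedReader();
    reader.start();
    reader.onElement();
    reader.onElement();
    System.out.println(reader.progress()); // 1: two elements read, at indices 0 and 1

    // If the reader were cached at this point, a progress request arriving
    // before the next start() would still see the previous bundle's index.
    reader.reset(); // reset before caching instead
    System.out.println(reader.progress()); // -1, even though start() has not run again
  }
}
```

With the clean-up in `reset` rather than `start`, a progress request that arrives between reuse and `start` cannot observe the previous bundle's index, assuming `reset` always runs before the reader is cached.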



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
