[ 
https://issues.apache.org/jira/browse/BEAM-9979?focusedWorklogId=472973&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-472973
 ]

ASF GitHub Bot logged work on BEAM-9979:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 20/Aug/20 16:53
            Start Date: 20/Aug/20 16:53
    Worklog Time Spent: 10m 
      Work Description: boyuanzz commented on a change in pull request #12639:
URL: https://github.com/apache/beam/pull/12639#discussion_r474135224



##########
File path: sdks/java/harness/src/test/java/org/apache/beam/fn/harness/BeamFnDataReadRunnerTest.java
##########
@@ -352,6 +354,35 @@ public void testRegistration() {
       fail("Expected registrar not found.");
     }
 
+    @Test
+    public void testSplittingBeforeStartBundle() throws Exception {
+      List<WindowedValue<String>> outputValues = new ArrayList<>();
+      BeamFnDataReadRunner<String> readRunner =
+          createReadRunner(outputValues::add, PTRANSFORM_ID, mockBeamFnDataClient);
+      // The split should happen at 5 since the allowedSplitPoints is empty.
+      assertEquals(

Review comment:
       I may be misunderstanding something here, but I thought we don't 
perform splitting if the bundle has not been started.




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 472973)
    Time Spent: 1h 20m  (was: 1h 10m)

> Fix race condition where the read index may be reported from the last executed 
> bundle
> ------------------------------------------------------------------------------------
>
>                 Key: BEAM-9979
>                 URL: https://issues.apache.org/jira/browse/BEAM-9979
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-java-harness
>    Affects Versions: 2.21.0, 2.22.0, 2.23.0, 2.24.0
>            Reporter: Luke Cwik
>            Assignee: Luke Cwik
>            Priority: P3
>             Fix For: 2.25.0
>
>          Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> When the BeamFnDataReadRunner is reused, there is a short period of time when 
> a progress request could happen before the start function is called. Because 
> the start function is what resets the read index to -1, such a request can 
> report the read index left over from the last executed bundle.
> I believe there should be a separate way to *reset* an operator before it 
> gets added to the set of cached bundle processors, instead of placing 
> clean-up in any *start* functions that those operators may rely on. That 
> would prevent exposing stale details of those operators before *start* has 
> been invoked.
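
The idea in the description above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual BeamFnDataReadRunner code or the fix made in the pull request: the class ReusableReader and its methods start, accept, progress and reset are hypothetical names chosen for the example. The point it shows is that clean-up lives in reset, which runs before the instance goes back into the cache, so a progress request arriving before the next start sees -1 rather than the previous bundle's index.

```java
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical stand-in for a reusable operator. This is NOT the real
// BeamFnDataReadRunner; all names here are assumptions for illustration.
class ReusableReader {
  private static final long NOT_STARTED = -1L;

  // Index of the last element read in the current bundle, or -1 if none.
  private final AtomicLong readIndex = new AtomicLong(NOT_STARTED);

  // Called at the start of each bundle. Deliberately performs no clean-up,
  // so correctness does not depend on start() having run yet.
  void start() {}

  // Called once per element consumed during a bundle.
  void accept(String element) {
    readIndex.incrementAndGet();
  }

  // May be called from another thread at any time, including in the window
  // between reuse of this instance and the next call to start().
  long progress() {
    return readIndex.get();
  }

  // Called before this instance is returned to the cache of reusable
  // processors, so no state from the finished bundle remains visible.
  void reset() {
    readIndex.set(NOT_STARTED);
  }
}
```

With this shape, a caller would invoke reset() after finishing a bundle and before caching the instance, and the next start() needs no clean-up of its own.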



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
