[ 
https://issues.apache.org/jira/browse/BEAM-6242?focusedWorklogId=176206&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-176206
 ]

ASF GitHub Bot logged work on BEAM-6242:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 17/Dec/18 19:27
            Start Date: 17/Dec/18 19:27
    Worklog Time Spent: 10m 
      Work Description: tweise commented on issue #7298: [BEAM-6242] Fix 
Portable WordCount PreCommit by caching environment
URL: https://github.com/apache/beam/pull/7298#issuecomment-447967990
 
 
   > > I would prefer we change one thing at a time. There wasn't any change to 
the cache millis in previous PR and I don't see why this would make a 
difference. There is no need to guess though, running the test repeatedly with 
the PR phrase should provide a signal.
   > 
   > The caching is the only fix here.
   > 
   > Parallelism was just added to avoid too much concurrency on Jenkins were 
resources are limited. Parallelism is auto-set form the number of available 
cores in local execution mode.
   
   If parallelism isn't an issue then why is it part of this PR?
   
   Regarding the environment caching: that used to be a concern for batch only 
(due to different scheduling). But this settings has been in place for a long 
time, including when Ankur tested this new pre-commit job. Do you know why this 
now occurs for streaming? Are there any other changes that cause this? Why is 
the cache timeout changed from 10000 to 60000? Did you see issues with the 
batch mode that require this?
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 176206)
    Time Spent: 2h 10m  (was: 2h)

> PreCommit Portable WordCount is flaky
> -------------------------------------
>
>                 Key: BEAM-6242
>                 URL: https://issues.apache.org/jira/browse/BEAM-6242
>             Project: Beam
>          Issue Type: Test
>          Components: runner-flink
>            Reporter: Maximilian Michels
>            Assignee: Maximilian Michels
>            Priority: Major
>          Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
> [https://builds.apache.org/job/beam_PreCommit_Portable_Python_Phrase/42/consoleText]
>  
> {noformat}
> Caused by: java.util.concurrent.ExecutionException: 
> java.lang.RuntimeException: Error received from SDK harness for instruction 
> 533: Traceback (most recent call last):
>   File 
> "/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/sdk_worker.py",
>  line 145, in _execute
>     response = task()
>   File 
> "/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/sdk_worker.py",
>  line 180, in <lambda>
>     self._execute(lambda: worker.do_instruction(work), work)
>   File 
> "/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/sdk_worker.py",
>  line 253, in do_instruction
>     request.instruction_id)
>   File 
> "/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/sdk_worker.py",
>  line 269, in process_bundle
>     bundle_processor.process_bundle(instruction_id)
>   File 
> "/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
>  line 489, in process_bundle
>     ].process_encoded(data.data)
>   File 
> "/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
>  line 126, in process_encoded
>     self.output(decoded_value)
>   File "apache_beam/runners/worker/operations.py", line 182, in 
> apache_beam.runners.worker.operations.Operation.output
>     def output(self, windowed_value, output_index=0):
>   File "apache_beam/runners/worker/operations.py", line 183, in 
> apache_beam.runners.worker.operations.Operation.output
>     cython.cast(Receiver, 
> self.receivers[output_index]).receive(windowed_value)
>   File "apache_beam/runners/worker/operations.py", line 89, in 
> apache_beam.runners.worker.operations.ConsumerSet.receive
>     cython.cast(Operation, consumer).process(windowed_value)
>   File "apache_beam/runners/worker/operations.py", line 497, in 
> apache_beam.runners.worker.operations.DoOperation.process
>     with self.scoped_process_state:
>   File "apache_beam/runners/worker/operations.py", line 498, in 
> apache_beam.runners.worker.operations.DoOperation.process
>     self.dofn_receiver.receive(o)
>   File "apache_beam/runners/common.py", line 680, in 
> apache_beam.runners.common.DoFnRunner.receive
>     self.process(windowed_value)
>   File "apache_beam/runners/common.py", line 686, in 
> apache_beam.runners.common.DoFnRunner.process
>     self._reraise_augmented(exn)
>   File "apache_beam/runners/common.py", line 724, in 
> apache_beam.runners.common.DoFnRunner._reraise_augmented
>     raise_with_traceback(new_exn)
>   File "apache_beam/runners/common.py", line 684, in 
> apache_beam.runners.common.DoFnRunner.process
>     self.do_fn_invoker.invoke_process(windowed_value)
>   File "apache_beam/runners/common.py", line 535, in 
> apache_beam.runners.common.PerWindowInvoker.invoke_process
>     self._invoke_per_window(
>   File "apache_beam/runners/common.py", line 604, in 
> apache_beam.runners.common.PerWindowInvoker._invoke_per_window
>     output_processor.process_outputs(
>   File "apache_beam/runners/common.py", line 755, in 
> apache_beam.runners.common._OutputProcessor.process_outputs
>     def process_outputs(self, windowed_input_element, results):
>   File "apache_beam/runners/common.py", line 770, in 
> apache_beam.runners.common._OutputProcessor.process_outputs
>     for result in results:
>   File "/usr/local/lib/python2.7/site-packages/apache_beam/io/iobase.py", 
> line 1077, in <genexpr>
>     window.TimestampedValue(v, timestamp.MAX_TIMESTAMP) for v in outputs)
>   File 
> "/usr/local/lib/python2.7/site-packages/apache_beam/io/filebasedsink.py", 
> line 321, in finalize_write
>     'Encountered exceptions in finalize_write: %s' % all_exceptions)
> Exception: Encountered exceptions in finalize_write: [IOError(OSError(2, 'No 
> such file or directory'),), IOError(OSError(2, 'No such file or 
> directory'),)] [while running 'write/Write/WriteImpl/FinalizeWrite']
>       at 
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
>       at 
> java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1895)
>       at org.apache.beam.sdk.util.MoreFutures.get(MoreFutures.java:57)
>       at 
> org.apache.beam.runners.fnexecution.control.SdkHarnessClient$ActiveBundle.close(SdkHarnessClient.java:263)
>       at 
> org.apache.beam.runners.flink.translation.wrappers.streaming.ExecutableStageDoFnOperator$SdkHarnessDoFnRunner.finishBundle(ExecutableStageDoFnOperator.java:540)
>       ... 14 more{noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to