[
https://issues.apache.org/jira/browse/BEAM-7266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Fabian updated BEAM-7266:
-------------------------
Summary: Pipeline run does not terminate because of Dataflow runner can not
close file system writer (was: Pipeline run does not terminate because of
Dataflow runner can close file system writer)
> Pipeline run does not terminate because of Dataflow runner can not close file
> system writer
> -------------------------------------------------------------------------------------------
>
> Key: BEAM-7266
> URL: https://issues.apache.org/jira/browse/BEAM-7266
> Project: Beam
> Issue Type: Bug
> Components: io-python-gcp, runner-dataflow
> Affects Versions: 2.11.0
> Reporter: Fabian
> Priority: Major
>
> We are using Apache Beam in version 2.11.0 (Python SDK) with the Dataflow
> runner running on the Google Cloud Platform. Two pipeline runs did not
> terminate, i.e. after multiple days (instead of some minutes) they where
> still running. The only error that was logged is:
> If fails to close a writer:
> {code:java}
> Traceback (most recent call last):
> File
> "/usr/local/lib/python2.7/dist-packages/dataflow_worker/batchworker.py", line
> 649, in do_work
> work_executor.execute()
> File "/usr/local/lib/python2.7/dist-packages/dataflow_worker/executor.py",
> line 178, in execute
> op.finish()
> File "dataflow_worker/native_operations.py", line 93, in
> dataflow_worker.native_operations.NativeWriteOperation.finish
> def finish(self):
> File "dataflow_worker/native_operations.py", line 94, in
> dataflow_worker.native_operations.NativeWriteOperation.finish
> with self.scoped_finish_state:
> File "dataflow_worker/native_operations.py", line 95, in
> dataflow_worker.native_operations.NativeWriteOperation.finish
> self.writer.__exit__(None, None, None)
> File
> "/usr/local/lib/python2.7/dist-packages/dataflow_worker/nativeavroio.py",
> line 277, in __exit__
> self._data_file_writer.close()
> File "/usr/local/lib/python2.7/dist-packages/avro/datafile.py", line 220,
> in close
> self.writer.close()
> File
> "/usr/local/lib/python2.7/dist-packages/apache_beam/io/filesystemio.py", line
> 202, in close
> self._uploader.finish()
> File "/usr/local/lib/python2.7/dist-packages/apache_beam/io/gcp/gcsio.py",
> line 606, in finish
> raise self._upload_thread.last_error # pylint: disable=raising-bad-type
> NotImplementedError{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)