[
https://issues.apache.org/jira/browse/BEAM-3736?focusedWorklogId=507841&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-507841
]
ASF GitHub Bot logged work on BEAM-3736:
----------------------------------------
Author: ASF GitHub Bot
Created on: 04/Nov/20 23:15
Start Date: 04/Nov/20 23:15
Worklog Time Spent: 10m
Work Description: tvalentyn commented on pull request #13048:
URL: https://github.com/apache/beam/pull/13048#issuecomment-722025568
I am seeing an error in one of the tests that we run internally with this
change:
```
test_global_fanout (apache_beam.transforms.combiners_test.CombineTest) ...
ERROR
======================================================================
ERROR: test_global_fanout (apache_beam.transforms.combiners_test.CombineTest)
----------------------------------------------------------------------
Traceback (most recent call last):
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/transforms/combiners_test.py",
line 544, in test_global_fanout
assert_that(result, equal_to([49.5]))
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/pipeline.py",
line 568, in __exit__
self.result = self.run()
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/testing/test_pipeline.py",
line 112, in run
False if self.not_use_test_runner_api else test_runner_api))
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/pipeline.py",
line 518, in run
allow_proto_holders=True).run(False)
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/pipeline.py",
line 547, in run
return self.runner.run_pipeline(self, self._options)
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/runners/dataflow/test_dataflow_runner.py",
line 57, in run_pipeline
self).run_pipeline(pipeline, options)
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/runners/dataflow/dataflow_runner.py",
line 495, in run_pipeline
self._check_for_unsupported_features_on_non_portable_worker(pipeline)
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/runners/dataflow/dataflow_runner.py",
line 467, in _check_for_unsupported_features_on_non_portable_worker
pipeline.visit(self.combinefn_visitor())
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/pipeline.py",
line 594, in visit
self._root_transform().visit(visitor, self, visited)
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/pipeline.py",
line 1085, in visit
part.visit(visitor, pipeline, visited)
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/pipeline.py",
line 1085, in visit
part.visit(visitor, pipeline, visited)
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/pipeline.py",
line 1085, in visit
part.visit(visitor, pipeline, visited)
[Previous line repeated 2 more times]
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/pipeline.py",
line 1088, in visit
visitor.visit_transform(self)
File
"/build/work/12791c230bd5926c2b95fef2c4fb1814e38e/google3/runfiles/google3/third_party/py/apache_beam/runners/dataflow/dataflow_runner.py",
line 430, in visit_transform
'CombineFn.setup and CombineFn.teardown are '
ValueError: CombineFn.setup and CombineFn.teardown are not supported with
non-portable Dataflow runner. Please use Dataflow Runner V2 instead.
----------------------------------------------------------------------
Ran 1 test in 1.216s
FAILED (errors=1)
```
I wonder if this broke ValidatesRunner tests on RunnerV1.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 507841)
Time Spent: 10h 20m (was: 10h 10m)
> Add SetUp() and TearDown() for CombineFns
> -----------------------------------------
>
> Key: BEAM-3736
> URL: https://issues.apache.org/jira/browse/BEAM-3736
> Project: Beam
> Issue Type: Improvement
> Components: beam-model, sdk-py-core
> Reporter: Chuan Yu Foo
> Assignee: Kamil Wasilewski
> Priority: P3
> Time Spent: 10h 20m
> Remaining Estimate: 0h
>
> I have a CombineFn that has a large amount of state that needs to be loaded
> once before it can add_input or merge_combiners (for example, the CombineFn
> might load up a large lookup table used for combining).
> Right now, to initialise this state, for each of the methods, I check if the
> state has already been initialised, and if not, I initialise it. It would be
> nice if CombineFn provided a SetUp() method that is called once to initialise
> this state (and a corresponding TearDown() method to clean up this state if
> necessary).
--
This message was sent by Atlassian Jira
(v8.3.4#803005)