Brian Hulette created BEAM-11472:
------------------------------------
Summary: taxi_dataframe example broken
Key: BEAM-11472
URL: https://issues.apache.org/jira/browse/BEAM-11472
Project: Beam
Issue Type: Bug
Components: sdk-py-core
Reporter: Brian Hulette
Assignee: Robert Bradshaw
Fix For: 2.27.0
I just tried running the taxi_dataframe example at HEAD on Dataflow and found
that it fails with the following error:
{code}
INFO:apache_beam.runners.dataflow.dataflow_runner:2020-12-15T16:27:55.653Z:
JOB_MESSAGE_ERROR: Traceback (most recent call last):
File "apache_beam/runners/common.py", line 1214, in
apache_beam.runners.common.DoFnRunner.process
File "apache_beam/runners/common.py", line 721, in
apache_beam.runners.common.PerWindowInvoker.invoke_process
File "apache_beam/runners/common.py", line 753, in
apache_beam.runners.common.PerWindowInvoker._should_process_window_for_sdf
File "apache_beam/runners/common.py", line 548, in
apache_beam.runners.common.DoFnInvoker.invoke_create_tracker
File "/usr/local/lib/python3.8/site-packages/apache_beam/dataframe/io.py",
line 313, in create_tracker
tracker = beam.io.restriction_trackers.OffsetRestrictionTracker(restriction)
File
"/usr/local/lib/python3.8/site-packages/apache_beam/io/restriction_trackers.py",
line 91, in __init__
assert isinstance(offset_range, OffsetRange)
AssertionError
{code}
https://github.com/apache/beam/pull/13443 Seems like the most likely culprit,
and I confirmed that running with Beam before that commit (at
bd825f574e342cfa83fb09767c7d5a19a3accc55) does not fail.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)