Brian Hulette created BEAM-11472:
------------------------------------

             Summary: taxi_dataframe example broken
                 Key: BEAM-11472
                 URL: https://issues.apache.org/jira/browse/BEAM-11472
             Project: Beam
          Issue Type: Bug
          Components: sdk-py-core
            Reporter: Brian Hulette
            Assignee: Robert Bradshaw
             Fix For: 2.27.0


I just tried running the taxi_dataframe example at HEAD on Dataflow and found 
that it fails with the following error:

{code}
INFO:apache_beam.runners.dataflow.dataflow_runner:2020-12-15T16:27:55.653Z: JOB_MESSAGE_ERROR: Traceback (most recent call last):
  File "apache_beam/runners/common.py", line 1214, in apache_beam.runners.common.DoFnRunner.process
  File "apache_beam/runners/common.py", line 721, in apache_beam.runners.common.PerWindowInvoker.invoke_process
  File "apache_beam/runners/common.py", line 753, in apache_beam.runners.common.PerWindowInvoker._should_process_window_for_sdf
  File "apache_beam/runners/common.py", line 548, in apache_beam.runners.common.DoFnInvoker.invoke_create_tracker
  File "/usr/local/lib/python3.8/site-packages/apache_beam/dataframe/io.py", line 313, in create_tracker
    tracker = beam.io.restriction_trackers.OffsetRestrictionTracker(restriction)
  File "/usr/local/lib/python3.8/site-packages/apache_beam/io/restriction_trackers.py", line 91, in __init__
    assert isinstance(offset_range, OffsetRange)
AssertionError
{code}

https://github.com/apache/beam/pull/13443 seems like the most likely culprit, 
and I confirmed that running with Beam before that commit (at 
bd825f574e342cfa83fb09767c7d5a19a3accc55) does not fail.
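
For anyone triaging, here is a minimal sketch of the failure mode the traceback points at. It uses stand-in classes rather than the real apache_beam ones, so it runs without Beam installed. The only behavior taken from the traceback is the isinstance assertion at line 91 of restriction_trackers.py. The tuple passed in the failing case is an assumption for illustration only, since the traceback does not say what type the restriction actually had, and try_create_tracker is a hypothetical helper.

```python
# Stand-in classes that mimic only the type check seen in the traceback.
# They are hypothetical simplifications, not the real apache_beam classes.


class OffsetRange:
    """Simplified stand-in for an offset range restriction."""

    def __init__(self, start, stop):
        if start > stop:
            raise ValueError("start must not exceed stop")
        self.start = start
        self.stop = stop


class OffsetRestrictionTracker:
    """Simplified stand-in that reproduces the failing assertion."""

    def __init__(self, offset_range):
        # Same check as the line reported in restriction_trackers.py.
        assert isinstance(offset_range, OffsetRange)
        self._range = offset_range


def try_create_tracker(restriction):
    """Return True if the tracker accepts the restriction, False otherwise."""
    try:
        OffsetRestrictionTracker(restriction)
        return True
    except AssertionError:
        return False


# A restriction of the expected type is accepted.
accepted = try_create_tracker(OffsetRange(0, 10))

# Any other type trips the assertion. The tuple here is an assumed example,
# not the value seen in the failing Dataflow job.
rejected = try_create_tracker((0, 10))
```

If this matches what is happening, the restriction reaching create_tracker in dataframe/io.py is something other than an OffsetRange, and logging type(restriction) there should confirm it.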


