[ 
https://issues.apache.org/jira/browse/BEAM-7437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16853808#comment-16853808
 ] 

Tanay Tummalapalli edited comment on BEAM-7437 at 6/1/19 8:08 PM:
------------------------------------------------------------------

I am seeing consistent failures in the BigQuery integration tests on the 
Dataflow runner, in the classes `BigQueryFileLoadsIT` and 
`BigQueryStreamingInsertTransformIntegrationTests`. They seem to be related to 
the recent b64 changes. The traceback is:

{code:java}
  File "/usr/local/lib/python2.7/dist-packages/dataflow_worker/batchworker.py", line 649, in do_work
    work_executor.execute()
  File "/usr/local/lib/python2.7/dist-packages/dataflow_worker/executor.py", line 150, in execute
    test_shuffle_sink=self._test_shuffle_sink)
  File "/usr/local/lib/python2.7/dist-packages/dataflow_worker/executor.py", line 116, in create_operation
    is_streaming=False)
  File "apache_beam/runners/worker/operations.py", line 962, in apache_beam.runners.worker.operations.create_operation
    op = BatchGroupAlsoByWindowsOperation(
  File "dataflow_worker/shuffle_operations.py", line 219, in dataflow_worker.shuffle_operations.BatchGroupAlsoByWindowsOperation.__init__
    self.windowing = deserialize_windowing_strategy(self.spec.window_fn)
  File "dataflow_worker/shuffle_operations.py", line 207, in dataflow_worker.shuffle_operations.deserialize_windowing_strategy
    return pickler.loads(serialized_data)
  File "/usr/local/lib/python2.7/dist-packages/apache_beam/internal/pickler.py", line 248, in loads
    c = base64.b64decode(encoded)
  File "/usr/lib/python2.7/base64.py", line 78, in b64decode
    raise TypeError(msg)
TypeError: Incorrect padding
{code}
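
For context, here is a minimal sketch of how `base64.b64decode` produces this error. It is not taken from the Beam code base: the `try_decode` helper and the payload are hypothetical, and it only illustrates one common trigger (input whose length is no longer a multiple of 4). Whether that is what happens to the serialized windowing strategy here is not yet known.

```python
import base64
import binascii


def try_decode(encoded):
    """Return the decoded bytes, or None if the input is not valid base64."""
    try:
        return base64.b64decode(encoded)
    except (TypeError, binascii.Error):
        # Python 2.7 raises TypeError("Incorrect padding");
        # Python 3 raises binascii.Error for the same condition.
        return None


# Hypothetical payload standing in for a pickled object.
payload = base64.b64encode(b"windowing strategy")

# Drop one character so the length is no longer a multiple of 4.
truncated = payload[:-1]

print(try_decode(payload))
print(try_decode(truncated))
```

The intact payload decodes normally, while the truncated one fails to decode, so a string that is altered between the encoding side and the decoding side would be one possible explanation to check.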

The failure occurs only on the master branch. I also tested the 2.13.0RC2 
branch, and it passed.

I need to investigate this further. 


> Integration Test for BQ streaming inserts for streaming pipelines
> -----------------------------------------------------------------
>
>                 Key: BEAM-7437
>                 URL: https://issues.apache.org/jira/browse/BEAM-7437
>             Project: Beam
>          Issue Type: Test
>          Components: io-python-gcp
>    Affects Versions: 2.12.0
>            Reporter: Tanay Tummalapalli
>            Assignee: Tanay Tummalapalli
>            Priority: Minor
>
> Integration Test for BigQuery Sink using Streaming Inserts for streaming 
> pipelines.
> Integration tests currently exist for batch pipelines; they can also be added 
> for streaming pipelines using TestStream. This will be a precursor to the 
> failing integration test to be added for 
> [BEAM-6611|https://issues.apache.org/jira/browse/BEAM-6611].


