Luke Cwik created BEAM-5782:
-------------------------------
Summary: BigQuery TableRows not cloneable when using Dataflow
Key: BEAM-5782
URL: https://issues.apache.org/jira/browse/BEAM-5782
Project: Beam
Issue Type: Bug
Components: io-java-gcp
Reporter: Luke Cwik
Assignee: Luke Cwik
TableRows are expected to be cloneable.
BigQueryAvroUtils converts repeated records using an ImmutableList which is not
cloneable.
Reproduction steps:
1. Clone the code
[https://github.com/nahuellofeudo/row-clone-poc.git|https://www.google.com/url?q=https://github.com/nahuellofeudo/row-clone-poc.git&sa=D&usg=AFQjCNGkT0bYzhAoozGTQ4vsizxtphxj-g]
2. Run on Dataflow:
mvn clean compile exec:java
-Dexec.mainClass=org.apache.beam.examples.RowClonePoc \
-Dexec.args="--runner=DataflowRunner
--gcpTempLocation=gs://<GCS_TEMP_FOLDER> \
--tempLocation=gs://<GCS_TEMP_FOLDER> --project=<PROJECT_ID>"
-Pdataflow-runner
3. Run locally:
mvn clean compile exec:java
-Dexec.mainClass=org.apache.beam.examples.RowClonePoc \
-Dexec.args="--tempLocation=gs://<GCS_TEMP_FOLDER> --project=<PROJECT_ID>"
4. Job execution on step 2 will fail [1].
5. Job execution on step 3 will success [2].
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)