[GitHub] [beam] svetakvsundhar commented on a diff in pull request #17159: [WIP][BEAM-11587] Generate PColl element from TableSchema

GitBox Fri, 06 May 2022 09:01:44 -0700


svetakvsundhar commented on code in PR #17159:
URL: https://github.com/apache/beam/pull/17159#discussion_r866406831



##########
sdks/python/apache_beam/io/gcp/bigquery.py:
##########
@@ -2525,6 +2526,12 @@ def _get_pipeline_details(unused_elm):
                 **self._kwargs))
         | _PassThroughThenCleanupTempDatasets(project_to_cleanup_pcoll))
 
+  def get_pcoll_from_schema(table_schema):
+    pcoll_val = apache_beam.io.gcp.bigquery_schema_tools.\
+        produce_pcoll_with_schema(table_schema)
+    return beam.Map(lambda values: pcoll_val(**values)).with_output_types(

Review Comment:
   @TheNeuralBit, I tried this here, but pickling still fails with an error. 
Note that the same code runs fine in a Colab notebook with the DirectRunner 
(https://colab.sandbox.google.com/drive/1_BkzBwCLB7OjR1jGGzCkxXFHreUZgCZl#revisionId=0BzxzuriHpr5fcXNOWHcvcGlqYTBWWUZxRWdQaFIrVVFYV0Y4PQ&scrollTo=YITcZHkoZR59)

   Could you expand on why we shouldn't be pickling these instances? The JIRA 
description says: ```In general, we shouldn't be pickling these instances, but 
occasionally it may be necessary```
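
   For context, here is a minimal sketch of one common way this kind of pickling error arises. It assumes the generated type is a NamedTuple class built at runtime. `make_row_type`, `try_pickle`, and `GeneratedRow` are hypothetical names for illustration, not the actual `bigquery_schema_tools` code, and this may not be the exact failure in this PR. It uses only the standard-library `pickle`, which serializes classes by reference (module plus name), so a class that is not reachable under its own name at module level cannot be pickled:

```python
import pickle
from typing import NamedTuple


def make_row_type(field_names):
    # Hypothetical stand-in for a schema-to-type helper: builds a NamedTuple
    # class at runtime. The class is named 'GeneratedRow' but is never bound
    # to a module-level attribute of that name.
    return NamedTuple('GeneratedRow', [(name, str) for name in field_names])


def try_pickle(obj):
    # Returns True if the standard-library pickler can serialize obj.
    try:
        pickle.dumps(obj)
        return True
    except (pickle.PicklingError, AttributeError):
        return False


row_type = make_row_type(['id', 'name'])
row = row_type(id='1', name='a')

# Standard pickle looks the class up by module and name; that lookup fails
# here because no module-level name 'GeneratedRow' exists.
picklable = try_pickle(row)
```

   In this sketch `picklable` ends up `False`, even though creating and using `row` works fine in-process. Whether a given runner hits this depends on which pickler it uses and whether elements ever need to be serialized.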


