[
https://issues.apache.org/jira/browse/BEAM-11587?focusedWorklogId=767481&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-767481
]
ASF GitHub Bot logged work on BEAM-11587:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 07/May/22 01:20
Start Date: 07/May/22 01:20
Worklog Time Spent: 10m
Work Description: TheNeuralBit commented on code in PR #17159:
URL: https://github.com/apache/beam/pull/17159#discussion_r867283114
##########
sdks/python/apache_beam/io/gcp/bigquery.py:
##########
@@ -2525,6 +2526,12 @@ def _get_pipeline_details(unused_elm):
**self._kwargs))
| _PassThroughThenCleanupTempDatasets(project_to_cleanup_pcoll))
+ def get_pcoll_from_schema(table_schema):
+ pcoll_val = apache_beam.io.gcp.bigquery_schema_tools.\
+ produce_pcoll_with_schema(table_schema)
+ return beam.Map(lambda values: pcoll_val(**values)).with_output_types(
Review Comment:
When we met we weren't able to repro the issue in the test locally because
of other auth issues. We did run through some logic like the test in a colab
notebook, and it didn't have any problem. When we observed the pcolleciton in
that case it had the element_type set properly, and we verified that that
element_type was registered to use RowCoder.
I think we need to either get the test running locally and check those same
data points, or do that via print() debugging on jenkins (as a last resort).
Issue Time Tracking
-------------------
Worklog Id: (was: 767481)
Time Spent: 7h 40m (was: 7.5h)
> Support pd.read_gbq and DataFrame.to_gbq
> ----------------------------------------
>
> Key: BEAM-11587
> URL: https://issues.apache.org/jira/browse/BEAM-11587
> Project: Beam
> Issue Type: New Feature
> Components: dsl-dataframe, io-py-gcp, sdk-py-core
> Reporter: Brian Hulette
> Assignee: Svetak Vihaan Sundhar
> Priority: P3
> Labels: dataframe-api
> Time Spent: 7h 40m
> Remaining Estimate: 0h
>
> We should support
> [read_gbq|https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_gbq.html]
> andÂ
> [to_gbq|https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.to_gbq.html]
> in the DataFrame API when gcp extras are installed.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)