Vinoth Govindarajan created HUDI-2438:
-----------------------------------------

             Summary: [Umbrella] [RFC-34] Implement BigQuerySyncTool for 
BigQuery Sync
                 Key: HUDI-2438
                 URL: https://issues.apache.org/jira/browse/HUDI-2438
             Project: Apache Hudi
          Issue Type: New Feature
          Components: Common Core
            Reporter: Vinoth Govindarajan
            Assignee: Vinoth Govindarajan
             Fix For: 0.10.0


BigQuery is Google Cloud's fully managed, petabyte-scale, and cost-effective 
analytics data warehouse that lets you run analytics over vast amounts of data 
in near real-time. BigQuery currently [doesn’t 
support|https://cloud.google.com/bigquery/external-data-cloud-storage] Apache 
Hudi file format, but it has support for the Parquet file format. The proposal 
is to implement a BigQuerySync similar to HiveSync to sync the Hudi table as 
the BigQuery External Parquet table so that users can query the Hudi tables 
using BigQuery. Uber is already syncing some of its Hudi tables to BigQuery 
data mart this will help them to write, sync, and query.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to