Zhijing Lu created COMDEV-512:
---------------------------------
Summary: [GSoC][Doris] Supports BigQuery/Apache Kudu/Apache
Cassandra/Apache Druid in Federated Queries
Key: COMDEV-512
URL: https://issues.apache.org/jira/browse/COMDEV-512
Project: Community Development
Issue Type: Task
Components: GSoC/Mentoring ideas
Reporter: Zhijing Lu
*Apache Doris*
Apache Doris is a real-time analytical database based on MPP architecture. As a
unified platform that supports multiple data processing scenarios, it ensures
high performance for low-latency and high-throughput queries, allows for easy
federated queries on data lakes, and supports various data ingestion methods.
Page: https://doris.apache.org
Github: [https://github.com/apache/doris]
h3. *Background*
Apache Doris supports acceleration of queries on external data sources to meet
users' needs for federated queries and analysis.
Currently, Apache Doris supports multiple external catalogs including those
from Hive, Iceberg, Hudi, and JDBC. Developers can connect more data sources to
Apache Doris based on a unified framework.
h4. *Objective*
* Enable Apache Doris to access one or more of these data sources via the
Multi-Catalog feature: BigQuery/Kudu/Cassandra/Druid.
* Compile relevant documentation. See an example here:
[https://doris.apache.org/docs/dev/lakehouse/multi-catalog/hive]
h4. *Task*
{*}Phase One{*}:
* Get familiar with the Multi-Catalog structure of Apache Doris, including the
metadata synchronization mechanism in FE and the data reading mechanism of BE.
* Investigate how metadata should be acquired and how data access works for
the chosen data source(s); produce the corresponding design documentation.
{*}Phase Two{*}:
* Develop connections to the picked data source(s) and implement access to
metadata and data.
h3. *Learning Material*
{*}Page{*}: [https://doris.apache.org|https://doris.apache.org/]
{*}Github{*}: [https://github.com/apache/doris]
h3. Mentor
* Mentor: Mingyu Chen, Apache Doris PMC Member & Committer,
[[email protected] |mailto:[email protected]]
* Mentor: Calvin Kirs, Apache DolphinScheduler PMC Member & Committer,
[[email protected]|mailto:[email protected]]
* Mailing List: [email protected]