[
https://issues.apache.org/jira/browse/HUDI-6752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17760631#comment-17760631
]
Ethan Guo edited comment on HUDI-6752 at 8/30/23 9:32 PM:
----------------------------------------------------------
I've created JIRA tickets in the corresponding EPICs and here's the scope:
EPIC HUDI-3217: (15pt P0) Finalize RecordMerger API for use with Java, Python
and other languages.
EPIC HUDI-6243: (17pt) Engine-agnostic FileGroupReader "internal" API, replaces
Spark and Hive reads.
EPIC HUDI-6722: (22pt) Positional update, delete, partial update,
event_time-based merge and custom merger support on read and write paths.
EPIC HUDI-6243: (27pt) Spark MoR Snapshot, Incremental, ReadOptimized, CDC,
TimeTravel queries on new storage format.
was (Author: guoyihua):
I've created JIRA tickets in the corresponding EPICs and here's the scope:
EPIC HUDI-3217:
(15pt P0) Finalize RecordMerger API for use with Java, Python and other
languages.
EPIC HUDI-6243:
(17pt) Engine-agnostic FileGroupReader "internal" API, replaces Spark and Hive
reads.
EPIC HUDI-6722:
(22pt) Positional update, delete, partial update, event_time-based merge and
custom merger support on read and write paths.
EPIC HUDI-6243:
(27pt) Spark MoR Snapshot, Incremental, ReadOptimized, CDC, TimeTravel queries
on new storage format.
> Scope out the work for file group reading and writing with record merging in
> Spark
> ----------------------------------------------------------------------------------
>
> Key: HUDI-6752
> URL: https://issues.apache.org/jira/browse/HUDI-6752
> Project: Apache Hudi
> Issue Type: Task
> Reporter: Ethan Guo
> Assignee: Ethan Guo
> Priority: Major
> Fix For: 1.0.0
>
>