sivabalan narayanan created HUDI-7101:
-----------------------------------------
Summary: File slice instantiation for MDT file groups
Key: HUDI-7101
URL: https://issues.apache.org/jira/browse/HUDI-7101
Project: Apache Hudi
Issue Type: Improvement
Components: metadata
Reporter: sivabalan narayanan
here is what a typical file group instantiation of MDT partition looks like
t10: create a dummy commit w/ base commit time "0000000".
So this will create a log file w/ dummy delete block.
Immediately following this, we take the bulk_insert which will create a new
file slice but w/ same commit time.
base_file_00000.parquet.
Theoretically, these both belong to diff file slices and when latest snapshot
is read, only latest base file should be read. but as of now, we consider the
log file also as latest and read it. Since its dummy delete log block, there is
no correctness issue here.
Just some code clean up is required.
this is an issue only w/ a fresh table.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)