[
https://issues.apache.org/jira/browse/HUDI-53?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17376291#comment-17376291
]
ASF GitHub Bot commented on HUDI-53:
------------------------------------
prashantwason commented on pull request #2487:
URL: https://github.com/apache/hudi/pull/2487#issuecomment-875308439
This is a very comprehensive implementation for the record-level index.
There are some changes required to the current Metadata Table design to
enable record-level-index:
1. Synchronous updates of metadata table
2. Performance improvement for multi key-lookups (requires for tagLocation)
3. In-line file system for point lookup of keys from HFile Blocks in log
files so we dont have to load the entire log block in memory
I am working on these changes and should be complete before end of July.
@nsivabalan I will be happy to collaborate with your to take this draft
towards the Hoodie Metadata Table based record-level-index.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
> Implement Record level Index to map a record key to a <partition path,
> FileID> pair #90
> ---------------------------------------------------------------------------------------
>
> Key: HUDI-53
> URL: https://issues.apache.org/jira/browse/HUDI-53
> Project: Apache Hudi
> Issue Type: Improvement
> Components: Writer Core
> Affects Versions: 0.9.0
> Reporter: Vinoth Chandar
> Assignee: sivabalan narayanan
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.9.0
>
>
> [https://github.com/uber/hudi/issues/90]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)