[
https://issues.apache.org/jira/browse/HUDI-1?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-1:
------------------------------
Labels: pull-request-available (was: )
> Design and Implement embedded timeline service to cache filesystem view to
> reduce listStatus calls
> --------------------------------------------------------------------------------------------------
>
> Key: HUDI-1
> URL: https://issues.apache.org/jira/browse/HUDI-1
> Project: Apache Hudi
> Issue Type: Improvement
> Components: performance, writer-core
> Reporter: Balaji Varadarajan
> Assignee: Balaji Varadarajan
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.5.0
>
> Original Estimate: 1,008h
> Remaining Estimate: 1,008h
>
> Currently, Hudi writers repeatedly list partitions to create file-system
> views in executors. This task addresses the reductions in listStatus
> name-node calls in Hudi 2.0 writers by taking advantage of MVCC view of HUDI
> and caching file-system view and reusing them.
> An embedded file-system view server on driver will be preloaded with the
> view. It will act as a cache and service File-system view calls from
> executors.
>
> [https://github.com/uber/hudi/issues/433]
> https://github.com/uber/hudi/issues/269
>
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)