[ https://issues.apache.org/jira/browse/HUDI-1?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Vinoth Chandar updated HUDI-1:
------------------------------
Summary: Design and Implement embedded timeline service to cache filesystem
view to reduce listStatus calls (was: Desgin and Implement embedded timeline
service to cache filesystem view to reduce listStatus calls)
> Design and Implement embedded timeline service to cache filesystem view to
> reduce listStatus calls
> --------------------------------------------------------------------------------------------------
>
> Key: HUDI-1
> URL: https://issues.apache.org/jira/browse/HUDI-1
> Project: Apache Hudi (incubating)
> Issue Type: Improvement
> Components: Performance, Write Client
> Reporter: Balaji Varadarajan
> Assignee: Balaji Varadarajan
> Priority: Major
> Fix For: 0.5.0
>
> Original Estimate: 1,008h
> Remaining Estimate: 1,008h
>
> Currently, Hudi writers repeatedly list partitions to build file-system
> views in executors. This task reduces the number of listStatus name-node
> calls made by Hudi 2.0 writers by taking advantage of Hudi's MVCC view:
> the file-system view is cached and reused instead of being rebuilt.
> An embedded file-system view server on the driver will be preloaded with
> the view. It will act as a cache and serve file-system view calls from
> executors.
>
> https://github.com/uber/hudi/issues/433
> https://github.com/uber/hudi/issues/269
>
>
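The caching idea in the description can be illustrated with a minimal sketch. This is not Hudi's actual timeline service or its API: the class name CachedFileSystemView, the lister function (a stand-in for a listStatus call against the name node), and the partition strings are all hypothetical. The sketch also assumes the view stays valid while it is cached; invalidation when new commits land is left out.

```java
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

// Hypothetical sketch of a driver-side cache; not Hudi's real API.
class CachedFileSystemView {
    // Stand-in for an expensive listStatus call against the name node.
    private final Function<String, List<String>> lister;
    // Partition path -> file listing, filled on first request per partition.
    private final Map<String, List<String>> cache = new ConcurrentHashMap<>();

    CachedFileSystemView(Function<String, List<String>> lister) {
        this.lister = lister;
    }

    // Preload the given partitions so later requests are served from memory.
    void preload(List<String> partitions) {
        partitions.forEach(this::getFiles);
    }

    // Lists a partition at most once; repeat calls return the cached listing.
    List<String> getFiles(String partition) {
        return cache.computeIfAbsent(partition, lister);
    }
}
```

In this sketch, executors would call getFiles through the embedded server rather than listing partitions themselves, so each partition is listed once on the driver regardless of how many executors ask for it.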