[ 
https://issues.apache.org/jira/browse/HUDI-4073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-4073:
----------------------------
    Description: In BaseTableMetadata::getAllFilesInPartitions, we construct 
FileStatus instances for the later use of reading files.  This can be deferred, 
or the necessary information can be stored in simple POJO instead of 
FileStatus.  The instantiation of FileStatus instances can also be parallelized.

> Optimize FileStatus construction in file listing from metadata table
> --------------------------------------------------------------------
>
>                 Key: HUDI-4073
>                 URL: https://issues.apache.org/jira/browse/HUDI-4073
>             Project: Apache Hudi
>          Issue Type: Task
>          Components: metadata
>            Reporter: Ethan Guo
>            Assignee: Ethan Guo
>            Priority: Blocker
>             Fix For: 0.12.0
>
>
> In BaseTableMetadata::getAllFilesInPartitions, we construct FileStatus 
> instances for the later use of reading files.  This can be deferred, or the 
> necessary information can be stored in simple POJO instead of FileStatus.  
> The instantiation of FileStatus instances can also be parallelized.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to