[
https://issues.apache.org/jira/browse/HIVE-26432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ádám Szita reassigned HIVE-26432:
---------------------------------
> Improve LlapCacheAwareFs by caching file status information
> -----------------------------------------------------------
>
> Key: HIVE-26432
> URL: https://issues.apache.org/jira/browse/HIVE-26432
> Project: Hive
> Issue Type: Improvement
> Reporter: Ádám Szita
> Assignee: Ádám Szita
> Priority: Major
>
> The current implementation of LlapCacheAwareFs is used to wrap InputStreams
> of non-ORC file formatted file reads, if set up to utilize LLAP caching.
> File content is cached by the calculated file ID and the required offsets
> within the file. This is later served from cache, however LlapCacheAwareFs
> acting as a FileSystem sometimes receives listStatus / getFileStatus calls
> too, which is only proxied to the original FS. If such operation on the
> original FS is slow, e.g. listing on S3, performance will be impacted. (This
> is not the case with how ORC is integrated into LLAP cache as it's not acting
> as a FS)
> I propose we cache the file status information too besides the content.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)