[
https://issues.apache.org/jira/browse/HIVE-17423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16973893#comment-16973893
]
Tak-Lon (Stephen) Wu edited comment on HIVE-17423 at 11/14/19 2:54 AM:
-----------------------------------------------------------------------
ignore my previous comment, HIVE-20127 fixed the LLAP with parquet cache issues
and thanks
was (Author: taklwu):
ignore my previous comment, HIVE-20127 fixed it and thanks
> LLAP Parquet caching - support file ID in splits
> ------------------------------------------------
>
> Key: HIVE-17423
> URL: https://issues.apache.org/jira/browse/HIVE-17423
> Project: Hive
> Issue Type: Bug
> Reporter: Sergey Shelukhin
> Priority: Major
>
> To get LLAP cache data one needs a file ID which is either an HDFS inode ID,
> or a composite of path, modification time and size. These can be embedded
> into splits for ORC, cause in particular for the former it's possible to get
> the IDs as a part of a normal file enumeration that split generation performs
> anyway.
> If they are missing, the IDs need to be obtained for every file on the
> fragment side.
> We should explore adding file IDs to Parquet splits when the cache is enabled.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)