[ https://issues.apache.org/jira/browse/HIVE-23459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17123614#comment-17123614 ]
Sankar Hariappan edited comment on HIVE-23459 at 6/2/20, 10:38 AM: ------------------------------------------------------------------- Thanks [~pvary] to help us avoid duplicate work! I guess, we can close this ticket as duplicate since [HIVE-23495|https://issues.apache.org/jira/browse/HIVE-23495] seems to handle multiple list calls. was (Author: sankarh): Thanks [~pvary] to help us avoid duplicate work! I guess, we can close this ticket as duplicate since [HIVE-23495|https://issues.apache.org/jira/browse/HIVE-23495] seems to handle duplicate list calls. > Reduce number of listPath calls in AcidUtils::getAcidState > ---------------------------------------------------------- > > Key: HIVE-23459 > URL: https://issues.apache.org/jira/browse/HIVE-23459 > Project: Hive > Issue Type: Improvement > Reporter: Rajesh Balamohan > Assignee: Nishant Goel > Priority: Minor > Attachments: image-2020-05-13-13-57-27-270.png > > > There are atleast 3 places where listPaths is invoked for FS (highlighted in > the follow profile). > !image-2020-05-13-13-57-27-270.png|width=869,height=626! > > Dir caching works mainly for BI strategy and when there are no-delta files. > It would be good to consider reducing number of NN calls to reduce getSplits > time. -- This message was sent by Atlassian Jira (v8.3.4#803005)