Rajkumar Singh created HIVE-22081:
-------------------------------------

             Summary: Hivemetastore Performance: Compaction Initiator thread 
overwhelmed if no there are too many Table/partitions are eligible for 
compaction 
                 Key: HIVE-22081
                 URL: https://issues.apache.org/jira/browse/HIVE-22081
             Project: Hive
          Issue Type: Improvement
          Components: Transactions
    Affects Versions: 3.1.1
            Reporter: Rajkumar Singh
            Assignee: Rajkumar Singh


if Automatic Compaction is turned on, Initiator thread check for potential 
table/partitions which are eligible for compactions and run some checks in for 
loop before requesting compaction for eligibles. Though initiator thread is 
configured to run at interval 5 min default, in case of many objects it keeps 
on running as these checks are IO intensive and hog cpu.
In the proposed changes, I am planning to do
1. passing less object to for loop by filtering out the objects based on the 
condition which we are checking within the loop.
2. Doing Async call using future to determine compaction type(this is where we 
do FileSystem calls)



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Reply via email to