Rajesh Balamohan created HIVE-24943:
---------------------------------------

             Summary: Initiator: Short circuit compaction checks on 
tables/partitions when DB isn't eligible for compaction
                 Key: HIVE-24943
                 URL: https://issues.apache.org/jira/browse/HIVE-24943
             Project: Hive
          Issue Type: Improvement
            Reporter: Rajesh Balamohan


When the entire database isn't qualified for compaction due to replication, 
initiator could potentially skip the entire table/partition checks to save HMS 
resources. Currently it runs in tight loop for each table/partition for the 
database which isn't eligible for compaction.

E.g Note that for "delete_orc_10" database repl parameters are enabled. So 
until "ReplUtils.isFirstIncPending(database.getParameters());", it ends up 
considering that entire database as "not chosen for compaction".

Ref: 
[https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Initiator.java#L469]

[https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Initiator.java#L469]

 
{noformat}
+----------------------------------------------------+
|                   createtab_stmt                   |
+----------------------------------------------------+
| CREATE TABLE `delete_orc_10.test_con_s3_part_1`(   |
|   `d_date_id` string,                              |
|   `d_date` date,                                   |
|   `d_dom` int,                                     |
|   `d_year` int)                                    |
| PARTITIONED BY (                                   |
|   `d_date_sk` bigint)                              |
| ROW FORMAT SERDE                                   |
|   'org.apache.hadoop.hive.ql.io.orc.OrcSerde'      |
| STORED AS INPUTFORMAT                              |
|   'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat'  |
| OUTPUTFORMAT                                       |
|   'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat' |
| LOCATION                                           |
|   's3a://blah/delete_orc_10.db/test_con_s3_part_1' |
| TBLPROPERTIES (                                    |
|   'bucketing_version'='2',                         |
|   'transactional'='true',                          |
|   'transactional_properties'='default',            |
|   'transient_lastDdlTime'='1610926861')            |
+----------------------------------------------------+




+----------------------------------------------------+
|                   createdb_stmt                    |
+----------------------------------------------------+
| CREATE DATABASE `delete_orc_10`                    |
| LOCATION                                           |
|   's3a://blah/delete_orc_10.db' |
| WITH DBPROPERTIES (                                |
|   
'hive.repl.ckpt.key'='s3a://blha/loc/dHBjZHNfYmluX3BhcnRpdGlvbmVkX29yY18xMDAwMA==/7f1c5529-e17a-4750-bf16-a9861c9589dc/hive',
  |
|   'hive.repl.first.inc.pending'='true',            |
|   'repl.last.id'='95653')                          |
+----------------------------------------------------+

{noformat}
{noformat}
<14>1 2021-03-25T23:42:50.095Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.CompactorThread" level="INFO" thread="Thread-12"] Compaction 
is disabled for database delete_orc_10
<14>1 2021-03-25T23:42:50.095Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.Initiator" level="INFO" thread="Thread-12"] Checking to see if 
we should compact delete_orc_10.test_con_s3_part_1.d_date_sk=2447933
<14>1 2021-03-25T23:42:50.118Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.CompactorThread" level="INFO" thread="Thread-12"] Compaction 
is disabled for database delete_orc_10
<14>1 2021-03-25T23:42:50.118Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.Initiator" level="INFO" thread="Thread-12"] Checking to see if 
we should compact delete_orc_10.test_con_s3_part_debug.d_date_sk=2471951
<14>1 2021-03-25T23:42:50.142Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.CompactorThread" level="INFO" thread="Thread-12"] Compaction 
is disabled for database delete_orc_10
<14>1 2021-03-25T23:42:50.142Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.Initiator" level="INFO" thread="Thread-12"] Checking to see if 
we should compact delete_orc_10.test_con_s3_part_debug_1.d_date_sk=2421767
<14>1 2021-03-25T23:42:50.164Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.CompactorThread" level="INFO" thread="Thread-12"] Compaction 
is disabled for database delete_orc_10
<14>1 2021-03-25T23:42:50.164Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.Initiator" level="INFO" thread="Thread-12"] Checking to see if 
we should compact delete_orc_10.test_con_s3_part_1.d_date_sk=2479669
<14>1 2021-03-25T23:42:50.185Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.CompactorThread" level="INFO" thread="Thread-12"] Compaction 
is disabled for database delete_orc_10
<14>1 2021-03-25T23:42:50.185Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.Initiator" level="INFO" thread="Thread-12"] Checking to see if 
we should compact delete_orc_10.test_con_s3_part_2.d_date_sk=2482553
<14>1 2021-03-25T23:42:50.205Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.CompactorThread" level="INFO" thread="Thread-12"] Compaction 
is disabled for database delete_orc_10
<14>1 2021-03-25T23:42:50.205Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.Initiator" level="INFO" thread="Thread-12"] Checking to see if 
we should compact delete_orc_10.test_con_s3_part_debug_1.d_date_sk=2444638
<14>1 2021-03-25T23:42:50.227Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.CompactorThread" level="INFO" thread="Thread-12"] Compaction 
is disabled for database delete_orc_10
<14>1 2021-03-25T23:42:50.227Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.Initiator" level="INFO" thread="Thread-12"] Checking to see if 
we should compact delete_orc_10.test_con_s3_part_debug_1.d_date_sk=2480095
<14>1 2021-03-25T23:42:50.253Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.CompactorThread" level="INFO" thread="Thread-12"] Compaction 
is disabled for database delete_orc_10
<14>1 2021-03-25T23:42:50.253Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.Initiator" level="INFO" thread="Thread-12"] Checking to see if 
we should compact delete_orc_10.test_con_s3_part_1.d_date_sk=2468447
<14>1 2021-03-25T23:42:50.281Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.CompactorThread" level="INFO" thread="Thread-12"] Compaction 
is disabled for database delete_orc_10
<14>1 2021-03-25T23:42:50.281Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.Initiator" level="INFO" thread="Thread-12"] Checking to see if 
we should compact delete_orc_10.test_con_s3_part_1.d_date_sk=2440357
<14>1 2021-03-25T23:42:50.308Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.CompactorThread" level="INFO" thread="Thread-12"] Compaction 
is disabled for database delete_orc_10
<14>1 2021-03-25T23:42:50.308Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.Initiator" level="INFO" thread="Thread-12"] Checking to see if 
we should compact delete_orc_10.test_con_s3_part_debug_1.d_date_sk=2465904
<14>1 2021-03-25T23:42:50.332Z 
metastore-0.metastore-service.warehouse-1616689513-66lb.svc.cluster.local 
metastore 1 e6cad49f-5d36-4a73-8da1-083e205976ee [mdc@18060 
class="compactor.CompactorThread" level="INFO" thread="Thread-12"] Compaction 
is disabled for database delete_orc_10
{noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to