LiuYuan created IMPALA-13160:
--------------------------------

             Summary: Impala query stuck after query from special partition 
'hour=0' and 'hour=0' which hour type is int
                 Key: IMPALA-13160
                 URL: https://issues.apache.org/jira/browse/IMPALA-13160
             Project: IMPALA
          Issue Type: Bug
          Components: Catalog, fe
    Affects Versions: Impala 4.3.0, Impala 3.4.0
            Reporter: LiuYuan


1.When create table as below:
{code:java}
 CREATE TABLE hive_partition.two_partition (               
   id INT,                                                 
   name STRING                                             
 )                                                         
 PARTITIONED BY (                                          
   day INT,                                                
   hour INT                                                
 )                                                         
 WITH SERDEPROPERTIES ('serialization.format'='1')         
 STORED AS ORC                                             
 LOCATION 'hdfs://ly-pfs/hive/hive_partition/two_partition'{code}
2.Then create dir as below:

 
{code:java}
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=0
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=00
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=01
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=02
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=03
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=04
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=05
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=06
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=07
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=08
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=09
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=1
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=10
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=11
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=12
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=13
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=14
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=15
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=16
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=17
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=18
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=19
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=2
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=20
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=21
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=22
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=23
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=3
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=4
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=5
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=6
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=7
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=8
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=9{code}
 

3. Execute Refresh hive_partition.two_partition more times

on Impala 3.4.0, total parititons grow after refresh, partitions grows from 34 
to 74 after refresh three times

 
{code:java}
I0617 17:01:36.244355 18605 CatalogServiceCatalog.java:2225] Refreshing table 
metadata: hive_partition.two_partition
I0617 17:01:38.033699 18605 HdfsTable.java:995] Reloading metadata for table 
definition and all partition(s) of hive_partition.two_partition (REFRESH issued 
by root)
I0617 17:01:39.245016 18605 ParallelFileMetadataLoader.java:147] Loading file 
and block metadata for 10 paths for table hive_partition.two_partition using a 
thread pool of size 10
I0617 17:01:39.336242 18605 HdfsTable.java:690] Loaded file and block metadata 
for hive_partition.two_partition partitions: day=20240613/hour=0, 
day=20240613/hour=1, day=20240613/hour=2, and 7 others. Time taken: 91.234ms
I0617 17:01:39.336658 18605 ParallelFileMetadataLoader.java:147] Refreshing 
file and block metadata for 34 paths for table hive_partition.two_partition 
using a thread pool of size 20
I0617 17:01:39.435528 18605 HdfsTable.java:690] Loaded file and block metadata 
for hive_partition.two_partition partitions: day=20240613/hour=0, 
day=20240613/hour=0, day=20240613/hour=2, and 51 others. Time taken: 99.075ms
I0617 17:01:39.435572 18605 HdfsTable.java:1026] Incrementally loaded table 
metadata for: hive_partition.two_partition
I0617 17:01:39.450933 18605 CatalogServiceCatalog.java:2249] Refreshed table 
metadata: hive_partition.two_partition
I0617 17:01:40.930032 18605 CatalogServiceCatalog.java:2225] Refreshing table 
metadata: hive_partition.two_partition
I0617 17:01:40.942590 18605 HdfsTable.java:995] Reloading metadata for table 
definition and all partition(s) of hive_partition.two_partition (REFRESH issued 
by root)
I0617 17:01:41.086898 18605 ParallelFileMetadataLoader.java:147] Loading file 
and block metadata for 10 paths for table hive_partition.two_partition using a 
thread pool of size 10
I0617 17:01:41.360704 18605 HdfsTable.java:690] Loaded file and block metadata 
for hive_partition.two_partition partitions: day=20240613/hour=0, 
day=20240613/hour=1, day=20240613/hour=2, and 7 others. Time taken: 273.775ms
I0617 17:01:41.361122 18605 ParallelFileMetadataLoader.java:147] Refreshing 
file and block metadata for 34 paths for table hive_partition.two_partition 
using a thread pool of size 20
I0617 17:01:41.494148 18605 HdfsTable.java:690] Loaded file and block metadata 
for hive_partition.two_partition partitions: day=20240613/hour=0, 
day=20240613/hour=0, day=20240613/hour=2, and 61 others. Time taken: 133.243ms
I0617 17:01:41.494203 18605 HdfsTable.java:1026] Incrementally loaded table 
metadata for: hive_partition.two_partition
I0617 17:01:41.494494 18605 CatalogServiceCatalog.java:2249] Refreshed table 
metadata: hive_partition.two_partition
I0617 17:01:42.312086 18605 CatalogServiceCatalog.java:2225] Refreshing table 
metadata: hive_partition.two_partition
I0617 17:01:42.326767 18605 HdfsTable.java:995] Reloading metadata for table 
definition and all partition(s) of hive_partition.two_partition (REFRESH issued 
by root)
I0617 17:01:42.516573 18605 ParallelFileMetadataLoader.java:147] Loading file 
and block metadata for 10 paths for table hive_partition.two_partition using a 
thread pool of size 10
I0617 17:01:42.708683 18605 HdfsTable.java:690] Loaded file and block metadata 
for hive_partition.two_partition partitions: day=20240613/hour=0, 
day=20240613/hour=1, day=20240613/hour=2, and 7 others. Time taken: 192.084ms
I0617 17:01:42.709103 18605 ParallelFileMetadataLoader.java:147] Refreshing 
file and block metadata for 34 paths for table hive_partition.two_partition 
using a thread pool of size 20
I0617 17:01:42.828385 18605 HdfsTable.java:690] Loaded file and block metadata 
for hive_partition.two_partition partitions: day=20240613/hour=0, 
day=20240613/hour=0, day=20240613/hour=2, and 71 others. Time taken: 
119.517ms{code}
 

on impala 4.3.0, select query drop into  dead loop to  prioritized load 
metadata:

 
{code:java}
E0617 17:07:01.926093 4158610 ImpaladCatalog.java:264] Error adding catalog 
object: Error applying incremental updates on table 
hive_partition.two_partition: missing partition ids true, stale partition ids 
false
Java exception follows:
org.apache.impala.catalog.TableLoadingException: Error applying incremental 
updates on table hive_partition.two_partition: missing partition ids true, 
stale partition ids false
        at 
org.apache.impala.catalog.HdfsTable.validatePartitions(HdfsTable.java:2066)
        at 
org.apache.impala.catalog.ImpaladCatalog.addTable(ImpaladCatalog.java:545)
        at 
org.apache.impala.catalog.ImpaladCatalog.addCatalogObject(ImpaladCatalog.java:334)
        at 
org.apache.impala.catalog.ImpaladCatalog.updateCatalog(ImpaladCatalog.java:262)
        at 
org.apache.impala.service.FeCatalogManager$CatalogdImpl.updateCatalogCache(FeCatalogManager.java:114)
        at 
org.apache.impala.service.Frontend.updateCatalogCache(Frontend.java:560)
        at 
org.apache.impala.service.JniFrontend.updateCatalogCache(JniFrontend.java:186)
I0617 17:07:01.927062 4158610 impala-server.cc:2176] Catalog topic update 
applied with version: 14 new min catalog object version: 1
I0617 17:07:03.927403 1511568 StmtMetadataLoader.java:225] 
9c4e7b92e71da380:855fd23400000000] Waiting for table metadata. Waited for 10 
catalog updates and 12182ms. Tables remaining: [hive_partition.two_partition]
I0617 17:07:13.931190 1511568 StmtMetadataLoader.java:225] 
9c4e7b92e71da380:855fd23400000000] Waiting for table metadata. Waited for 20 
catalog updates and 22186ms. Tables remaining: [hive_partition.two_partition]
 
{code}
 

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to