Quanlong Huang created IMPALA-12008:
---------------------------------------
Summary: End Time in profile is set before DML finishes
Key: IMPALA-12008
URL: https://issues.apache.org/jira/browse/IMPALA-12008
Project: IMPALA
Issue Type: Bug
Components: Backend
Reporter: Quanlong Huang
Attachments: insert_profile.txt
The runtime profile for DML (e.g. INSERT) has incorrect "End Time" that it's
actually the time of DML data written. The DML might still running in updating
hive metastore.
Attached a profile as an example:
{code:java}
Query (id=b64d22b7100e40ea:5345525100000000):
Start Time: 2023-03-17 13:17:05.158778000
End Time: 2023-03-17 13:17:05.679796000
Query Type: DML
Sql Statement: insert into my_alltypes partition(year, month) select * from
functional.alltypestiny{code}
The "End Time" is 13:17:05. However, it's still running after that time.
Showing in the coordinator logs:
{noformat}
I0317 13:17:05.158836 32226 impala-server.cc:1363]
b64d22b7100e40ea:5345525100000000] Registered query
query_id=b64d22b7100e40ea:5345525100000000
session_id=4f4b9d15a11d89b1:483361be5dda7d80
I0317 13:17:05.159116 32226 Frontend.java:1928]
b64d22b7100e40ea:5345525100000000] Analyzing query: insert into my_alltypes
partition(year, month) select * from functional.alltypestiny db: default
...
I0317 13:17:05.658111 1848 coordinator.cc:855] Finalizing query:
b64d22b7100e40ea:5345525100000000
I0317 13:17:05.677682 1848 coordinator.cc:899] Removing staging directory:
hdfs://localhost:20500/test-warehouse/my_alltypes/_impala_insert_staging/b64d22b7100e40ea_5345525100000000/
I0317 13:17:05.679735 1848 coordinator.cc:735] ExecState: query
id=b64d22b7100e40ea:5345525100000000 execution completed
I0317 13:17:05.679783 1848 coordinator.cc:1399] Release admission control
resources for query_id=b64d22b7100e40ea:5345525100000000
I0317 13:17:05.680047 1848 client-request-state.cc:1523] Updating metastore
with 4 altered partitions (year=2009/month=1, year=2009/month=2,
year=2009/month=3, year=2009/month=4)
I0317 13:17:05.680058 1848 client-request-state.cc:1553] Executing
FinalizeDml() using CatalogService
I0317 13:17:31.990342 1848 ImpaladCatalog.java:223] Adding:
TABLE:default.my_alltypes version: 1838 size: 7747
I0317 13:17:32.309155 32226 impala-hs2-server.cc:845] CloseOperation():
query_id=b64d22b7100e40ea:5345525100000000
I0317 13:17:32.309190 32226 impala-server.cc:1447] UnregisterQuery():
query_id=b64d22b7100e40ea:5345525100000000
I0317 13:17:32.309198 32226 coordinator-backend-state.cc:998]
query_id=b64d22b7100e40ea:5345525100000000 target backend=127.0.0.1:27002: Not
cancelling because the backend is already done:
I0317 13:17:32.309206 32226 coordinator-backend-state.cc:998]
query_id=b64d22b7100e40ea:5345525100000000 target backend=127.0.0.1:27001: Not
cancelling because the backend is already done:
I0317 13:17:32.309214 32226 coordinator-backend-state.cc:998]
query_id=b64d22b7100e40ea:5345525100000000 target backend=127.0.0.1:27000: Not
cancelling because the backend is already done:
I0317 13:17:32.309221 32226 coordinator.cc:1023] CancelBackends()
query_id=b64d22b7100e40ea:5345525100000000, tried to cancel 0 backends
I0317 13:17:32.312103 32105 impala-server.cc:1479] Query successfully
unregistered: query_id=b64d22b7100e40ea:5345525100000000
I0317 13:17:32.315893 32226 query-exec-mgr.cc:213] ReleaseQueryState(): deleted
query_id=b64d22b7100e40ea:5345525100000000
I0317 13:17:34.208513 32140 ImpaladCatalog.java:223] Adding:
TABLE:default.my_alltypes version: 1838 size: 3860
I0317 13:17:34.208752 32140 ImpaladCatalog.java:223] Adding: CATALOG_SERVICE_ID
version: 1838 size: 60
I0317 13:17:34.209161 32140 ImpaladCatalog.java:248] Adding 4 partition(s):
HDFS_PARTITION:default.my_alltypes:(year=2009/month=1,year=2009/month=2,...,year=2009/month=4),
version=1838, size=(avg=1006, min=1006, max=1006, sum=4024)
I0317 13:17:34.210037 32140 impala-server.cc:2061] Catalog topic update applied
with version: 1838 new min catalog object version: 3{noformat}
Note that to reproduce the issue, I add a breakpoint here to manually pause
updateCatalog() in catalogd:
[https://github.com/apache/impala/blob/8b375a66a29cfea33a4e418b1fa3b9b5139cf907/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java#L6791]
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]