[ 
https://issues.apache.org/jira/browse/IMPALA-6556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18045146#comment-18045146
 ] 

Quanlong Huang commented on IMPALA-6556:
----------------------------------------

{quote} * Progress
 ** How many partitions have been loaded. This is only useful for HdfsTable. It 
can be tracked in this 
[loop|https://github.com/apache/impala/blob/711797e7fbda6f30fc49d91e30ad6ab31a4f4a69/fe/src/main/java/org/apache/impala/catalog/ParallelFileMetadataLoader.java#L193].{quote}
As I mentioned before, we can use this loop: 
[https://github.com/apache/impala/blob/711797e7fbda6f30fc49d91e30ad6ab31a4f4a69/fe/src/main/java/org/apache/impala/catalog/ParallelFileMetadataLoader.java#L193]

It's not precise, but it's at least useful. So yes, we need changes in 
ParallelFileMetadataLoader to expose the progress.
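
A minimal sketch of what exposing that progress might look like, assuming the 
loop waits on one future per partition directory. The class and method names 
here (LoadProgressSketch, loadAll, getLoaded, getTotal) are hypothetical and 
are not the actual ParallelFileMetadataLoader API:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical sketch; not the real ParallelFileMetadataLoader. */
final class LoadProgressSketch {
  private final AtomicInteger loaded = new AtomicInteger();
  private volatile int total = 0;

  /** Number of partition tasks whose results have been collected so far. */
  int getLoaded() { return loaded.get(); }

  /** Number of partition tasks submitted by the last call to loadAll(). */
  int getTotal() { return total; }

  /** Runs one task per partition and bumps the counter as each result is collected. */
  void loadAll(List<Runnable> partitionTasks, int parallelism) {
    total = partitionTasks.size();
    loaded.set(0);
    ExecutorService pool = Executors.newFixedThreadPool(parallelism);
    try {
      List<Future<?>> futures = new ArrayList<>();
      for (Runnable task : partitionTasks) futures.add(pool.submit(task));
      // Futures are awaited in submission order, so the counter can lag behind
      // the number of tasks that have really finished. That is the imprecision.
      for (Future<?> f : futures) {
        f.get();
        loaded.incrementAndGet();
      }
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new RuntimeException(e);
    } catch (ExecutionException e) {
      throw new RuntimeException(e.getCause());
    } finally {
      pool.shutdown();
    }
  }
}
```

Another thread (e.g. whatever renders the WebUI page) could then poll 
getLoaded() and getTotal() while loadAll() is still running.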

On the other hand, we have metrics like num_file_metadata_loading_tasks, but 
it's a global counter. We could track it at the table level, but that needs more 
code changes. To keep it simple, I think we can try using the above loop 
progress first.
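
For comparison, a rough sketch of what table-level tracking might involve. It 
assumes the counter goes up when a task starts and down when it finishes, 
which is an assumption about the metric's semantics. LoadingTaskCounters and 
its methods are hypothetical names, not existing Impala code:

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicLong;

/** Hypothetical sketch: a global task counter plus a per-table breakdown. */
final class LoadingTaskCounters {
  private final AtomicLong globalTasks = new AtomicLong();
  private final ConcurrentHashMap<String, Long> perTable = new ConcurrentHashMap<>();

  void taskStarted(String tableName) {
    globalTasks.incrementAndGet();
    // merge() is atomic per key: insert 1 or add 1 to the existing count.
    perTable.merge(tableName, 1L, Long::sum);
  }

  void taskFinished(String tableName) {
    globalTasks.decrementAndGet();
    // Returning null removes the entry, so idle tables do not pile up in the map.
    perTable.computeIfPresent(tableName, (k, v) -> v <= 1 ? null : v - 1);
  }

  long globalCount() { return globalTasks.get(); }

  long countFor(String tableName) {
    return perTable.getOrDefault(tableName, 0L);
  }
}
```

Every place that touches the global counter would also need to know the table 
name, which is where the extra code changes come from.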

> Show what tables have been loading on Catalog WebUI
> ---------------------------------------------------
>
>                 Key: IMPALA-6556
>                 URL: https://issues.apache.org/jira/browse/IMPALA-6556
>             Project: IMPALA
>          Issue Type: New Feature
>          Components: Catalog
>            Reporter: Juan Yu
>            Assignee: Arnab Karmakar
>            Priority: Major
>         Attachments: Screenshot 2025-12-15 at 11.09.06 AM.png, Screenshot 
> 2025-12-15 at 11.10.18 AM.png
>
>
> This helps users know how many DDLs are running and how many tables are 
> being loaded, so they can tell whether a query is hung or just waiting for 
> metadata. 



