[ 
https://issues.apache.org/jira/browse/IMPALA-12956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18083313#comment-18083313
 ] 

Aleksandr Efimov commented on IMPALA-12956:
-------------------------------------------

Uploaded a patch for review:
https://gerrit.cloudera.org/24353

Validation:
- ./buildall.sh -notests -noclean
- bin/impala-py.test -q tests/observability/test_profile_tool.py
- Manual smoke-tested summary output on V1/V2 profile logs
- Manual spill profile validation confirmed Bytes Spilled extraction

> Provide a summary output for impala-profile-tool
> ------------------------------------------------
>
>                 Key: IMPALA-12956
>                 URL: https://issues.apache.org/jira/browse/IMPALA-12956
>             Project: IMPALA
>          Issue Type: New Feature
>          Components: Infrastructure
>            Reporter: Quanlong Huang
>            Assignee: Aleksandr Efimov
>            Priority: Major
>
> impala-profile-tool is useful to parse the profile logs, i.e. log files like 
> impala_profile_log_1.1-1711621243861 under the /profiles folder. However, it 
> just prints the profiles in text format. It'd be useful to print a summary of 
> each query, e.g. providing a tsv file with one line for a query containing 
> columns like what we show in the /queries page, including
>  * Query ID
>  * User
>  * Default Db
>  * Query Type
>  * Start Time
>  * End Time
>  * Duration
>  * Queued Duration
>  * Mem Usage
>  * Mem Estimate
>  * -Scan Progress-
>  * -Query Progress-
>  * Bytes Read
>  * Bytes Sent
>  * Bytes Spilled
>  * State
>  * # rows fetched
>  * Resource Pool
>  * Statement (truncated)



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to