[
https://issues.apache.org/jira/browse/IMPALA-12956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18083313#comment-18083313
]
Aleksandr Efimov commented on IMPALA-12956:
-------------------------------------------
Uploaded a patch for review:
https://gerrit.cloudera.org/24353
Validation:
- ./buildall.sh -notests -noclean
- bin/impala-py.test -q tests/observability/test_profile_tool.py
- Manually smoke-tested summary output on V1/V2 profile logs
- Manually validated a spill profile and confirmed Bytes Spilled extraction
> Provide a summary output for impala-profile-tool
> ------------------------------------------------
>
> Key: IMPALA-12956
> URL: https://issues.apache.org/jira/browse/IMPALA-12956
> Project: IMPALA
> Issue Type: New Feature
> Components: Infrastructure
> Reporter: Quanlong Huang
> Assignee: Aleksandr Efimov
> Priority: Major
>
> impala-profile-tool is useful for parsing profile logs, i.e. log files like
> impala_profile_log_1.1-1711621243861 under the /profiles folder. However, it
> only prints the profiles in text format. It would be useful to print a summary
> of each query, e.g. a TSV file with one line per query, containing
> columns like those shown on the /queries page, including
> * Query ID
> * User
> * Default Db
> * Query Type
> * Start Time
> * End Time
> * Duration
> * Queued Duration
> * Mem Usage
> * Mem Estimate
> * -Scan Progress-
> * -Query Progress-
> * Bytes Read
> * Bytes Sent
> * Bytes Spilled
> * State
> * # rows fetched
> * Resource Pool
> * Statement (truncated)
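
For illustration only, here is a minimal Python sketch of what a one-line-per-query TSV summary with the columns above could look like. It is not taken from the Gerrit patch and does not reflect how impala-profile-tool is implemented. The function names (write_summary, truncate_statement), the dict-based input records, and the 250-character truncation limit are all assumptions made for this example. The two struck-through columns (Scan Progress, Query Progress) are left out.

```python
import csv
import io

# Column names copied from the issue description; the struck-through
# "Scan Progress" and "Query Progress" entries are omitted.
COLUMNS = [
    "Query ID", "User", "Default Db", "Query Type", "Start Time", "End Time",
    "Duration", "Queued Duration", "Mem Usage", "Mem Estimate", "Bytes Read",
    "Bytes Sent", "Bytes Spilled", "State", "# rows fetched", "Resource Pool",
    "Statement",
]


def truncate_statement(stmt, max_len=250):
    """Collapse all whitespace to single spaces and cap the length.

    max_len=250 is an assumed default, not a value from the issue or patch.
    """
    flat = " ".join(stmt.split())
    if len(flat) <= max_len:
        return flat
    return flat[:max_len - 3] + "..."


def write_summary(records, out, max_len=250):
    """Write a header row plus one tab-separated row per query record.

    `records` is a hypothetical iterable of dicts keyed by the names in
    COLUMNS; missing keys are written as empty fields.
    """
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow(COLUMNS)
    for rec in records:
        row = [str(rec.get(col, "")) for col in COLUMNS[:-1]]
        row.append(truncate_statement(rec.get("Statement", ""), max_len))
        writer.writerow(row)


if __name__ == "__main__":
    # Made-up record, used only to exercise the sketch.
    sample = [{
        "Query ID": "example-query-id",
        "User": "alice",
        "Query Type": "QUERY",
        "State": "FINISHED",
        "Statement": "select count(*)\n  from t",
    }]
    buf = io.StringIO()
    write_summary(sample, buf)
    print(buf.getvalue(), end="")
```

Collapsing whitespace in the statement before writing keeps each query on a single physical line, which is what makes the output easy to process with line-oriented tools.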