[
https://issues.apache.org/jira/browse/HIVE-2453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13106720#comment-13106720
]
[email protected] commented on HIVE-2453:
-----------------------------------------------------
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1933/
-----------------------------------------------------------
Review request for hive and Ning Zhang.
Summary
-------
The information that would be useful for categorizing queries is clearest in
the Semantic Analyzer, when the data from the Parser is interpreted. I added a
new class which is designed to collect that data here, and place it ultimately
in the QueryPlan where it will be available to hooks.
The information I collect is whether or not the query has the following clauses:
Join
Group By
Order By
Sort By
Group By after a Join clause
Also, I store whether or not a script is used for mapping or reducing.
This addresses bug HIVE-2453.
https://issues.apache.org/jira/browse/HIVE-2453
Diffs
-----
trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryPlan.java 1170719
trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryProperties.java PRE-CREATION
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java
1170719
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
1170719
trunk/ql/src/test/org/apache/hadoop/hive/ql/hooks/CheckQueryPropertiesHook.java
PRE-CREATION
trunk/ql/src/test/queries/clientpositive/query_properties.q PRE-CREATION
trunk/ql/src/test/results/clientpositive/query_properties.q.out PRE-CREATION
Diff: https://reviews.apache.org/r/1933/diff
Testing
-------
I added a new test, which runs a variety of queries, such that each of the
flags in QueryProperties is set by at least one query, and also some are set in
combinations.
I also added a hook which prints the contents of QueryProperties to error on
the console.
I checked the output in the results file and verified it matched what I
expected.
Thanks,
Kevin
> Need a way to categorize queries in hooks for improved logging
> --------------------------------------------------------------
>
> Key: HIVE-2453
> URL: https://issues.apache.org/jira/browse/HIVE-2453
> Project: Hive
> Issue Type: Improvement
> Reporter: Kevin Wilfong
> Assignee: Kevin Wilfong
> Attachments: HIVE-2453.1.patch.txt
>
>
> We need a way to categorize queries, such as whether or not the include a
> join clause, a group by clause, etc., in the hooks. This will allow for
> better performance logging.
> Currently the only way I can find is to go through the operators in the
> tasks, but which operators are used for the different types of queries may
> change over time.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira