[jira] [Commented] (HIVE-24574) Add DIAGNOSE Statement

2021-01-05 Thread David Mollitor (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-24574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17259102#comment-17259102
 ] 

David Mollitor commented on HIVE-24574:
---

Results could be a huge result set, so it's not always practical to collect it, 
so I propose here to make it optional.

> Add DIAGNOSE Statement
> --
>
> Key: HIVE-24574
> URL: https://issues.apache.org/jira/browse/HIVE-24574
> Project: Hive
>  Issue Type: Improvement
>Reporter: David Mollitor
>Priority: Major
>
> Add a new statement to Hive called {{DIAGNOSE}}
> {code:sql}
> DIAGNOSE [WITH (PARQUET|ORC|JSON|AVRO) RESULTS] ...
> {code}
> Returns a single binary (BLOB) column which contains a TAR-GZ file comprised 
> of several other files:
> * A JSON file containing HS2 version information, HS2 host name, date or 
> query submission, query id(s) etc.
> * The query itself (file name is MD5 of the query)
> * EXPLAIN plan (file name is MD5 of the explain plan)
> * SHOW CREATE for each table in the query ()
> * The configuration of the session (set)
> * The Hive logs generated by the query
> * The processing engine logs generated by the query
> * Any counters associated with the processing engine
> * Optionally, the results of the query in a single file (file name is MD5 of 
> the results)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (HIVE-24574) Add DIAGNOSE Statement

2021-01-05 Thread David Mollitor (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-24574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17259100#comment-17259100
 ] 

David Mollitor commented on HIVE-24574:
---

Keeping in mind that a query can be made up of nested views, so each 'table' in 
the query needs a SHOW CREATE for itself and all of the sub-views/tables that 
make it compose it.

> Add DIAGNOSE Statement
> --
>
> Key: HIVE-24574
> URL: https://issues.apache.org/jira/browse/HIVE-24574
> Project: Hive
>  Issue Type: Improvement
>Reporter: David Mollitor
>Priority: Major
>
> Add a new statement to Hive called {{DIAGNOSE}}
> {code:sql}
> DIAGNOSE [WITH (PARQUET|ORC|JSON|AVRO) RESULTS] ...
> {code}
> Returns a single binary (BLOB) column which contains a TAR-GZ file comprised 
> of several other files:
> * A JSON file containing HS2 version information, HS2 host name, date or 
> query submission, query id(s) etc.
> * The query itself (file name is MD5 of the query)
> * EXPLAIN plan (file name is MD5 of the explain plan)
> * SHOW CREATE for each table in the query ()
> * The configuration of the session (set)
> * The Hive logs generated by the query
> * The processing engine logs generated by the query
> * Any counters associated with the processing engine
> * Optionally, the results of the query in a single file (file name is MD5 of 
> the results)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)