[ https://issues.apache.org/jira/browse/HIVE-431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Adam Kramer updated HIVE-431:
-----------------------------
Description:
A syntactic copy of the query that was used to fill a table would often be
AMAZINGLY useful for figuring out where the data in the table came from.
I think the best way to implement this would be to automatically add a table
property which includes the SELECT statement. For partitioned tables, this
would need to exist for each partition...or perhaps use some canonical name
like selectquery for unpartitioned tables, plus selectquery_ds=<DATEID> for
partitioned tables.
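
As a rough illustration of the naming scheme suggested above, the property names could be derived along these lines. This is only a sketch: the helper names (provenance_key, escape_literal, set_property_statement) are hypothetical, the statement assumes Hive's ALTER TABLE ... SET TBLPROPERTIES form, and backslash-escaping of quote marks is an assumption about how Hive reads string literals, not something this issue specifies.

```python
def provenance_key(ds=None):
    """Return the property name proposed in the description.

    ds is the partition's date id; None means an unpartitioned table.
    """
    if ds is None:
        return "selectquery"
    return f"selectquery_ds={ds}"


def escape_literal(text):
    # Assumption: backslash-escaping backslashes and single quotes is
    # enough to embed text in a single-quoted Hive string literal.
    return text.replace("\\", "\\\\").replace("'", "\\'")


def set_property_statement(table, query, ds=None):
    """Build the statement a user would otherwise have to write by hand."""
    key = provenance_key(ds)
    return (
        f"ALTER TABLE {table} SET TBLPROPERTIES "
        f"('{escape_literal(key)}' = '{escape_literal(query)}')"
    )


if __name__ == "__main__":
    # Hypothetical table name and query, for illustration only.
    print(set_property_statement("daily_totals", "SELECT 'x' FROM src", ds="20090420"))
```

The point of the wish is that Hive would record this automatically when the table or partition is written, so nobody has to build or remember such a statement.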
This problem is growing as more and more tables in our database are generated
either by "root" or by people who are no longer easy to contact.
was:
It would be lovely if, when I write a query that generates a table, that table
could have a "query" property automatically set which includes the query which
I used to create the table. This is currently doable by using ALTER TABLE
table_name SET PROPERTIES (query_partitionid='SELECT ...'), but for my goal,
that's an unholy mess due to quoting out ' marks...plus, it's really likely
that I'll forget to do so (esp. if a query fails).
Summary: Auto-add table property "select" to be the select statement
that created the table (was: Add generating query as a "table property")
> Auto-add table property "select" to be the select statement that created the
> table
> ----------------------------------------------------------------------------------
>
> Key: HIVE-431
> URL: https://issues.apache.org/jira/browse/HIVE-431
> Project: Hadoop Hive
> Issue Type: Wish
> Reporter: Adam Kramer