[
https://issues.apache.org/jira/browse/HIVE-27726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17768570#comment-17768570
]
Stamatis Zampetakis commented on HIVE-27726:
--------------------------------------------
HIVE-19770 and HIVE-27399 showcase some examples where enhanced constant
folding led to "missing lineage" information.
> No lineage for constant literals
> --------------------------------
>
> Key: HIVE-27726
> URL: https://issues.apache.org/jira/browse/HIVE-27726
> Project: Hive
> Issue Type: Improvement
> Components: HiveServer2
> Affects Versions: 4.0.0-beta-1
> Reporter: Stamatis Zampetakis
> Priority: Minor
>
> Consider the following statement where we create a table based on another
> table and some constant expressions/literals.
> {code:java}
> CREATE TABLE tbl1 AS (SELECT 'Bob', 'Alice', key, value || 'some' FROM src)
> {code}
> Observe that column 0, and column 1 of tbl1 originate from constant literals.
> Currently (commit e5a7ce2f091da1f8a324da6e489cda59b9e4bfc6), there is no
> lineage information for columns originating from constants.
> The {{org.apache.hadoop.hive.ql.hooks.LineageLogger}} will display the
> following the aforementioned DDL statement.
>
> {noformat}
> POSTHOOK: Lineage: tbl1._c0 SIMPLE []
> POSTHOOK: Lineage: tbl1._c1 SIMPLE []
> POSTHOOK: Lineage: tbl1._c3 EXPRESSION [(src)src.FieldSchema(name:value,
> type:string, comment:default), ]
> POSTHOOK: Lineage: tbl1.key SIMPLE [(src)src.FieldSchema(name:key,
> type:string, comment:default), ]
> {noformat}
> This is not really a bug since we cannot really say that something is broken
> but maybe there is a way to reflect that {{c0}} and {{c1}} originate from
> constants to avoid the misconception that lineage is missing.
> Note, that constant folding may also lead to the same behavior since
> expressions may be simplified to constants and the latter do not appear in
> the lineage output.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)