[
https://issues.apache.org/jira/browse/SPARK-8403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong Shen updated SPARK-8403:
-----------------------------
Description:
When partition field and fieldSchema exist in sql predicates, pruner partition
won't effective.
Here is the sql,
{code}
select r.uin,r.vid,r.ctype,r.bakstr2,r.cmd from t_dw_qqlive_2090000026 r where
r.cmd = 2 and (r.imp_date = 20150615 or and hour(r.itimestamp)>16)
{code}
When run on hive, it will only scan data in partition 20150615, but if run on
spark sql, it will scan the whole table t_dw_qqlive_2090000026.
was:
When udf exit in sql predicates, pruner partition won't effective.
Here is the sql,
{code}
select r.uin,r.vid,r.ctype,r.bakstr2,r.cmd from t_dw_qqlive_2090000026 r where
r.cmd = 2 and (r.imp_date = 20150615 or and hour(r.itimestamp)>16)
{code}
When run on hive, it will only scan data in partition 20150615, but if run on
spark sql, it will scan the whole table t_dw_qqlive_2090000026.
> Pruner partition won't effective when partition field and fieldSchema exit in
> sql predicate
> -------------------------------------------------------------------------------------------
>
> Key: SPARK-8403
> URL: https://issues.apache.org/jira/browse/SPARK-8403
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Reporter: Hong Shen
>
> When partition field and fieldSchema exist in sql predicates, pruner
> partition won't effective.
> Here is the sql,
> {code}
> select r.uin,r.vid,r.ctype,r.bakstr2,r.cmd from t_dw_qqlive_2090000026 r
> where r.cmd = 2 and (r.imp_date = 20150615 or and hour(r.itimestamp)>16)
> {code}
> When run on hive, it will only scan data in partition 20150615, but if run on
> spark sql, it will scan the whole table t_dw_qqlive_2090000026.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]