[jira] [Updated] (SPARK-19408) cardinality estimation involving two columns of the same table

2017-04-03 Thread Xiao Li (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-19408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xiao Li updated SPARK-19408:

Description: 
In SPARK-17075, we estimate the cardinality of predicate expressions of the form
"column (op) literal", where op is =, <, <=, >, >= or <=>. SQL queries also
contain predicate expressions involving two columns, such as
"column-1 (op) column-2", where column-1 and column-2 belong to the same table.
Note that if column-1 and column-2 belong to different tables, estimating the
predicate is the join operator's work, NOT the filter operator's work.

In this JIRA, we want to estimate the filter factor of predicate expressions
involving two columns of the same table. For example, multiple TPC-H queries
contain a predicate of this kind: "WHERE l_commitdate < l_receiptdate".

  was:
In SPARK-17075, we estimate the cardinality of predicate expressions of the form
"column (op) literal", where op is =, <, <=, >, or >=. SQL queries also
contain predicate expressions involving two columns, such as
"column-1 (op) column-2", where column-1 and column-2 belong to the same table.
Note that if column-1 and column-2 belong to different tables, estimating the
predicate is the join operator's work, NOT the filter operator's work.

In this JIRA, we want to estimate the filter factor of predicate expressions
involving two columns of the same table. For example, multiple TPC-H queries
contain a predicate of this kind: "WHERE l_commitdate < l_receiptdate".


> cardinality estimation involving two columns of the same table
> --
>
> Key: SPARK-19408
> URL: https://issues.apache.org/jira/browse/SPARK-19408
> Project: Spark
> Issue Type: Sub-task
> Components: Optimizer
> Affects Versions: 2.1.0
> Reporter: Ron Hu
> Fix For: 2.2.0
>
>
> In SPARK-17075, we estimate the cardinality of predicate expressions of the form
> "column (op) literal", where op is =, <, <=, >, >= or <=>. SQL queries also
> contain predicate expressions involving two columns, such as
> "column-1 (op) column-2", where column-1 and column-2 belong to the same table.
> Note that if column-1 and column-2 belong to different tables, estimating the
> predicate is the join operator's work, NOT the filter operator's work.
> In this JIRA, we want to estimate the filter factor of predicate expressions
> involving two columns of the same table. For example, multiple TPC-H queries
> contain a predicate of this kind: "WHERE l_commitdate < l_receiptdate".
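
As an illustration of what a filter factor for "column-1 < column-2" could look like, here is a minimal sketch. It is not taken from the issue or from Spark's code. It assumes only per-column min/max statistics are available, and the names ColumnRange, OVERLAP_FALLBACK, less_than_filter_factor and estimate_rows are hypothetical. When the two value ranges do not overlap, the predicate is decided for every row; when they overlap, the sketch falls back to an assumed constant.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ColumnRange:
    """Hypothetical per-column statistics: observed minimum and maximum."""
    lo: float
    hi: float


# Assumed fallback for overlapping ranges, where min/max alone cannot decide
# the predicate. The value is illustrative and does not come from the issue.
OVERLAP_FALLBACK = 1.0 / 3.0


def less_than_filter_factor(left: ColumnRange, right: ColumnRange) -> float:
    """Estimate the fraction of rows that satisfy `left < right`."""
    if left.hi < right.lo:
        # Every left value is below every right value: all rows qualify.
        return 1.0
    if left.lo >= right.hi:
        # No left value can be below any right value: no rows qualify.
        return 0.0
    # Ranges overlap: use the assumed constant.
    return OVERLAP_FALLBACK


def estimate_rows(row_count: int, factor: float) -> int:
    """Scale the input row count by the filter factor."""
    return round(row_count * factor)
```

Date columns such as l_commitdate and l_receiptdate could be fed into this sketch by first mapping each date to a number, for example a day count.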



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-19408) cardinality estimation involving two columns of the same table

2017-03-31 Thread Ron Hu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-19408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ron Hu updated SPARK-19408:
---
Description: 
In SPARK-17075, we estimate the cardinality of predicate expressions of the form
"column (op) literal", where op is =, <, <=, >, or >=. SQL queries also
contain predicate expressions involving two columns, such as
"column-1 (op) column-2", where column-1 and column-2 belong to the same table.
Note that if column-1 and column-2 belong to different tables, estimating the
predicate is the join operator's work, NOT the filter operator's work.

In this JIRA, we want to estimate the filter factor of predicate expressions
involving two columns of the same table. For example, multiple TPC-H queries
contain a predicate of this kind: "WHERE l_commitdate < l_receiptdate".

  was:
In SPARK-17075, we estimate the cardinality of predicate expressions of the form
"column (op) literal", where op is =, <, <=, >, or >=. SQL queries also
contain predicate expressions involving two columns, such as
"column-1 (op) column-2", where column-1 and column-2 belong to the same table.
Note that if column-1 and column-2 belong to different tables, estimating the
predicate is the join operator's work, NOT the filter operator's work.

In this JIRA, we want to estimate the filter factor of predicate expressions
involving two columns of the same table.


> cardinality estimation involving two columns of the same table
> --
>
> Key: SPARK-19408
> URL: https://issues.apache.org/jira/browse/SPARK-19408
> Project: Spark
> Issue Type: Sub-task
> Components: Optimizer
> Affects Versions: 2.1.0
> Reporter: Ron Hu
>
> In SPARK-17075, we estimate the cardinality of predicate expressions of the form
> "column (op) literal", where op is =, <, <=, >, or >=. SQL queries also
> contain predicate expressions involving two columns, such as
> "column-1 (op) column-2", where column-1 and column-2 belong to the same table.
> Note that if column-1 and column-2 belong to different tables, estimating the
> predicate is the join operator's work, NOT the filter operator's work.
> In this JIRA, we want to estimate the filter factor of predicate expressions
> involving two columns of the same table. For example, multiple TPC-H queries
> contain a predicate of this kind: "WHERE l_commitdate < l_receiptdate".
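
The distinction drawn in the description (column vs. literal, two columns of the same table, two columns of different tables) can be sketched as a small classifier. This is only an illustration of the scoping rule. The types Column and Literal and the function classify are hypothetical and do not correspond to Spark classes.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Column:
    """Hypothetical column reference, qualified by its table name."""
    table: str
    name: str


@dataclass(frozen=True)
class Literal:
    """Hypothetical constant operand."""
    value: object


Operand = Union[Column, Literal]


def classify(left: Operand, right: Operand) -> str:
    """Say which estimator a binary comparison would belong to."""
    columns = [o for o in (left, right) if isinstance(o, Column)]
    if not columns:
        # Both operands are literals: nothing to estimate from column stats.
        return "constant"
    if len(columns) == 1:
        # The "column (op) literal" case described for SPARK-17075.
        return "filter: column vs literal"
    if columns[0].table == columns[1].table:
        # The "column-1 (op) column-2" case this issue targets.
        return "filter: two columns"
    # Columns of different tables: the join operator's work.
    return "join"
```

For example, classify(Column("lineitem", "l_commitdate"), Column("lineitem", "l_receiptdate")) falls into the two-column filter case, because both columns come from the same table.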






[jira] [Updated] (SPARK-19408) cardinality estimation involving two columns of the same table

2017-03-24 Thread Ron Hu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-19408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ron Hu updated SPARK-19408:
---
Target Version/s: 2.2.0  (was: 2.3.0)

> cardinality estimation involving two columns of the same table
> --
>
> Key: SPARK-19408
> URL: https://issues.apache.org/jira/browse/SPARK-19408
> Project: Spark
> Issue Type: Sub-task
> Components: Optimizer
> Affects Versions: 2.1.0
> Reporter: Ron Hu
>
> In SPARK-17075, we estimate the cardinality of predicate expressions of the form
> "column (op) literal", where op is =, <, <=, >, or >=. SQL queries also
> contain predicate expressions involving two columns, such as
> "column-1 (op) column-2", where column-1 and column-2 belong to the same table.
> Note that if column-1 and column-2 belong to different tables, estimating the
> predicate is the join operator's work, NOT the filter operator's work.
> In this JIRA, we want to estimate the filter factor of predicate expressions
> involving two columns of the same table.






[jira] [Updated] (SPARK-19408) cardinality estimation involving two columns of the same table

2017-03-21 Thread Ron Hu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-19408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ron Hu updated SPARK-19408:
---
Target Version/s: 2.3.0  (was: 2.2.0)

> cardinality estimation involving two columns of the same table
> --
>
> Key: SPARK-19408
> URL: https://issues.apache.org/jira/browse/SPARK-19408
> Project: Spark
> Issue Type: Sub-task
> Components: Optimizer
> Affects Versions: 2.1.0
> Reporter: Ron Hu
>
> In SPARK-17075, we estimate the cardinality of predicate expressions of the form
> "column (op) literal", where op is =, <, <=, >, or >=. SQL queries also
> contain predicate expressions involving two columns, such as
> "column-1 (op) column-2", where column-1 and column-2 belong to the same table.
> Note that if column-1 and column-2 belong to different tables, estimating the
> predicate is the join operator's work, NOT the filter operator's work.
> In this JIRA, we want to estimate the filter factor of predicate expressions
> involving two columns of the same table.


