jcamachor commented on a change in pull request #970: Hive 23100
URL: https://github.com/apache/hive/pull/970#discussion_r409989722
 
 

 ##########
 File path: ql/src/test/results/clientpositive/perf/tez/query82.q.out
 ##########
 @@ -85,7 +85,7 @@ Stage-0
                               Select Operator [SEL_86] (rows=297 width=400)
                                 Output:["_col0","_col1","_col2","_col3"]
                                 Filter Operator [FIL_85] (rows=297 width=404)
-                                  predicate:(i_current_price BETWEEN 30 AND 60 and (i_manufact_id) IN (437, 129, 727, 663) and i_item_sk is not null)
 
 Review comment:
   I checked, since I thought it was worth verifying. `i_current_price` has a
min of 0.0900000035762787 and a max of 99.9899978637695; thus, assuming a
uniform distribution, BETWEEN will filter out ~70% of the data. On the other
hand, `i_manufact_id` has a min of 1 and a max of 1000; under the same
assumption, IN will filter out >99% of the data.
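
   For reference, the arithmetic behind those two estimates can be sketched as
below. This is only a back-of-the-envelope illustration, not Hive's actual
estimator: the helper names are hypothetical, and it assumes that every integer
between the min and max of `i_manufact_id` occurs equally often.

```python
def between_selectivity(lo, hi, col_min, col_max):
    """Fraction of rows kept by `col BETWEEN lo AND hi`, assuming values
    are uniformly distributed over [col_min, col_max]."""
    overlap = max(0.0, min(hi, col_max) - max(lo, col_min))
    return overlap / (col_max - col_min)


def in_selectivity(num_values, col_min, col_max):
    """Fraction of rows kept by an IN list with `num_values` entries.
    Assumption: the column holds every integer in [col_min, col_max]
    with equal frequency, so the distinct count is max - min + 1."""
    distinct = col_max - col_min + 1
    return min(1.0, num_values / distinct)


# Column statistics quoted in the comment above.
price_sel = between_selectivity(30, 60, 0.0900000035762787, 99.9899978637695)
manufact_sel = in_selectivity(4, 1, 1000)  # IN (437, 129, 727, 663)

print(f"BETWEEN keeps {price_sel:.1%}, filters out {1 - price_sel:.1%}")
print(f"IN keeps {manufact_sel:.1%}, filters out {1 - manufact_sel:.1%}")
# BETWEEN keeps 30.0%, filters out 70.0%
# IN keeps 0.4%, filters out 99.6%
```

Under these assumptions the IN predicate is by far the more selective of the two.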
