Sindhu Subhas created HIVE-25985: ------------------------------------ Summary: Estimate stats gives out incorrect number of columns when using predicates like c=22 Key: HIVE-25985 URL: https://issues.apache.org/jira/browse/HIVE-25985 Project: Hive Issue Type: Bug Components: Hive Affects Versions: 3.0.0 Environment: Hive 3 Reporter: Sindhu Subhas
Table type: External Stats: No stats collected. When filter operator appeared in the plan and the row estimates went bad. Changed the original query on table and modifying the filter predicate form. |*predicate form*|*optimised as* |*filter Op rows out*|*estimate quality*| |prd_i_tmp.type = '22'|predicate:(type = '22')|Filter Operator [FIL_12] (rows=5 width=3707) \||bad| |prd_i_tmp.type in ('22')|predicate:(type = '22')|Filter Operator [FIL_12] (rows=5 width=3707)|bad| |prd_i_tmp.type < '23' and prd_i_tmp.type > '21'|predicate:((type < '23') and (type > '21'))|Filter Operator [FIL_12] (rows=8706269 width=3707) |good| |prd_i_tmp.type like '22'|predicate:(type like '22')|Filter Operator [FIL_12] (rows=39178213 width=3707)|best| |prd_i_tmp.type in ('22','AA','BB')|predicate:(type) IN ('22', 'AA', 'BB')|Filter Operator [FIL_12] (rows=15 width=3707)|bad| |prd_i_tmp.type rlike '22'|predicate:type regexp '22'| Filter Operator [FIL_12] (rows=39178213 width=3707)|good| -- This message was sent by Atlassian Jira (v8.20.1#820001)