Re: count(*) with count(distinct) gives different results between Hive 2.3.2 and Hive 3.1.2

2020-07-29 Thread Eugene Chung
I found that hive.optimize.countdistinct=true; is the problem, It looks like https://issues.apache.org/jira/browse/HIVE-16654 made the side effect. Best regards, Eugene Chung (Korean : 정의근)

Re: Help needed for count(*) query error on Hive 3.1.2 & Tez 0.9.2

2020-07-02 Thread Eugene Chung
think it’s not. Regards, Eugene On Thu, Jul 2, 2020 at 5:59 Zoltan Haindrich wrote: > Hey Eugene! > > I don't see any hints in these outputs what could be the issue...have you > checked the tez container logs? > > cheers, > Zoltan > > > On 7/1/20 9:58 AM, Eugene

Help needed for count(*) query error on Hive 3.1.2 & Tez 0.9.2

2020-07-01 Thread Eugene Chung
;_col0"],aggregations:["count()"] | | Select Operator [SEL_2] (rows=1 width=18325049344) | | TableScan [TS_0] (rows=1 width=18325049344) | | MY_DB@ORC_TABLE,ORC_TABLE,Tbl:PARTIAL,Col:NONE | | | ++ Best regards, Eugene Chung (Korean : 정의근)

count(*) with count(distinct) gives different results between Hive 2.3.2 and Hive 3.1.2

2020-07-29 Thread Eugene Chung
160) | | Output:["mid"] | | TableScan [TS_0] (rows=343640771 width=4160) | | japan_line_search_refine_log@daily_kpi_log,daily_kpi_log,Tbl:COMPLETE,Col:NONE,Output:["mid"] | || ++ Any guess on the reason why they are different? Best regards, Eugene Chung (Korean : 정의근)