miklosgergely opened a new pull request #801: HIVE-22248 Fix statistics persisting issues URL: https://github.com/apache/hive/pull/801 - During the thrift call the XXXXColumnStatsDataInspector was transformed into a XXXXColumnStatsData object, which then was converted back, by calling the xxxxInspectorFromStats functions. The new object was never put back though to the aggregateStats, so all the modifications made by the XXXXColumnStatsMerger was made on an object that was never used again. Added aggregateColStats.getStatsData().setXXXXStats(aggregateData); calls to put them there, so the changes made by the merger are actually in effect. - The min value was miscalculated for Long and Double types, as the null value was treated as 0. It was fixed by calculating the min values by also using the isSetLowValue() function. - In case of vector_coalesce_3.q the bad statistics made the engine "think" that the column is a primary key following some heuristics based on statistics, and made it guess the statistics in a different way, thus is the output change.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org For additional commands, e-mail: gitbox-h...@hive.apache.org