miklosgergely opened a new pull request #801: HIVE-22248 Fix statistics 
persisting issues
URL: https://github.com/apache/hive/pull/801
 
 
   - During the Thrift call, the XXXXColumnStatsDataInspector was transformed 
into an XXXXColumnStatsData object, which was then converted back by calling 
the xxxxInspectorFromStats functions. The new object was never put back into 
aggregateStats, though, so all the modifications made by the 
XXXXColumnStatsMerger were made on an object that was never used again. Added 
aggregateColStats.getStatsData().setXXXXStats(aggregateData); calls to put them 
back, so the changes made by the merger actually take effect.
   
   - The min value was miscalculated for Long and Double types because a null 
(unset) value was treated as 0. This was fixed by also checking 
isSetLowValue() when calculating the min values.
   
   - In the case of vector_coalesce_3.q, the bad statistics made the engine 
"think" that the column was a primary key, following some statistics-based 
heuristics, and made it estimate the statistics in a different way; hence the 
change in the output.
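
To illustrate the first item above, here is a minimal sketch of the lost-update pattern and the write-back that fixes it. The classes below (LongStats, LongStatsInspector, StatsData, WriteBackSketch) are hypothetical stand-ins written for this example, not Hive's actual Thrift-generated types; only the shape of the setXXXXStats(aggregateData) call is taken from the description.

```java
// Hypothetical stand-in for a plain Thrift stats object (not Hive's real class).
class LongStats {
    long numNulls;
    LongStats(long numNulls) { this.numNulls = numNulls; }
}

// Hypothetical stand-in for the "inspector" subclass the merger works on.
class LongStatsInspector extends LongStats {
    LongStatsInspector(LongStats other) { super(other.numNulls); }
}

// Hypothetical holder, playing the role of getStatsData().
class StatsData {
    private LongStats longStats;
    LongStats getLongStats() { return longStats; }
    void setLongStats(LongStats s) { longStats = s; }
}

class WriteBackSketch {
    // Mirrors the conversion step: always builds a NEW inspector object.
    static LongStatsInspector inspectorFromStats(StatsData data) {
        return new LongStatsInspector(data.getLongStats());
    }

    static void merge(StatsData aggregate, StatsData incoming, boolean writeBack) {
        LongStatsInspector aggregateData = inspectorFromStats(aggregate);
        LongStatsInspector newData = inspectorFromStats(incoming);
        // The merger mutates the freshly created copy, not the original holder.
        aggregateData.numNulls += newData.numNulls;
        if (writeBack) {
            // Without this call the merged values are silently dropped.
            aggregate.setLongStats(aggregateData);
        }
    }

    static StatsData of(long numNulls) {
        StatsData d = new StatsData();
        d.setLongStats(new LongStats(numNulls));
        return d;
    }

    public static void main(String[] args) {
        StatsData lost = of(2);
        merge(lost, of(3), false);
        System.out.println("without write-back: " + lost.getLongStats().numNulls);

        StatsData kept = of(2);
        merge(kept, of(3), true);
        System.out.println("with write-back: " + kept.getLongStats().numNulls);
    }
}
```

In this sketch the merge without the write-back leaves the aggregate unchanged, which is the symptom the description reports.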

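The min-value problem from the second item can be sketched in the same spirit. LongRange and MinMergeSketch are hypothetical names for this example; the only detail borrowed from the description is that an isSetLowValue() check distinguishes "unset" from a real 0. The actual merger code in the pull request may be structured differently.

```java
// Hypothetical stand-in for a stats object with an optional low value.
class LongRange {
    private long lowValue;        // reads as 0 when never set
    private boolean lowValueSet;

    long getLowValue() { return lowValue; }
    boolean isSetLowValue() { return lowValueSet; }
    void setLowValue(long v) { lowValue = v; lowValueSet = true; }
}

class MinMergeSketch {
    // Naive version: an unset low value is read as 0 and wins the comparison.
    static long naiveMin(LongRange a, LongRange b) {
        return Math.min(a.getLowValue(), b.getLowValue());
    }

    // Null-aware version: only values that were actually set take part.
    static void mergeLow(LongRange aggregate, LongRange incoming) {
        if (!incoming.isSetLowValue()) {
            return; // nothing to merge
        }
        if (!aggregate.isSetLowValue()) {
            aggregate.setLowValue(incoming.getLowValue());
        } else {
            aggregate.setLowValue(
                Math.min(aggregate.getLowValue(), incoming.getLowValue()));
        }
    }

    public static void main(String[] args) {
        LongRange aggregate = new LongRange(); // low value unset
        LongRange incoming = new LongRange();
        incoming.setLowValue(5);

        System.out.println("naive min: " + naiveMin(aggregate, incoming));
        mergeLow(aggregate, incoming);
        System.out.println("null-aware min: " + aggregate.getLowValue());
    }
}
```
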
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services
