okumin opened a new pull request #1984: URL: https://github.com/apache/hive/pull/1984
https://issues.apache.org/jira/browse/HIVE-24240 ### What changes were proposed in this pull request? Fix incorrect computations to estimate UDTF size. - Put 1 when numRows becomes zero because Hive expects it will be non-zero in regular cases - Use `StatsUtils .scaleColStatistics` to update col stats so as to update # of distinct values - Wrap the final stats with `applyRuntimeStats` This is a follow-up of https://github.com/apache/hive/pull/1531. ### Why are the changes needed? This PR would help Hive to compute more precise stats for UDTF. ### Does this PR introduce _any_ user-facing change? Compatible from the point of view of users. ### How was this patch tested? Revised one unit test. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
