Github user cloud-fan commented on the issue:
https://github.com/apache/spark/pull/19831
Is it really an issue? If you manually set wrong statistics, what would
you expect the system to do? I think data source tables don't allow you to set
statistics manually, so this problem is inherited from Hive. cc @wzhfy to
confirm.
This PR treats a row count of 0 as invalid, which is arguable: if we
analyze an empty table, the resulting row count of 0 is valid.
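
To illustrate the concern, here is a minimal sketch of the two policies. It is not Spark's code: `usable_row_count` and `treat_zero_as_invalid` are hypothetical names made up for this example, and the real statistics handling may differ.

```python
from typing import Optional


def usable_row_count(row_count: Optional[int],
                     treat_zero_as_invalid: bool) -> Optional[int]:
    """Return the row count a planner would trust, or None if it counts as missing.

    Hypothetical helper for illustration only; not part of Spark.
    """
    # A missing or negative count is never usable.
    if row_count is None or row_count < 0:
        return None
    # The policy under discussion: discard a count of 0 as if it were wrong.
    if row_count == 0 and treat_zero_as_invalid:
        return None
    return row_count


# Analyzing a genuinely empty table legitimately yields a count of 0.
analyzed_empty_table = 0
print(usable_row_count(analyzed_empty_table, treat_zero_as_invalid=True))
print(usable_row_count(analyzed_empty_table, treat_zero_as_invalid=False))
```

With the first policy, the valid statistic from the empty table is thrown away; with the second, it is kept, but a manually set wrong 0 would be trusted as well.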