GitHub user scwf opened a pull request:
https://github.com/apache/spark/pull/3304
[SPARK-4443][SQL] Fix statistics for external table in spark sql hive
The `totalSize` of external table is always zero, which will influence
join strategy(always use broadcast join for external table).
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/scwf/spark statistics
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/3304.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #3304
----
commit 5f5fda8244c59970d117a0e9d32bd8bddde348f2
Author: w00228970 <[email protected]>
Date: 2014-11-17T02:34:17Z
https -> http in pom
commit 78e438699882b02543d7b31271b0c489834518f2
Author: w00228970 <[email protected]>
Date: 2014-11-17T03:17:40Z
fix statistics for external table
commit 3d7571d364d8e278cc81841ab160e6dac1845680
Author: w00228970 <[email protected]>
Date: 2014-11-17T05:46:20Z
Revert "https -> http in pom"
This reverts commit 5f5fda8244c59970d117a0e9d32bd8bddde348f2.
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]