Alex Behm has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/8110 )

Change subject: IMPALA-5955: Use totalSize tblproperty instead of rawDataSize.
......................................................................


Patch Set 1:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/8110/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/8110/1//COMMIT_MSG@9
PS1, Line 9: Today, Impala populates the 'rawDataSize' property
           : during COMPUTE STATS for the purpose of extrapolating
           : row counts based on file sizes.
           :
           : Intended meaning/use of tblproperties:
           : - rawDataSize' is the estimated in-memory size of a table
           :   (without encoding and compression)
           : - 'totalSize' represents the on-disk size
           :
           : Using the fields correctly is important for compatibility
           : with other users of the HMS such as Hive and SparkSQL.
           : For example, SparkSQL relies on the 'totalSize' for
           : join ordering.
> Although this is very informative, I don't think I understand what this com
The title says "Use totalSize tblproperty instead of rawDataSize".

Added extra info in commit msg body.


http://gerrit.cloudera.org:8080/#/c/8110/1/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/8110/1/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@1178
PS1, Line 1178: droppedRawDataSize
> rename to droppedTotalSize to be consistent with the tblproperty being upda
Good catch. I missed that one.



--
To view, visit http://gerrit.cloudera.org:8080/8110
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If7c2c4e1e99b297c849f9f0d18b2bef34ad811c6
Gerrit-Change-Number: 8110
Gerrit-PatchSet: 1
Gerrit-Owner: Alex Behm <alex.b...@cloudera.com>
Gerrit-Reviewer: Alex Behm <alex.b...@cloudera.com>
Gerrit-Reviewer: Bharath Vissapragada <bhara...@cloudera.com>
Gerrit-Reviewer: Dimitris Tsirogiannis <dtsirogian...@cloudera.com>
Gerrit-Comment-Date: Thu, 21 Sep 2017 20:53:00 +0000
Gerrit-HasComments: Yes

Reply via email to