[ 
https://issues.apache.org/jira/browse/HIVE-18894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zoltan Haindrich updated HIVE-18894:
------------------------------------
    Description: 
rawdatasize 5312 seems to be an underestimation...

afaik for orc the rawDataSize is estimated as the "online" datasize; for text 
tables it currently seems like its calculated as {{TOTAL_SIZE - ROW_NUM}} in 
some cases

  was:rawdatasize 5312 seems to be an underestimation...


> Statistics: rawDataSize seems to be underestimated for text tables
> ------------------------------------------------------------------
>
>                 Key: HIVE-18894
>                 URL: https://issues.apache.org/jira/browse/HIVE-18894
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Zoltan Haindrich
>            Priority: Major
>
> rawdatasize 5312 seems to be an underestimation...
> afaik for orc the rawDataSize is estimated as the "online" datasize; for text 
> tables it currently seems like its calculated as {{TOTAL_SIZE - ROW_NUM}} in 
> some cases



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to