[ 
https://issues.apache.org/jira/browse/ORC-128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15829440#comment-15829440
 ] 

ASF GitHub Bot commented on ORC-128:
------------------------------------

Github user prasanthj commented on the issue:

    https://github.com/apache/orc/pull/78
  
    @dain Hive already uses the stats API (on both the reader and writer side) to get 
basic statistics such as numRows and rawDataSize from the footer, which avoids 
row-by-row stats gathering. This new API extends the same approach to column 
statistics (although ORC does not yet track NDV, the number of distinct values). 


> Add capability to get column statistics during writing
> ------------------------------------------------------
>
>                 Key: ORC-128
>                 URL: https://issues.apache.org/jira/browse/ORC-128
>             Project: Orc
>          Issue Type: New Feature
>          Components: Java
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>             Fix For: 1.3.0
>
>
> It would be useful if users could get the column statistics as the file is 
> being written.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
