[
https://issues.apache.org/jira/browse/ORC-128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15829440#comment-15829440
]
ASF GitHub Bot commented on ORC-128:
------------------------------------
Github user prasanthj commented on the issue:
https://github.com/apache/orc/pull/78
@dain Hive already uses stats API (reader side and writer side) to get
basic statistics like (numRows, rawDataSize, etc.) from the footer to avoid
row-by-row stats gathering. This new API is to extend the same for column
statistics (although ORC is missing NDV at this point).
> Add capability to get column statistics during writing
> ------------------------------------------------------
>
> Key: ORC-128
> URL: https://issues.apache.org/jira/browse/ORC-128
> Project: Orc
> Issue Type: New Feature
> Components: Java
> Reporter: Owen O'Malley
> Assignee: Owen O'Malley
> Fix For: 1.3.0
>
>
> It would be useful if users could get the column statistics as the file is
> being written.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)