[
https://issues.apache.org/jira/browse/HBASE-8034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13603695#comment-13603695
]
Matteo Bertozzi commented on HBASE-8034:
----------------------------------------
HFile block headers, reading DATA_BLOCKS_FRACTION_ESTIMATE it seems that you're
just calculating the "data blocks" without all the index, block filters, block
headers...
I see an information like the number of keys (as Nick suggested) more useful,
than having another full file size that we can get from the file status... but
maybe there's something I'm missing.
> record on-disk data size for store file and make it available during writing
> ----------------------------------------------------------------------------
>
> Key: HBASE-8034
> URL: https://issues.apache.org/jira/browse/HBASE-8034
> Project: HBase
> Issue Type: Task
> Reporter: Sergey Shelukhin
> Assignee: Sergey Shelukhin
> Priority: Minor
> Attachments: HBASE-8034-v0.patch, HBASE-8034-v1.patch,
> HBASE-8034-v2.patch
>
>
> To better estimate the size of data in the file, and to be able to split
> files intelligently during any multi-file compactor like stripe or level.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira