[
https://issues.apache.org/jira/browse/CRUNCH-230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Josh Wills updated CRUNCH-230:
------------------------------
Attachment: CRUNCH-230.patch
The patch.
> Attempt to estimate HBase table sizes when we're given a trivial Scan object
> ----------------------------------------------------------------------------
>
> Key: CRUNCH-230
> URL: https://issues.apache.org/jira/browse/CRUNCH-230
> Project: Crunch
> Issue Type: Improvement
> Components: IO
> Affects Versions: 0.6.0
> Reporter: Josh Wills
> Priority: Minor
> Attachments: CRUNCH-230.patch
>
>
> If we're asked to do a scan of an entire HBase table, we can actually do a
> pretty good job of estimating how large it is by looking up its directory in
> HDFS. This patch checks the input scan given to the HBaseSourceTarget, and if
> it doesn't specify any filters, looks up the size of the input table on HDFS.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira