[ https://issues.apache.org/jira/browse/ACCUMULO-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15121941#comment-15121941 ]
Christopher Tubbs edited comment on ACCUMULO-4119 at 1/28/16 5:37 PM:
----------------------------------------------------------------------
bq. That does seem weird. I can't think of a reason why timestamps on the bulk
files would influence our size estimation.
The original code basically did
{code}if ((a = string.split(",")).length == 2) { tabletSize += Long.parseLong(a[0]); }{code}
The length check looks like a mistaken attempt to avoid an
ArrayIndexOutOfBoundsException: it also skips any value that has more than two
fields, such as one that carries a timestamp. The mistake only stood out once
the code was translated to its equivalent DataFileValue class usage.
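
To make the effect of that length check concrete, here is a minimal sketch. It is not the actual Accumulo code: the class and method names are hypothetical, and it assumes a file value is the UTF-8 text "size,numEntries" with an optional third ",time" field.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical illustration only; these names do not exist in Accumulo.
class FileValueSizeSketch {

    // Mirrors the check described above: a value is counted only when it
    // has exactly two comma-separated fields, so a value with a third
    // (timestamp) field silently contributes nothing to the size.
    static long sizeWithStrictCheck(byte[] value) {
        String[] a = new String(value, StandardCharsets.UTF_8).split(",");
        if (a.length == 2) {
            return Long.parseLong(a[0]);
        }
        return 0L;
    }

    // One possible alternative (an assumption, not the committed fix):
    // accept two or three fields and reject anything else loudly instead
    // of skipping it.
    static long sizeWithRangeCheck(byte[] value) {
        String[] a = new String(value, StandardCharsets.UTF_8).split(",");
        if (a.length < 2 || a.length > 3) {
            throw new IllegalArgumentException(
                "unexpected file value with " + a.length + " field(s)");
        }
        return Long.parseLong(a[0]);
    }
}
```

Under these assumptions, a two-field value is counted by both methods, while a three-field value is counted only by the second one. That is the kind of discrepancy that would make timestamps appear to influence the size estimate.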
was (Author: ctubbsii):
bq. That does seem weird. I can't think of a reason why timestamps on the bulk
files would influence our size estimation.
The original code basically did {{ if ((a = string.split(",").length) == 2) \{
tabletSize += a[0]; \} }}. It looks like it was just a mistake in avoiding
ArrayIndexOutOfBoundsException, which only stood out when it was translated to
its equivalent DataFileValue class usage.
> Consistently use DataFileValue to parse as UTF-8
> ------------------------------------------------
>
> Key: ACCUMULO-4119
> URL: https://issues.apache.org/jira/browse/ACCUMULO-4119
> Project: Accumulo
> Issue Type: Bug
> Reporter: Christopher Tubbs
> Assignee: Christopher Tubbs
> Fix For: 1.6.5, 1.7.1, 1.8.0
>
>
> Noticed that we weren't always using DataFileValue to parse file values from
> the metadata tables, and also that DataFileValue doesn't specify UTF-8
> encoding explicitly.