[ 
https://issues.apache.org/jira/browse/ACCUMULO-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15121941#comment-15121941
 ] 

Christopher Tubbs commented on ACCUMULO-4119:
---------------------------------------------

bq. That does seem weird. I can't think of a reason why timestamps on the bulk 
files would influence our size estimation.

The original code basically did {{ if ((a = string.split(",").length) == 2) \{ 
tabletSize += a[0]; \} }}. It looks like it was just a mistake in avoiding 
ArrayIndexOutOfBoundsException, which only stood out when it was translated to 
its equivalent DataFileValue class usage.

> Consistently use DataFileValue to parse as UTF-8
> ------------------------------------------------
>
>                 Key: ACCUMULO-4119
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-4119
>             Project: Accumulo
>          Issue Type: Bug
>            Reporter: Christopher Tubbs
>            Assignee: Christopher Tubbs
>             Fix For: 1.6.5, 1.7.1, 1.8.0
>
>
> Noticed that we weren't always using DataFileValue to parse file values from 
> the metadata tables, and also that DataFileValue doesn't specify UTF-8 
> encoding explicitly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to