[ 
https://issues.apache.org/jira/browse/CHUKWA-75?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12694786#action_12694786
 ] 

Jerome Boulon commented on CHUKWA-75:
-------------------------------------

If we don't know which column we are processing, how can we be sure that a 
value >1000000000L is an overflow?
- Should we apply the overflow check only to a specific list of headers that 
we are aware of?

Also, can we have a validation that will not cause any exception in Demux, 
DumpRecord, or Pig? 
Having a null there will force everyone who wants to use this recordType to 
check for NULL (performance impact + null check at query time), 
so in the worst case it should be documented, but having a well-defined value 
in place of NULL would be better.
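
To make the two suggestions concrete, here is a rough sketch of what I have in mind. It is not taken from either attached patch. The class name, the header list ("rkB/s", "wkB/s") and the "-1" sentinel are placeholders I made up for illustration; only the 1000000000L threshold comes from the discussion above.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

// Hypothetical helper, not part of Chukwa: filters one parsed iostat row.
class IostatOverflowFilter {
    // Threshold quoted in the comment; whether it fits every column is the open question.
    static final long OVERFLOW_THRESHOLD = 1000000000L;

    // Assumed sentinel written in place of an overflowed value (instead of null).
    static final String SENTINEL = "-1";

    // Assumed list of headers known to overflow; other columns are left alone.
    static final Set<String> CHECKED_HEADERS =
        new HashSet<>(Arrays.asList("rkB/s", "wkB/s"));

    // Returns a copy of the row with overflowed values in checked columns replaced.
    static Map<String, String> filter(Map<String, String> row) {
        Map<String, String> out = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : row.entrySet()) {
            String value = e.getValue();
            if (CHECKED_HEADERS.contains(e.getKey()) && isOverflow(value)) {
                value = SENTINEL;
            }
            out.put(e.getKey(), value);
        }
        return out;
    }

    // Never throws: null or unparsable input is simply treated as "not an overflow".
    static boolean isOverflow(String raw) {
        if (raw == null) {
            return false;
        }
        try {
            return Double.parseDouble(raw) > OVERFLOW_THRESHOLD;
        } catch (NumberFormatException ex) {
            return false;
        }
    }
}
```

With something like this, a large value in a column outside the list passes through unchanged, and downstream consumers only need to know about one documented sentinel rather than checking for NULL.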


> iostat output overflow
> ----------------------
>
>                 Key: CHUKWA-75
>                 URL: https://issues.apache.org/jira/browse/CHUKWA-75
>             Project: Hadoop Chukwa
>          Issue Type: Bug
>          Components: Data Processors
>         Environment: Redhat 5.1, Java 6
>            Reporter: Eric Yang
>            Assignee: Eric Yang
>         Attachments: CHUKWA-75-1.patch, CHUKWA-75.patch
>
>
> On older systems, iostat metrics contain random overflow.  The plan is to 
> put in a filter to remove the overflow in the demux parser to improve the 
> data quality.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.