[ 
https://issues.apache.org/jira/browse/FLUME-2360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13965675#comment-13965675
 ] 

Alexandre Dutra commented on FLUME-2360:
----------------------------------------

This is a duplicate of FLUME-2215. See my patches for possible fixes for this 
problem.

> Spooling Source: LineDeserializer incorrectly stops reading events if UTF-8 
> char occurs at end of buffer
> --------------------------------------------------------------------------------------------------------
>
>                 Key: FLUME-2360
>                 URL: https://issues.apache.org/jira/browse/FLUME-2360
>             Project: Flume
>          Issue Type: Bug
>          Components: Sinks+Sources
>    Affects Versions: v1.4.0
>         Environment: CDH 4.4
>            Reporter: Keith Wright
>         Attachments: FBX_BID.80.log.2014-04-10-00-47.log
>
>
> We are using a Spooling source to read from files and have noticed that there 
> are situations where the source will stop reading in the middle of a file 
> (see attached example file).  After much debugging, I have determine that 
> this occurs if the buffer in ResettableFileInputStream happens to fill when 
> in the middle of handling a UTF-8 character.  In this case readChar() will 
> return -1 which ends the read lines loop in LineSerializer.  Running the 
> spool source with the attached file should result in > 5400 events in the 
> channel but now it will only return ~3300.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to