[
https://issues.apache.org/jira/browse/FLUME-2360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13965675#comment-13965675
]
Alexandre Dutra commented on FLUME-2360:
----------------------------------------
This is a duplicate of FLUME-2215. See my patches for possible fixes for this
problem.
> Spooling Source: LineDeserializer incorrectly stops reading events if UTF-8
> char occurs at end of buffer
> --------------------------------------------------------------------------------------------------------
>
> Key: FLUME-2360
> URL: https://issues.apache.org/jira/browse/FLUME-2360
> Project: Flume
> Issue Type: Bug
> Components: Sinks+Sources
> Affects Versions: v1.4.0
> Environment: CDH 4.4
> Reporter: Keith Wright
> Attachments: FBX_BID.80.log.2014-04-10-00-47.log
>
>
> We are using a Spooling source to read from files and have noticed that there
> are situations where the source will stop reading in the middle of a file
> (see attached example file). After much debugging, I have determine that
> this occurs if the buffer in ResettableFileInputStream happens to fill when
> in the middle of handling a UTF-8 character. In this case readChar() will
> return -1 which ends the read lines loop in LineSerializer. Running the
> spool source with the attached file should result in > 5400 events in the
> channel but now it will only return ~3300.
--
This message was sent by Atlassian JIRA
(v6.2#6252)