[
https://issues.apache.org/jira/browse/FLUME-2277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13866911#comment-13866911
]
Hari Shreedharan commented on FLUME-2277:
-----------------------------------------
One of the things I have noticed is that having multiple data directories (a
handful not thousands) even if they are on the same disk helps, since Flume
serializes the operations to a single disk even when there are no fsyncs.
Unfortunately, there is no real way to work around this (since we decide
whether to roll and cache the offset at which we wrote), but most disks can
handle multiple files being written to at the same time and can fsync them with
reasonable latency - so having multiple data disks helps
> Improve FileChannel documentation to address commons support issues
> -------------------------------------------------------------------
>
> Key: FLUME-2277
> URL: https://issues.apache.org/jira/browse/FLUME-2277
> Project: Flume
> Issue Type: Task
> Reporter: Brock Noland
> Assignee: Brock Noland
> Attachments: FLUME-2277.patch
>
>
> Often users configure too small of batch size with File Channel, use sources
> such as Exec source which generate small batches, or do not configure
> multiple disks.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)