[ https://issues.apache.org/jira/browse/FLUME-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14117824#comment-14117824 ]
Brock Noland commented on FLUME-2450:
-------------------------------------
bq. I had 45gigs of data parked in the file channel , with the patch flume
took about 25 mins to figure itself out
Could you share how many events were in the queue? Also, was that for a full
replay? Are you using backup checkpoints?
bq. The frustration right now for us is that our flume nodes are basically
'down' until this recovery completes.
Are your nodes performing a full recovery often? Are you using backup
checkpoints? Unless both the checkpoint and the backup checkpoint are gone, a
replay should be quite fast.
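For reference, a minimal sketch of a file channel with backup checkpoints
enabled. The agent name a1, the channel name c1, and all directory paths are
placeholders and are not taken from this issue; useDualCheckpoints and
backupCheckpointDir are the file channel properties described in the Flume
User Guide.

```properties
# Hypothetical agent "a1" with a file channel "c1"; paths are examples only.
a1.channels = c1
a1.channels.c1.type = file
a1.channels.c1.checkpointDir = /var/flume/checkpoint
a1.channels.c1.dataDirs = /var/flume/data
# Keep a second copy of the checkpoint so that a damaged primary checkpoint
# does not by itself force a full replay of the data files.
a1.channels.c1.useDualCheckpoints = true
# Required when useDualCheckpoints is true; must differ from checkpointDir
# and from the data directories.
a1.channels.c1.backupCheckpointDir = /var/flume/checkpoint-backup
```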
bq. Make a new config option to run the version that requires extending the
amount of JVM memory
This actually would not improve recovery much.
> Improve replay index insertion speed.
> -------------------------------------
>
> Key: FLUME-2450
> URL: https://issues.apache.org/jira/browse/FLUME-2450
> Project: Flume
> Issue Type: Bug
> Reporter: Hari Shreedharan
> Assignee: Hari Shreedharan
> Fix For: v1.6.0
>
> Attachments: FLUME-2450.patch
>
>
> Insertion into the replay index can sometimes take a long time because we use
> a file-based index and a tree set. We should switch this out for a
> memory-mapped DB and a hash set.