[
https://issues.apache.org/jira/browse/FLUME-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14116231#comment-14116231
]
gautham varada commented on FLUME-2450:
---------------------------------------
I tried the patch and it is definitely much faster. I had 45 GB of data parked
in the file channel; with the patch, Flume took about 25 minutes to sort itself
out and for the sinks to start pulling data from the channel. However, the Avro
source port and the JSON reporting port opened only after the sinks had started
pulling data from the channel.
> Improve replay index insertion speed.
> -------------------------------------
>
> Key: FLUME-2450
> URL: https://issues.apache.org/jira/browse/FLUME-2450
> Project: Flume
> Issue Type: Bug
> Reporter: Hari Shreedharan
> Assignee: Hari Shreedharan
> Fix For: v1.6.0
>
> Attachments: FLUME-2450.patch
>
>
> Insertion into the replay index can sometimes take a long time because we use
> a file-based index and a tree set. We should replace these with a
> memory-mapped DB and a hash set.
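
To illustrate the tree set versus hash set part of the description, here is a minimal sketch. It is not taken from FLUME-2450.patch or from Flume's replay code; the class name `ReplaySetSketch`, the helper methods, and the synthetic `long` keys are all hypothetical. It only shows that `java.util.TreeSet` (O(log n) per insert, keeps keys sorted) and `java.util.HashSet` (expected O(1) per insert, no ordering) end up holding the same elements, so the swap is reasonable when sorted iteration is not needed.

```java
import java.util.HashSet;
import java.util.Set;
import java.util.TreeSet;

public class ReplaySetSketch {

    // Insert `count` synthetic keys into the given set and return it.
    // The keys are placeholders, not real Flume event pointers.
    static Set<Long> fill(Set<Long> set, int count) {
        for (long i = 0; i < count; i++) {
            set.add(i * 31L);
        }
        return set;
    }

    // Rough wall-clock time for the inserts. This is not a rigorous
    // benchmark: there is no JIT warm-up and no repeated runs.
    static long timeInsertNanos(Set<Long> set, int count) {
        long start = System.nanoTime();
        fill(set, count);
        return System.nanoTime() - start;
    }

    public static void main(String[] args) {
        int count = 1_000_000;
        long treeNanos = timeInsertNanos(new TreeSet<>(), count);
        long hashNanos = timeInsertNanos(new HashSet<>(), count);
        System.out.println("TreeSet insert ns: " + treeNanos);
        System.out.println("HashSet insert ns: " + hashNanos);
    }
}
```

Actual timings depend on the JVM, heap size, and key distribution, so the numbers printed by this sketch should be treated as indicative only.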