[jira] [Commented] (FLUME-2450) Improve replay index insertion speed.

gautham varada (JIRA) Fri, 29 Aug 2014 21:45:04 -0700

    [ 
https://issues.apache.org/jira/browse/FLUME-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14116231#comment-14116231
 ]


gautham varada commented on FLUME-2450:
---------------------------------------

I tried the patch it's definitely much faster , I had  45gigs of data parked in 
the file channel , with the patch flume took about 25 mins to figure itself out 
and the sinks to start pulling the data from the channel. However the ports 
avro source port and the Json reporting port opened only after the sinks 
started to pull the data from the channel.

Sent from my iPhone



> Improve replay index insertion speed.
> -------------------------------------
>
>                 Key: FLUME-2450
>                 URL: https://issues.apache.org/jira/browse/FLUME-2450
>             Project: Flume
>          Issue Type: Bug
>            Reporter: Hari Shreedharan
>            Assignee: Hari Shreedharan
>             Fix For: v1.6.0
>
>         Attachments: FLUME-2450.patch
>
>
> Insertion into the replay index can take long sometimes because we use a file 
> based index and tree set. We should switch this out for a memory mapped db 
> and a hash set.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (FLUME-2450) Improve replay index insertion speed.

Reply via email to