Re: Improving Accumulo Replication Latency

Adam J. Shook Wed, 15 Feb 2017 14:25:30 -0800

Thanks -- I'll reach out offline to get something set up.

On Wed, Feb 15, 2017 at 3:21 PM, Josh Elser <josh.el...@gmail.com> wrote:


> Gotcha. That's definitely the biggest factor that I was aware of. I wasn't
> sure if you knew more than I did by now ;). I can respect the implications
> of too much bookkeeping going on. That might really start pounding the
> metadata and replication tables.
>
> Happy to do lunch, also happy to just have a video call too if that's more
> convenient.
>
> Adam J. Shook wrote:
>
>> Thanks, Josh.  I think the main pain-point is that replication doesn't
>> occur until the WAL is closed.  We've made some aggressive configuration
>> changes to Accumulo to reduce the WAL time rollover and minor compaction
>> frequency to force replication to go faster.  It is down to around 20
>> minutes or so on our production clusters, but we are kind of at our
>> limit -- Accumulo is spending a lot more time doing bookkeeping tasks
>> and it is starting to affect our query performance.
>>
>> My initial thoughts are to increase the replication parallelism and
>> start replicating the WAL before it is closed (I see a few JIRAs open
>> already that mention these things), but I haven't done enough digging in
>> the code base to see what is really available.
>>
>> Are you free for a bit in the near future to meet up for a bit and talk
>> replication?  I'll buy lunch!
>>
>> Cheers,
>> --Adam
>>
>> On Wed, Feb 15, 2017 at 2:52 PM, Josh Elser <josh.el...@gmail.com
>> <mailto:josh.el...@gmail.com>> wrote:
>>
>>     Hi Adam,
>>
>>     I'm not presently working on anything (too many irons in other
>>     fires), but I'd be happy to help work through a design doc for
>>     improvements.
>>
>>     Do you have a list of pain-points which are the primary causes of
>>     latency? That would help in identifying the changes to make and how
>>     best to implement them.
>>
>>     - Josh
>>
>>
>>     Adam J. Shook wrote:
>>
>>         I'm currently scoping what it would take to improve the latency
>>         in the
>>         replication feature of Accumulo.  I'm interested in knowing what
>>         work,
>>         if any, is being done to improve replication latency?  If work
>>         is being
>>         done, would there be some interest in collaborating on that
>> effort?
>>
>>         If nothing is currently being planned, I'd be interested in
>>         design ideas
>>         and pointers from the community for improvements to the existing
>>         implementation.  We're looking to get replication down to less
>>         than five
>>         minutes and are willing to put in the effort to implement the
>>         improvements.
>>
>>         Thank you for your time!
>>
>>         Cheers,
>>         --Adam
>>
>>
>>

Re: Improving Accumulo Replication Latency

Reply via email to