UNOFFICIAL
Hi,

I have a custom table balancer set on a particular table, and a cron job that 
creates splits for the next-days data, each day.  Normally it is all fine, but 
after some problems happened, I found that for certain days all the splits 
resided on a single tablet server - which then caused performance problems with 
ingest.  This was solved by temporarily taking the tablet server out of the 
cluster (stopping the Accumulo service not HDFS) and then (days) later putting 
it back.  This caused a re-assignment of the tablets and presumably triggered 
the table balancer as part of that.  This seemed like a very heavy-handed 
solution and brought about the question:

What is the recommended (least intrusive) way to trigger the table balancer in 
Accumulo for a known set of splits (tablets)?

Additional information: whilst the cluster is well balanced in terms of 
tablets-per-server, there is an imbalance in terms of entries (3-1 or 5-1 in 
some cases).  I noticed that the new (empty) pre-splits appeared to be on the 
server or servers with significantly less entries.

Thank you in advance.

Bruce.



Reply via email to