I read the following in the Accumulo manual's section on tablet merging
<https://accumulo.apache.org/1.6/accumulo_user_manual.html#_merging_tablets>:



> Over time, a table can get very large, so large that it has hundreds of
> thousands of split points. Once there are enough tablets to spread a table
> across the entire cluster, additional splits may not improve performance,
> and may create unnecessary bookkeeping.
>

So would the optimal number of tablets for a very large table be close to
the number of tservers times the number of cores per machine (or the number
of worker threads each tserver is configured to use, whichever is less)?
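
To make the question concrete, here is a rough sketch of the arithmetic I have in mind. The function name and the example numbers are made up for illustration; none of this comes from the Accumulo manual, and it is only the heuristic I am asking about, not a confirmed rule.

```python
def estimated_tablet_target(num_tservers, cores_per_machine, worker_threads_per_tserver):
    """Hypothetical heuristic from the question above: roughly one tablet
    per usable execution slot across the whole cluster."""
    # A tserver cannot use more slots than it has cores or worker threads,
    # so take whichever is less.
    slots_per_tserver = min(cores_per_machine, worker_threads_per_tserver)
    return num_tservers * slots_per_tserver


# Illustrative numbers only: 20 tservers, 16 cores each, 8 worker threads each.
print(estimated_tablet_target(20, 16, 8))  # prints 160
```

With those example numbers the worker-thread setting is the limiting factor, so the guess would be 160 tablets rather than 320.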
