Also, when they say tablets - I assume this is before replication? so in reality, it is number of nodes x cpu cores / replication factor? If this is the case, it is not looking good...
On Wed, Oct 10, 2018 at 5:02 PM Boris Tyukin <[email protected]> wrote: > Hi all, > > can someone clarify if this recommendation below - does it mean physical > or hyper-threaded CPU cores? quite a big difference... > Thanks, > Boris > > Partitioning Guidelines (https://kudu.apache.org/docs/ > kudu_impala_integration.html#partitioning_rules_of_thumb) > - For large tables, such as fact tables, aim for as many tablets as you > have cores in the cluster. > - For small tables, such as dimension tables, aim for a large enough > number of tablets that each tablet is at least 1 GB in size. > > In general, be mindful the number of tablets limits the parallelism of > reads, in the current implementation. Increasing the number of tablets > significantly beyond the number of cores is likely to have diminishing > returns. > >
