So there is not e.g. a single thread responsible per directory in log.dirs that 
could become a bottleneck relative to SSD throughput of GB/s?

This is in fact the case for Apache Pulsar, and the openmessaging benchmark 
uses 4 directories on the same SSD to increase throughput.

________________________________
差出人: Peter Bukowinski <pmb...@gmail.com>
送信日時: 2020年3月12日 8:51
宛先: users@kafka.apache.org <users@kafka.apache.org>
件名: Re: log.dirs and SSDs

> On Mar 11, 2020, at 4:28 PM, Eugen Dueck <eu...@tworks.co.jp> wrote:
>
> So log.dirs should contain only one entry per HDD disk, to avoid random seeks.
> What about SSDs? Can throughput be increased by specifying multiple 
> directories on the same SSD?


Given a constant number of partitions, I don’t see any advantage to splitting 
partitions among multiple log directories vs. keeping them all in one (per 
disk). You’d still have the same total number of topic-partition directories 
and the same number of topic-partition leaders.

If you want to increase throughput, focus on using the appropriate number of 
partitions.

—
Peter Bukowinski

Reply via email to