Re: Discussion: large Flink state read from disk saturates disk I/O, and jobs interfere with each other

Wesley Peng Mon, 09 Sep 2019 23:40:47 -0700



on 2019/9/10 13:47, 蒋涛涛 wrote:

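What I have tried:

1. Manually migrating the jobs with high I/O to other machines. However, YARN places submitted jobs more or less at random, so this only works occasionally.

2. We have no SSDs at the moment, only ordinary SATA disks. We added two more disks to increase disk I/O capacity, but a single job still hits the I/O bottleneck of a single disk.

Are there any other strategies that could solve or mitigate this?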


It seems the tricks for improving RocksDB's throughput might be helpful.

With writes and reads accessing mostly the recent data, our goal is to let them stay in memory as much as possible without using up all the memory on the server. The following parameters are worth tuning:

Block cache size: When uncompressed blocks are read from SSTables, they are cached in memory. The amount of data that can be stored before eviction policies apply is determined by the block cache size. The bigger the better.

Write buffer size: How big a Memtable can get before it is frozen. Generally, the bigger the better. The tradeoff is that a big write buffer takes more memory and takes longer to flush to disk and to recover.

Write buffer number: How many Memtables to keep before flushing to SSTable. Generally, the bigger the better. Similarly, the tradeoff is that too many write buffers take up more memory and take longer to flush to disk.

Minimum write buffers to merge: If most recently written keys are frequently changed, it is better to only flush the latest version to SSTable. This parameter controls how many Memtables it will try to merge before flushing to SSTable. It should be less than the write buffer number. A suggested value is 2. If the number is too big, it takes longer to merge buffers and there is less chance of duplicate keys in that many buffers.
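To make the memory tradeoff above concrete, here is a rough sketch in Python that estimates how much memory a given combination of these four parameters could pin. The class and function names are hypothetical, and the model is a simplifying assumption: it supposes every state has its own block cache and its own set of Memtables, and it ignores index and filter blocks and other overhead, so treat the result as a ballpark figure rather than a precise one.

```python
from dataclasses import dataclass

MIB = 1024 * 1024


@dataclass(frozen=True)
class RocksDBMemorySettings:
    """Hypothetical container for the four parameters discussed above."""
    block_cache_bytes: int
    write_buffer_bytes: int
    write_buffer_number: int
    min_write_buffers_to_merge: int

    def validate(self) -> None:
        # "It should be less than the write buffer number."
        if self.write_buffer_number < 2:
            raise ValueError("write_buffer_number must be at least 2")
        if not 1 <= self.min_write_buffers_to_merge < self.write_buffer_number:
            raise ValueError(
                "min_write_buffers_to_merge must be >= 1 and "
                "less than write_buffer_number"
            )

    def memory_per_state_bytes(self) -> int:
        # Assumption: one block cache plus a full set of Memtables per state.
        memtables = self.write_buffer_bytes * self.write_buffer_number
        return self.block_cache_bytes + memtables


def total_memory_bytes(settings: RocksDBMemorySettings, num_states: int) -> int:
    """Rough upper bound for num_states states sharing one machine."""
    settings.validate()
    return settings.memory_per_state_bytes() * num_states


if __name__ == "__main__":
    s = RocksDBMemorySettings(
        block_cache_bytes=256 * MIB,
        write_buffer_bytes=64 * MIB,
        write_buffer_number=4,
        min_write_buffers_to_merge=2,
    )
    print(total_memory_bytes(s, num_states=3) // MIB, "MiB")
```

Comparing this figure against the memory actually available on each machine gives a quick check of whether a "bigger is better" setting is still affordable when several jobs share the same host.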

The list above is far from being exhaustive, but tuning them correctly can have a big impact on performance. Please refer to RocksDB's Tuning Guide for more details on these parameters. Figuring out the optimal combination of values for all of them is an art in itself.
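As a starting point for experiments, the small Python helper below renders the four parameters as flink-conf.yaml lines. The helper name and the example values are hypothetical. The option keys are, as far as I know, the ones Flink exposes for its RocksDB state backend in recent versions; please verify them against the documentation for your Flink version before relying on them.

```python
def to_flink_conf_lines(block_cache_mb, write_buffer_mb,
                        write_buffer_count, number_to_merge):
    """Render the four RocksDB parameters as flink-conf.yaml lines.

    The keys below are assumed to match Flink's RocksDB configurable
    options; check them against your Flink version.
    """
    if not 1 <= number_to_merge < write_buffer_count:
        raise ValueError("number_to_merge must be >= 1 and "
                         "less than write_buffer_count")
    return [
        f"state.backend.rocksdb.block.cache-size: {block_cache_mb}mb",
        f"state.backend.rocksdb.writebuffer.size: {write_buffer_mb}mb",
        f"state.backend.rocksdb.writebuffer.count: {write_buffer_count}",
        f"state.backend.rocksdb.writebuffer.number-to-merge: {number_to_merge}",
    ]


if __name__ == "__main__":
    # Example values only; they are not a recommendation.
    for line in to_flink_conf_lines(256, 64, 4, 2):
        print(line)
```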


Please refer to: https://klaviyo.tech/flinkperf-c7bd28acc67

regards.
