Will Berkeley created KUDU-2318:
-----------------------------------

             Summary: Kudu flushes aggressively even if it can't relieve memory 
pressure.
                 Key: KUDU-2318
                 URL: https://issues.apache.org/jira/browse/KUDU-2318
             Project: Kudu
          Issue Type: Bug
    Affects Versions: 1.6.0
            Reporter: Will Berkeley
         Attachments: rowsetblackbarofdeath.png

Kudu starts flushing aggressively when memory usage exceeds 60% of the hard 
limit. In pathological cases, this can cause Kudu to flush an extremely large 
amount of small rowsets.

For example, if -block_cache_capacity_mb is set to over 60% of the hard limit, 
and the block cache fills, then Kudu will always be under memory pressure. 
Flushing won't ever be able to reduce memory usage under the aggressive 
flushing threshold. However, Kudu will still flush, producing lots of tiny 
rowsets. This can eventually cause problems like KUDU-2317.

Attached is the rowset diagram from a tablet showing this phenomenon. To 
produce this I
1. Ran the tablet servers with -memory_limit_hard_bytes=1048576000 (1GiB) and 
-block_cache_capacity_mb=750.
2. Started an insert workload (single-threaded so insert would be sequential): 
build/latest/bin/kudu perf loadgen -keep_auto_table -num_threads=1 
-num_rows_per_thread=100000000.
3. Waited a bit for tablet servers to have a gigabyte or two of data.
4. Ran a ksck checksum set to cache blocks: build/latest/bin/kudu cluster ksck 
-checksum_scan -checksum_cache_blocks.





--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to