Will Berkeley created KUDU-2725:
-----------------------------------

             Summary: RollingDiskRowSetWriter create rowsets that are bigger 
than the target rowset size
                 Key: KUDU-2725
                 URL: https://issues.apache.org/jira/browse/KUDU-2725
             Project: Kudu
          Issue Type: Improvement
    Affects Versions: 1.9.0
            Reporter: Will Berkeley


The diskrowset writer create rowsets that are bigger than the target rowset 
size, with the excess proportional to the number of columns that compress 
poorly. For example, modifying loadgen to create a table with 280 columns and 
then using the {{--use_random}} flag, I saw rowsets that were in excess of 
80MB. This is a problem because the budget for compactions is 128MB, so rowsets 
that are that big can never participate in a compaction.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to