Andrzej Bialecki  created SOLR-12941:
----------------------------------------

             Summary: IndexSizeTrigger and splitMethod=link problems
                 Key: SOLR-12941
                 URL: https://issues.apache.org/jira/browse/SOLR-12941
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
    Affects Versions: 7.6, master (8.0)
            Reporter: Andrzej Bialecki 
            Assignee: Andrzej Bialecki 


{{IndexSizeTrigger}} can be configured to use {{splitMethod=link}} 
(SOLR-12730), which uses hard-linking for creating sub-shards.

However, if the trigger uses {{aboveBytes}} condition the resulting sub-shards 
will not immediately decrease in size, until all of the deleted documents will 
be expunged (either by gradual merges or by explicit and costly expungeDeletes 
command). As a result the new sub-shards will still exceed the {{aboveBytes}} 
threshold, which will cause the trigger to keep generating new split requests.

I see two options how to solve this:
 * disallow using {{aboveBytes}} with {{splitMethod=link}}. This unfortunately 
is a very desirable combination because it monitors the actual index size and 
uses the fast splitting method.
 * calculate an internal estimate of "eventual index size" for an index with 
deletions, and use this estimate when checking with {{aboveBytes}} instead of 
the real index size. This of course introduces a potentially significant 
estimation error but allows to properly treat hard-linked sub-shards with 
deletions as (eventually) significantly smaller than the parent shard.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to