Will Berkeley created KUDU-2570:
-----------------------------------

             Summary: Initialize rowset iterators lazily
                 Key: KUDU-2570
                 URL: https://issues.apache.org/jira/browse/KUDU-2570
             Project: Kudu
          Issue Type: Improvement
          Components: tablet
    Affects Versions: 1.7.1
            Reporter: Will Berkeley


When a scan inits a tablet iterator, the tablet iterator inits iterator for 
each rowset that is deemed relevant. When there's a lot of rowsets (usually 
because of some missing feature like KUDU-1400 or a pathological partition 
schema or configuration), this can take a long time, leading to scan timeouts 
like
{noformat}
WARNINGS: Unable to open scanner: Timed out: Scan RPC to 10.1.11.187:7050 timed 
out after 169.988s (SENT){noformat}
For non-fault-tolerant scans, it seems like we should be able to init the 
rowset when we first go to retrieve rows from, and thereby amortize all the 
seeks to open rowsets across many ScanRequest RPC round trips.

For fault-tolerant scans, things are more complicated, but still it should be 
possible to be lazier about initting rowset iterators.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to