Jacques Nadeau created DRILL-3907:
-------------------------------------

             Summary: Enable ScanBatch to provide a merge on pre-sorted readers
                 Key: DRILL-3907
                 URL: https://issues.apache.org/jira/browse/DRILL-3907
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Other
            Reporter: Jacques Nadeau
             Fix For: 1.3.0


In some situations, individual record readers will be presorted by a key. Given 
Drill's approach to parallelization, it is possible that a Single ScanBatch 
will be interacting with multiple readers. If we want to maintain the collation 
of the underlying data, we need to Drill to do a n-way merge on the streams as 
they are read into Drill. This functionality already exists in the 
MergingReceiver. 

This JIRA is to refactor merging receiver so that the underlying N-Way merge of 
batches can be used in other locations. We then need to decide whether to 
incorporate it directly into the ScanBatch (when needed) or to do something 
external. We also need to resolve how we decide whether the collation that 
could be provided by utilizing an n-way merge is necessary (to avoid paying the 
cost of maintaining an unused collation).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to