[ 
https://issues.apache.org/jira/browse/DRILL-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Paul Rogers reassigned DRILL-5021:
----------------------------------

    Assignee:     (was: Paul Rogers)

> ExternalSortBatch redundantly redefines the batch schema
> --------------------------------------------------------
>
>                 Key: DRILL-5021
>                 URL: https://issues.apache.org/jira/browse/DRILL-5021
>             Project: Apache Drill
>          Issue Type: Bug
>    Affects Versions: 1.8.0
>            Reporter: Paul Rogers
>            Priority: Minor
>
> Much code in the {{ExternalSortBatch}} (ESB) deals with building vector 
> batches and schemas. However, ESB cannot handle schema changes. The only 
> valid schema difference is the same field path in a different position in the 
> vector array. Given this restriction, the code can be simplified (and sped 
> up) by exploiting the fact that all batches are required to have the same 
> conceptual schema (same set of fields, but perhaps in different vector order) 
> and most probably, the same physical schema (same fields and same vector 
> order.) Note that, because of the way that the {{getValueVectorId()}} method 
> works, each lookup of a value vector is an O\(n) operation, so that each 
> remapping of vectors is O(n\^2).



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to