[ 
https://issues.apache.org/jira/browse/DRILL-6211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kunal Khatua updated DRILL-6211:
--------------------------------
    Attachment: 255d2880-48a2-d86b-5410-29ce0cd249ed.sys.drill
                255d26ae-2c0b-6cd6-ae71-4ad04c992daf.sys.drill
                255d2682-8481-bed0-fc22-197a75371c04.sys.drill
                255d2664-2418-19e0-00ea-2076a06572a2.sys.drill
                255d264c-f55e-b343-0bef-49d3e672d93f.sys.drill

> Optimizations for SelectionVectorRemover 
> -----------------------------------------
>
>                 Key: DRILL-6211
>                 URL: https://issues.apache.org/jira/browse/DRILL-6211
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Codegen
>            Reporter: Kunal Khatua
>            Assignee: Karthikeyan Manivannan
>            Priority: Major
>             Fix For: 1.14.0
>
>         Attachments: 255d264c-f55e-b343-0bef-49d3e672d93f.sys.drill, 
> 255d2664-2418-19e0-00ea-2076a06572a2.sys.drill, 
> 255d2682-8481-bed0-fc22-197a75371c04.sys.drill, 
> 255d26ae-2c0b-6cd6-ae71-4ad04c992daf.sys.drill, 
> 255d2880-48a2-d86b-5410-29ce0cd249ed.sys.drill
>
>
> Currently, when a SelectionVectorRemover receives a record batch from an 
> upstream operator (like a Filter), it immediately starts copying records 
> into a new outgoing batch.
> It would be worthwhile to enrich the RecordBatch with some additional 
> summary statistics about the attached SelectionVector, such as 
> # number of records that need to be removed/copied
> # total number of records in the record-batch
> The benefit is that in the extreme cases, where *all* the records in a 
> batch need to be either removed or copied, the SelectionVectorRemover can 
> simply drop the record-batch or forward it unchanged to the next downstream 
> operator, respectively.
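
The short-circuit proposed above can be sketched roughly as follows. This is only an illustration under stated assumptions, not Drill's actual implementation: the class SelectionVectorSketch, the Action enum, and the decide/remove helpers are hypothetical names, a batch is modelled as a plain int[] and the selection vector as an int[] of indices, and the selection is assumed to hold unique indices in ascending order (so that selecting every record is the identity).

```java
// Hypothetical sketch; none of these names exist in Drill's codebase.
final class SelectionVectorSketch {

    // What the remover would do with an incoming batch.
    enum Action { DROP, FORWARD, COPY }

    // Decide from the two summary statistics alone, without touching the data.
    static Action decide(int selectedCount, int totalCount) {
        if (selectedCount < 0 || selectedCount > totalCount) {
            throw new IllegalArgumentException("selectedCount out of range");
        }
        if (selectedCount == 0) {
            return Action.DROP;      // nothing survives: drop the batch
        }
        if (selectedCount == totalCount) {
            return Action.FORWARD;   // everything survives: pass it through
        }
        return Action.COPY;          // mixed case: copy selected records
    }

    // Assumption: selection holds unique, ascending indices into values.
    static int[] remove(int[] values, int[] selection) {
        switch (decide(selection.length, values.length)) {
            case DROP:
                return new int[0];
            case FORWARD:
                return values;       // same array, no copy performed
            default:
                int[] out = new int[selection.length];
                for (int i = 0; i < selection.length; i++) {
                    out[i] = values[selection[i]];
                }
                return out;
        }
    }
}
```

In this sketch only the mixed case pays for a copy; the two extreme cases are resolved from the counts alone.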



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
