[ 
https://issues.apache.org/jira/browse/DRILL-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16384736#comment-16384736
 ] 

ASF GitHub Bot commented on DRILL-6126:
---------------------------------------

Github user ppadma commented on the issue:

    https://github.com/apache/drill/pull/1125
  
    @paul-rogers Updated the PR with the latest changes. I decided not to use 
the vector initializer for allocation, since it is subject to the alias issues 
you mentioned. Instead, I added an allocate vector method to columnSize in the 
batch sizer. It uses the sizing information columnSize already holds to 
allocate memory for the vector (including all its children) for a particular 
record count. 
    I also refactored the batch sizer code and added unit tests that verify 
sizing and allocation for different vector types. 
    Please take a look when you get a chance and let me know what you think.



> Allocate memory for value vectors upfront in flatten operator
> -------------------------------------------------------------
>
>                 Key: DRILL-6126
>                 URL: https://issues.apache.org/jira/browse/DRILL-6126
>             Project: Apache Drill
>          Issue Type: Improvement
>            Reporter: Padma Penumarthy
>            Assignee: Padma Penumarthy
>            Priority: Critical
>             Fix For: 1.12.0
>
>
> With the recent changes to control batch size for the flatten operator, we 
> compute the row count of the output batch based on memory. Since we know how 
> many rows the batch will contain, we can also allocate the memory needed 
> upfront, instead of starting with the initial value (4096) and doubling and 
> copying every time we need more. 
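
To make the cost described in the issue concrete, here is a minimal, self-contained Java sketch. It is not Drill code: the class and method names are hypothetical, and it assumes the "initial value (4096)" is a capacity measured in values and that the buffer is full each time it grows. It only counts how many reallocations and copied values a start-and-double strategy incurs for a known row count, which is the work an upfront allocation would avoid.

```java
public class UpfrontAllocationSketch {

    // Assumption: the initial value quoted in the issue, treated as a value count.
    static final long INITIAL_CAPACITY = 4096;

    // Number of times the buffer must be doubled before rowCount values fit.
    static int reallocationsWhenDoubling(long rowCount) {
        long capacity = INITIAL_CAPACITY;
        int reallocations = 0;
        while (capacity < rowCount) {
            capacity *= 2;
            reallocations++;
        }
        return reallocations;
    }

    // Total values copied across all doublings, assuming the buffer is
    // full (holds 'capacity' values) each time it is reallocated.
    static long valuesCopiedWhenDoubling(long rowCount) {
        long capacity = INITIAL_CAPACITY;
        long copied = 0;
        while (capacity < rowCount) {
            copied += capacity;
            capacity *= 2;
        }
        return copied;
    }

    public static void main(String[] args) {
        long rowCount = 50_000; // illustrative row count, not taken from the issue
        System.out.println("reallocations: " + reallocationsWhenDoubling(rowCount));
        System.out.println("values copied: " + valuesCopiedWhenDoubling(rowCount));
    }
}
```

Under these assumptions, a batch of 50,000 rows needs 4 doublings (4096 to 65536) and copies 61,440 values along the way, whereas sizing the buffer for the known row count needs a single allocation and no copies.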



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
