[jira] [Work logged] (HIVE-21391) LLAP: Pool of column vector buffers can cause memory pressure

ASF GitHub Bot (JIRA) Tue, 21 May 2019 15:21:41 -0700


     [ 
https://issues.apache.org/jira/browse/HIVE-21391?focusedWorklogId=246410&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-246410
 ]


ASF GitHub Bot logged work on HIVE-21391:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 21/May/19 22:20
            Start Date: 21/May/19 22:20
    Worklog Time Spent: 10m 
      Work Description: b-slim commented on pull request #641: HIVE-21391: Fix  
VRBsQueue size estimate
URL: https://github.com/apache/hive/pull/641
 
 
   Limit the queue using absolute min and max in addition to that make the 
queue length estimate related to actual size.
   As minor switched to ArrayBlockingQ to avoid extra GC.
   Add fn to drain the pool when done to help GC.
   Fixed some NPE issue.
   
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

            Worklog Id:     (was: 246410)
            Time Spent: 10m
    Remaining Estimate: 0h

> LLAP: Pool of column vector buffers can cause memory pressure
> -------------------------------------------------------------
>
>                 Key: HIVE-21391
>                 URL: https://issues.apache.org/jira/browse/HIVE-21391
>             Project: Hive
>          Issue Type: Bug
>          Components: llap
>    Affects Versions: 4.0.0, 3.2.0
>            Reporter: Prasanth Jayachandran
>            Assignee: Prasanth Jayachandran
>            Priority: Major
>              Labels: pull-request-available
>         Attachments: HIVE-21391.1.patch
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Where there are too many columns (in the order of 100s), with decimal, string 
> types the column vector pool of buffers created here 
> [https://github.com/apache/hive/blob/master/llap-server/src/java/org/apache/hadoop/hive/llap/io/decode/EncodedDataConsumer.java#L59]
>  can cause memory pressure. 
> Example:
> 128 (poolSize) * 300 (numCols) * 1024 (batchSize) * 80 (decimalSize) ~= 3GB
> The pool size keeps increasing when there is slow consumer but fast llap io 
> (SSDs) leading to GC pressure when all LLAP io threads read splits from same 
> table. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Work logged] (HIVE-21391) LLAP: Pool of column vector buffers can cause memory pressure

Reply via email to