[jira] [Commented] (HIVE-21329) Custom Tez runtime unordered output buffer size depending on operator pipeline

Gopal V (JIRA) Tue, 26 Feb 2019 22:52:21 -0800


    [ 
https://issues.apache.org/jira/browse/HIVE-21329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16778975#comment-16778975
 ]


Gopal V commented on HIVE-21329:
--------------------------------

LGTM - +1

{code}
2019-02-27 01:35:38,588 [INFO] [TezChild] 
|writers.UnorderedPartitionedKVWriter|: Reducer_2: numBuffers=2, 
sizePerBuffer=524288, skipBuffers=true, numPartitions=1, 
availableMemory=1048576, maxSingleBufferSizeBytes=2147483647, 
pipelinedShuffle=false, isFinalMergeEnabled=false, numPartitions=1, 
reportPartitionStats=ENABLED
2019-02-27 01:35:38,592 [INFO] [TezChild] |tez.MapRecordProcessor|: Main input 
name is Map 1
{code}

The allocation does show up in the timings in quick tests.

{code}
0: jdbc:hive2://localhost:10013/> set 
hive.tez.unordered.output.buffer.size.mb=1024;
0: jdbc:hive2://localhost:10013/> select count(1) from store_sales order by 1;
INFO  : Completed executing 
command(queryId=gopal_20190227015023_75eeb0ec-edd2-4248-8122-a3147c0bb502); 
Time taken: 5.436 seconds
{code}

with -1

{code}
INFO  : Completed executing 
command(queryId=gopal_20190227015049_2068d1e9-b9ac-463a-8bdb-738966a480a7); 
Time taken: 3.645 seconds
{code}

> Custom Tez runtime unordered output buffer size depending on operator pipeline
> ------------------------------------------------------------------------------
>
>                 Key: HIVE-21329
>                 URL: https://issues.apache.org/jira/browse/HIVE-21329
>             Project: Hive
>          Issue Type: Improvement
>          Components: Tez
>            Reporter: Jesus Camacho Rodriguez
>            Assignee: Jesus Camacho Rodriguez
>            Priority: Major
>         Attachments: HIVE-21329.patch
>
>
> For instance, if we have a reduce sink operator with no keys followed by a 
> Group By (merge partial), we can decrease the output buffer size since we 
> will only produce a single row.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (HIVE-21329) Custom Tez runtime unordered output buffer size depending on operator pipeline

Reply via email to