[
https://issues.apache.org/jira/browse/DRILL-5294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15879676#comment-15879676
]
Paul Rogers commented on DRILL-5294:
------------------------------------
Unrelated to this bug, but the log shows some unnecessary overhead:
{code}
2017-02-22 16:49:36,484
[Partitioner-2751ce6d-67e6-ae08-3b68-e33b29f9d2a3:frag:2:0-170] DEBUG
o.a.drill.exec.vector.UInt4Vector - Reallocating vector
[$offsets$(UINT4:REQUIRED)]. # of bytes: [4096] -> [8192]
...
2017-02-22 16:49:36,501 [2751ce6d-67e6-ae08-3b68-e33b29f9d2a3:frag:3:9] DEBUG
o.a.drill.exec.vector.UInt4Vector - Reallocating vector
[$offsets$(UINT4:REQUIRED)]. # of bytes: [16384] -> [32768]
{code}
The resize to 8192 occurs about 22 times. The resize to 32768 occurs about 20+
times.
> Managed External Sort throws an OOM during the merge and spill phase
> --------------------------------------------------------------------
>
> Key: DRILL-5294
> URL: https://issues.apache.org/jira/browse/DRILL-5294
> Project: Apache Drill
> Issue Type: Bug
> Components: Execution - Relational Operators
> Reporter: Rahul Challapalli
> Assignee: Paul Rogers
> Fix For: 1.10.0
>
> Attachments: 2751ce6d-67e6-ae08-3b68-e33b29f9d2a3.sys.drill,
> drillbit.log
>
>
> commit # : 38f816a45924654efd085bf7f1da7d97a4a51e38
> The below query fails with managed sort while it succeeds on the old sort
> {code}
> select * from (select columns[433] col433, columns[0],
> columns[1],columns[2],columns[3],columns[4],columns[5],columns[6],columns[7],columns[8],columns[9],columns[10],columns[11]
> from dfs.`/drill/testdata/resource-manager/3500cols.tbl` order by
> columns[450],columns[330],columns[230],columns[220],columns[110],columns[90],columns[80],columns[70],columns[40],columns[10],columns[20],columns[30],columns[40],columns[50])
> d where d.col433 = 'sjka skjf';
> Error: RESOURCE ERROR: External Sort encountered an error while spilling to
> disk
> Fragment 1:11
> [Error Id: 0aa20284-cfcc-450f-89b3-645c280f33a4 on qa-node190.qa.lab:31010]
> (state=,code=0)
> {code}
> Env :
> {code}
> No of Drillbits : 1
> DRILL_MAX_DIRECT_MEMORY="32G"
> DRILL_MAX_HEAP="4G"
> {code}
> Attached the logs and profile. Data is too large for a jira
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)