[
https://issues.apache.org/jira/browse/DRILL-5478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16167463#comment-16167463
]
Robert Hou commented on DRILL-5478:
-----------------------------------
How is the config option set? I checked the sys.boot values:
{noformat}
0: jdbc:drill:drillbit=10.10.100.190> select * from sys.boot where name like
'%spill%';
+--------------------------------------------------+---------+-------+---------+----------+------------------------------------------------------------------------------------------------------------------------+-----------+------------+
| name | kind | type | status
| num_val | string_val
| bool_val | float_val |
+--------------------------------------------------+---------+-------+---------+----------+------------------------------------------------------------------------------------------------------------------------+-----------+------------+
| drill.exec.hashagg.spill.directories | STRING | BOOT | BOOT
| null | [
#
jar:file:/opt/drill/jars/drill-java-exec-1.12.0-SNAPSHOT.jar!/drill-module.conf:
228
"/tmp/drill/spill"
] | null | null |
| drill.exec.hashagg.spill.fs | STRING | BOOT | BOOT
| null | "file:///"
| null | null |
| drill.exec.sort.external.spill.directories | STRING | BOOT | BOOT
| null | [
# drill-override.conf: 27
"/tmp/drill"
] | null
| null |
| drill.exec.sort.external.spill.file_size | STRING | BOOT | BOOT
| null | "256M"
| null | null |
| drill.exec.sort.external.spill.fs | STRING | BOOT | BOOT
| null | "maprfs:///"
| null | null |
| drill.exec.sort.external.spill.group.size | LONG | BOOT | BOOT
| 40000 | null
| null | null |
| drill.exec.sort.external.spill.merge_batch_size | STRING | BOOT | BOOT
| null | "16M"
| null | null |
| drill.exec.sort.external.spill.spill_batch_size | STRING | BOOT | BOOT
| null | "1M"
| null | null |
| drill.exec.sort.external.spill.threshold | LONG | BOOT | BOOT
| 40000 | null
| null | null |
| drill.exec.spill.directories | STRING | BOOT | BOOT
| null | [
#
jar:file:/opt/drill/jars/drill-java-exec-1.12.0-SNAPSHOT.jar!/drill-module.conf:
228
"/tmp/drill/spill"
] | null | null |
| drill.exec.spill.fs | STRING | BOOT | BOOT
| null | "file:///"
| null | null |
+--------------------------------------------------+---------+-------+---------+----------+------------------------------------------------------------------------------------------------------------------------+-----------+------------+
{noformat}
And I see spill files that are about 38 MB in size:
{noformat}
-rwxr-xr-x 3 root root 38067297 2017-09-15 00:22
/tmp/drill/qa-node190.qa.lab-31010_2644807c-be7b-8e98-b6fb-027bd156719e_Sort_0-5-0/spill1
-rwxr-xr-x 3 root root 38067297 2017-09-15 00:23
/tmp/drill/qa-node190.qa.lab-31010_2644807c-be7b-8e98-b6fb-027bd156719e_Sort_0-5-0/spill2
-rwxr-xr-x 3 root root 38067297 2017-09-15 00:23
/tmp/drill/qa-node190.qa.lab-31010_2644807c-be7b-8e98-b6fb-027bd156719e_Sort_0-5-0/spill3
-rwxr-xr-x 3 root root 38067297 2017-09-15 00:24
/tmp/drill/qa-node190.qa.lab-31010_2644807c-be7b-8e98-b6fb-027bd156719e_Sort_0-5-0/spill4
-rwxr-xr-x 3 root root 38067297 2017-09-15 00:25
/tmp/drill/qa-node190.qa.lab-31010_2644807c-be7b-8e98-b6fb-027bd156719e_Sort_0-5-0/spill5
-rwxr-xr-x 3 root root 38067297 2017-09-15 00:25
/tmp/drill/qa-node190.qa.lab-31010_2644807c-be7b-8e98-b6fb-027bd156719e_Sort_0-5-0/spill6
-rwxr-xr-x 3 root root 10027008 2017-09-15 00:26
/tmp/drill/qa-node190.qa.lab-31010_2644807c-be7b-8e98-b6fb-027bd156719e_Sort_0-5-0/spill7
{noformat}
> Spill file size parameter is not honored by the managed external sort
> ---------------------------------------------------------------------
>
> Key: DRILL-5478
> URL: https://issues.apache.org/jira/browse/DRILL-5478
> Project: Apache Drill
> Issue Type: Bug
> Components: Execution - Relational Operators
> Affects Versions: 1.10.0
> Reporter: Rahul Challapalli
> Assignee: Paul Rogers
> Fix For: 1.12.0
>
>
> git.commit.id.abbrev=1e0a14c
> Query:
> {code}
> ALTER SESSION SET `exec.sort.disable_managed` = false;
> alter session set `planner.width.max_per_node` = 1;
> alter session set `planner.disable_exchanges` = true;
> alter session set `planner.width.max_per_query` = 1;
> alter session set `planner.memory.max_query_memory_per_node` = 1052428800;
> alter session set `planner.enable_decimal_data_type` = true;
> select count(*) from (
> select * from dfs.`/drill/testdata/resource-manager/all_types_large` d1
> order by d1.map.missing
> ) d;
> {code}
> Boot Options (spill file size is set to 256MB)
> {code}
> 0: jdbc:drill:zk=10.10.100.190:5181> select * from sys.boot where name like
> '%spill%';
> +--------------------------------------------------+---------+-------+---------+----------+----------------------------------------------------+-----------+------------+
> | name | kind | type | status
> | num_val | string_val | bool_val
> | float_val |
> +--------------------------------------------------+---------+-------+---------+----------+----------------------------------------------------+-----------+------------+
> | drill.exec.sort.external.spill.directories | STRING | BOOT | BOOT
> | null | [
> # drill-override.conf: 26
> "/tmp/test"
> ] | null | null |
> | drill.exec.sort.external.spill.file_size | STRING | BOOT | BOOT
> | null | "256M" | null
> | null |
> | drill.exec.sort.external.spill.fs | STRING | BOOT | BOOT
> | null | "maprfs:///" | null
> | null |
> | drill.exec.sort.external.spill.group.size | LONG | BOOT | BOOT
> | 40000 | null | null
> | null |
> | drill.exec.sort.external.spill.merge_batch_size | STRING | BOOT | BOOT
> | null | "16M" | null
> | null |
> | drill.exec.sort.external.spill.spill_batch_size | STRING | BOOT | BOOT
> | null | "8M" | null
> | null |
> | drill.exec.sort.external.spill.threshold | LONG | BOOT | BOOT
> | 40000 | null | null
> | null |
> +--------------------------------------------------+---------+-------+---------+----------+----------------------------------------------------+-----------+------------+
> {code}
> Below are the spill files while the query is still executing. The size of the
> spill files is ~34MB
> {code}
> -rwxr-xr-x 3 root root 34957815 2017-05-05 11:26
> /tmp/test/26f33c36-4235-3531-aeaa-2c73dc4ddeb5_major0_minor0_op5_sort/run1
> -rwxr-xr-x 3 root root 34957815 2017-05-05 11:27
> /tmp/test/26f33c36-4235-3531-aeaa-2c73dc4ddeb5_major0_minor0_op5_sort/run2
> -rwxr-xr-x 3 root root 0 2017-05-05 11:27
> /tmp/test/26f33c36-4235-3531-aeaa-2c73dc4ddeb5_major0_minor0_op5_sort/run3
> {code}
> The data set is too large to attach here. Reach out to me if you need anything
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)