[
https://issues.apache.org/jira/browse/DRILL-5344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15905988#comment-15905988
]
ASF GitHub Bot commented on DRILL-5344:
---------------------------------------
Github user Ben-Zvi commented on a diff in the pull request:
https://github.com/apache/drill/pull/778#discussion_r105520793
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/physical/impl/xsort/managed/PriorityQueueCopierTemplate.java
---
@@ -57,9 +57,12 @@ public void setup(FragmentContext context,
BufferAllocator allocator, VectorAcce
queueSize = 0;
for (int i = 0; i < size; i++) {
- vector4.set(i, i, batchGroups.get(i).getNextIndex());
- siftUp();
- queueSize++;
+ int index = batchGroups.get(i).getNextIndex();
+ vector4.set(i, i, index);
--- End diff --
If the record batch is empty, why set this (65535) value in the vector ?
Probably usually the empty batch is last, so queueSize would not be
incremented; otherwise if the next iteration brings a non empty batch, then
siftUp() may run into that 65535 .... (not sure ...)
> External sort priority queue copier fails with an empty batch
> -------------------------------------------------------------
>
> Key: DRILL-5344
> URL: https://issues.apache.org/jira/browse/DRILL-5344
> Project: Apache Drill
> Issue Type: Bug
> Affects Versions: 1.10.0
> Reporter: Paul Rogers
> Assignee: Paul Rogers
> Priority: Minor
> Fix For: 1.11.0
>
>
> The external sort uses a "priority queue copier" to merge batches when
> spilling or when merging spilled batches.
> The code will fail with an {{IndexOutOfBoundsException}} if any record batch
> is empty. The reason is a faulty assumption in generated code:
> {code}
> public void setup(...) {
> ...
> vector4.set(i, i, batchGroups.get(i).getNextIndex());
> ...
> }
> public int getNextIndex() {
> if (pointer == getRecordCount()) {
> return -1;
> }
> ...
> }
> {code}
> The code to get the next index returns -1 when the "position" in a record
> batch is zero. The -1 position translates (when truncated) into 65535 which
> produces the index exception.
> The workaround has been to special case empty batches elsewhere in the code,
> apparently to avoid hitting this error.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)