[
https://issues.apache.org/jira/browse/DRILL-5758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178075#comment-16178075
]
ASF GitHub Bot commented on DRILL-5758:
---------------------------------------
Github user paul-rogers commented on the issue:
https://github.com/apache/drill/pull/932
Added a fix for repeated columns that have a low cardinality. If, say, one
row in ten has an array entry, then average cardinality (values per row) is
0.1. This was represented by an int, rounded to 0 and caused a zero-length
vector to be allocated. Drill then tried to double the length, which resulted
in 0, which was doubled again, and so on forever.
The fix has three parts:
* The "record batch sizer" uses floats to allow fractional cardinality.
* The vector initializer now works with fractional cardinality.
* If all else fails, if a fixed-width vector is asked to double from zero,
it sizes the vector to 256 bytes.
> Rollup of external sort fixes to issues found by QA
> ---------------------------------------------------
>
> Key: DRILL-5758
> URL: https://issues.apache.org/jira/browse/DRILL-5758
> Project: Apache Drill
> Issue Type: Task
> Affects Versions: 1.12.0
> Reporter: Paul Rogers
> Assignee: Paul Rogers
> Fix For: 1.12.0
>
>
> Tracking JIRA to used for the PR that combines fixes for various JIRA
> entries. Bugs fixed in this task are given by the linked issues.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)