[
https://issues.apache.org/jira/browse/DRILL-5657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated DRILL-5657:
-------------------------------
Description:
A recent extension to Drill's set of test tools created a "row set" abstraction
to allow us to create, and verify, record batches with very few lines of code.
Part of this work involved creating a set of "column accessors" in the vector
subsystem. Column readers provide a uniform API to obtain data from columns
(vectors), while column writers provide a uniform writing interface.
DRILL-5211 discusses a set of changes to limit value vectors to 16 MB in size
(to avoid memory fragmentation due to Drill's two memory allocators.) The
column accessors have proven to be so useful that they will be the basis for
the new, size-aware writers used by Drill's record readers.
A step in that direction is to retrofit the column writers to use the
size-aware {{setScalar()}} and {{setArray()}} methods introduced in DRILL-5517.
Since the test framework row set classes are (at present) the only consumer of
the accessors, those classes must also be updated with the changes.
This then allows us to add a new "row mutator" class that handles size-aware
vector writing, including the case in which a vector fills in the middle of a
row.
was:
A recent extension to Drill's set of test tools created a "row set" abstraction
to allow us to create, and verify, record batches with very few lines of code.
Part of this work involved creating a set of "column accessors" in the vector
subsystem. Column readers provide a uniform API to obtain data from columns
(vectors), while column writers provide a uniform writing interface.
DRILL-5211 discusses a set of changes to limit value vectors to 16 MB in size
(to avoid memory fragmentation due to Drill's two memory allocators.) The
column accessors have proven to be so useful that they will be the basis for
the new, size-aware writers used by Drill's record readers.
A step in that direction is to retrofit the column writers to use the
size-aware {{setScalar()}} and {{setArray()}} methods introduced in DRILL-5517.
Since the test framework row set classes are (at present) the only consumer of
the accessors, those classes must also be updated with the changes.
> Extend column accessors to be size-aware
> ----------------------------------------
>
> Key: DRILL-5657
> URL: https://issues.apache.org/jira/browse/DRILL-5657
> Project: Apache Drill
> Issue Type: Improvement
> Affects Versions: 1.11.0
> Reporter: Paul Rogers
> Assignee: Paul Rogers
> Fix For: 1.11.0
>
>
> A recent extension to Drill's set of test tools created a "row set"
> abstraction to allow us to create, and verify, record batches with very few
> lines of code. Part of this work involved creating a set of "column
> accessors" in the vector subsystem. Column readers provide a uniform API to
> obtain data from columns (vectors), while column writers provide a uniform
> writing interface.
> DRILL-5211 discusses a set of changes to limit value vectors to 16 MB in size
> (to avoid memory fragmentation due to Drill's two memory allocators.) The
> column accessors have proven to be so useful that they will be the basis for
> the new, size-aware writers used by Drill's record readers.
> A step in that direction is to retrofit the column writers to use the
> size-aware {{setScalar()}} and {{setArray()}} methods introduced in
> DRILL-5517.
> Since the test framework row set classes are (at present) the only consumer
> of the accessors, those classes must also be updated with the changes.
> This then allows us to add a new "row mutator" class that handles size-aware
> vector writing, including the case in which a vector fills in the middle of a
> row.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)