[
https://issues.apache.org/jira/browse/DRILL-7086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paul Rogers updated DRILL-7086:
-------------------------------
Description:
Modify the row-set scan framework to work with an external (partial) schema;
inserting "type conversion shims" to convert as needed. The reader provides an
"input schema" the data types the reader is prepared to handle. An optional
"output schema" describes the types of the value vectors to create. The type
conversion "shims" give the reader the "setFoo" method it wants to use, while
converting the data to the type needed for the vector. For example, the CSV
reader might read only text fields, while the shim converts a column to an INT.
This is just the framework layer, DRILL-7011 will combine this mechanism with
the plan-side features to enable use of the feature in the new row-set based
CSV reader.
was:Modify the row-set scan framework, and the row-set based CSV reader, to
use the new schema framework created in DRILL-7073 and related JIRAs. This
version is a proof-of-concept.
> Enhance row-set scan framework to use external schema
> -----------------------------------------------------
>
> Key: DRILL-7086
> URL: https://issues.apache.org/jira/browse/DRILL-7086
> Project: Apache Drill
> Issue Type: Improvement
> Affects Versions: 1.15.0
> Reporter: Paul Rogers
> Assignee: Paul Rogers
> Priority: Major
>
> Modify the row-set scan framework to work with an external (partial) schema;
> inserting "type conversion shims" to convert as needed. The reader provides
> an "input schema" the data types the reader is prepared to handle. An
> optional "output schema" describes the types of the value vectors to create.
> The type conversion "shims" give the reader the "setFoo" method it wants to
> use, while converting the data to the type needed for the vector. For
> example, the CSV reader might read only text fields, while the shim converts
> a column to an INT.
> This is just the framework layer, DRILL-7011 will combine this mechanism with
> the plan-side features to enable use of the feature in the new row-set based
> CSV reader.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)