[
https://issues.apache.org/jira/browse/ORC-97?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chunyang Wen updated ORC-97:
----------------------------
Issue Type: New Feature (was: Improvement)
> Support column name selection in ReaderOptions
> ----------------------------------------------
>
> Key: ORC-97
> URL: https://issues.apache.org/jira/browse/ORC-97
> Project: Orc
> Issue Type: New Feature
> Components: C++
> Affects Versions: 1.2.0
> Reporter: Chunyang Wen
>
> After the ORC-92 patch, selecting columns by id is supported, but selecting
> sub-types by name would be more useful.
> In my project, we use a period (.) to separate nested field names. Given
> <s1:struct<s2:struct<int1: int>>>
> we select int1 with s1.s2.int1, which is passed to
> include(std::list<std::string>).
> In my implementation, I first build a map from name to column id and then
> forward the call to includeTypes. If this is an acceptable approach, I
> will provide a patch for review soon.
> When a sub-type is selected, all of its child types should also be selected,
> as O'Malley pointed out in ORC-92.
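
A minimal sketch of the approach described above, for illustration only. It
assumes a hypothetical TypeNode schema tree with pre-order column ids (root
struct is column 0); TypeNode, assignIds, buildNameMap, resolveNames and
makeExampleSchema are made-up names, not the orc::Type or ReaderOptions API.
The sketch builds a map from dotted names to column ids and expands each
selected name to the ids of its whole subtree, so child types are selected
too. The resulting id list is what would then be handed on to includeTypes.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <memory>
#include <set>
#include <stdexcept>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a schema node; NOT the real orc::Type class.
struct TypeNode {
  std::string name;          // field name (empty for the root struct)
  uint64_t columnId = 0;     // pre-order column id
  uint64_t maxColumnId = 0;  // largest column id inside this subtree
  std::vector<std::unique_ptr<TypeNode>> children;
};

// Assign pre-order ids starting at `next`; returns the next unused id.
uint64_t assignIds(TypeNode& node, uint64_t next) {
  node.columnId = next++;
  for (auto& child : node.children) {
    next = assignIds(*child, next);
  }
  node.maxColumnId = next - 1;
  return next;
}

// Inclusive id range [first, second] covered by a named sub-type.
using IdRange = std::pair<uint64_t, uint64_t>;

// Map every dotted path (e.g. "s1.s2.int1") to the id range of its subtree.
void buildNameMap(const TypeNode& node, const std::string& prefix,
                  std::map<std::string, IdRange>& out) {
  for (const auto& child : node.children) {
    const std::string path =
        prefix.empty() ? child->name : prefix + "." + child->name;
    out[path] = IdRange(child->columnId, child->maxColumnId);
    buildNameMap(*child, path, out);
  }
}

// Translate selected names into a sorted, de-duplicated list of column ids,
// including every descendant of each selected sub-type.
std::vector<uint64_t> resolveNames(const std::map<std::string, IdRange>& names,
                                   const std::vector<std::string>& selected) {
  std::set<uint64_t> ids;
  for (const auto& name : selected) {
    auto it = names.find(name);
    if (it == names.end()) {
      throw std::invalid_argument("unknown column name: " + name);
    }
    assert(it->second.first <= it->second.second);
    for (uint64_t id = it->second.first; id <= it->second.second; ++id) {
      ids.insert(id);
    }
  }
  return std::vector<uint64_t>(ids.begin(), ids.end());
}

// Helper that creates a named node with no children.
std::unique_ptr<TypeNode> makeNode(const std::string& name) {
  std::unique_ptr<TypeNode> node(new TypeNode());
  node->name = name;
  return node;
}

// Builds the example schema <s1:struct<s2:struct<int1: int>>>.
std::unique_ptr<TypeNode> makeExampleSchema() {
  auto s2 = makeNode("s2");
  s2->children.push_back(makeNode("int1"));
  auto s1 = makeNode("s1");
  s1->children.push_back(std::move(s2));
  auto root = makeNode("");
  root->children.push_back(std::move(s1));
  assignIds(*root, 0);
  return root;
}
```

Under these assumptions, s1.s2.int1 resolves to a single column id, while
selecting s1.s2 yields the ids of both s2 and int1. An unknown name throws
rather than being silently ignored; a real patch might prefer a different
error policy.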
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)