[ 
https://issues.apache.org/jira/browse/ARROW-9264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Micah Kornfield updated ARROW-9264:
-----------------------------------
    Description: Preceding ARROW-8493 there is some cleanup.  Variable 
renaming, using Result instead of output parameter and moving code from 
reader_internal.cc to schema.cc  (was: We need a function/class that can take 
the parquet schema and a proposed arrow schema (potentially retrieved from 
parquet metadata) and outputs a data structure that contains, all of the 
information in "SchemaField" and the following additional options:

 

1.  Corresponding Definition level for nullability (wouldn't be populated for 
non-null arrays).

2.  Correspond Repetition level for lists (wouldn't be populated for for 
non-lists).

3.  Definition level for "empty lists".  (wouldn't be populated for legacy two 
level encoded lists).

 

One option is to augment and populate these on the SchemaField.)

> [C++] Cleanup Parquet Arrow Schema code
> ---------------------------------------
>
>                 Key: ARROW-9264
>                 URL: https://issues.apache.org/jira/browse/ARROW-9264
>             Project: Apache Arrow
>          Issue Type: Sub-task
>          Components: C++
>            Reporter: Micah Kornfield
>            Assignee: Micah Kornfield
>            Priority: Major
>
> Preceding ARROW-8493 there is some cleanup.  Variable renaming, using Result 
> instead of output parameter and moving code from reader_internal.cc to 
> schema.cc



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to