Ying Zhou created ARROW-11117:
---------------------------------
Summary: [C++] ORC Reader uses wrong types
Key: ARROW-11117
URL: https://issues.apache.org/jira/browse/ARROW-11117
Project: Apache Arrow
Issue Type: Bug
Components: C++
Reporter: Ying Zhou
The Arrow C++ ORC reader does not process types correctly. In particular it
does the following:
1. It converts the ORC STRING type to the Arrow STRING type despite the fact
that all ORC STRINGs are large.
2. It converts the ORC LIST type to the Arrow LIST type despite the fact that
all ORC LISTs are large.
3. It converts the ORC MAP type to LISTS of STRUCTS with hardcoded field names
while an actual MAP type exists in Arrow (note that the ORC MAPs are large so
we need to filter out large ones when converting).
These issues need to be fixed.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)