Ying Zhou created ARROW-11117:
---------------------------------

             Summary: [C++] ORC Reader uses wrong types
                 Key: ARROW-11117
                 URL: https://issues.apache.org/jira/browse/ARROW-11117
             Project: Apache Arrow
          Issue Type: Bug
          Components: C++
            Reporter: Ying Zhou


The Arrow C++ ORC reader does not process types correctly. In particular it 
does the following:
1. It converts the ORC STRING type to the Arrow STRING type despite the fact 
that all ORC STRINGs are large.

2. It converts the ORC LIST type to the Arrow LIST type despite the fact that 
all ORC LISTs are large.

3. It converts the ORC MAP type to LISTS of STRUCTS with hardcoded field names 
while an actual MAP type exists in Arrow (note that the ORC MAPs are large so 
we need to filter out large ones when converting). 

These issues need to be fixed.

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to