westonpace commented on issue #11967:
URL: https://github.com/apache/arrow/issues/11967#issuecomment-995297427
This is a null array represented in parquet:
```
<ParquetColumnSchema>
name: x
path: x
max_definition_level: 1
max_repetition_level: 0
physical_type: INT32
logical_type: Null
converted_type (legacy): NONE
```
The original post was not a null array:
```
<ParquetColumnSchema>
name: col13
path: col13
max_definition_level: 1
max_repetition_level: 0
physical_type: INT96
logical_type: None
converted_type (legacy): NONE
```
In particular the `logical_type` is different between the two. I'm not
actually sure what data type it is (maybe timestamp)? What is different
between how the data is generated? Do all of your files look like the latter
(`logical_type: None`) or is it possible that some of your files look like the
former?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]