westonpace commented on issue #11967:
URL: https://github.com/apache/arrow/issues/11967#issuecomment-995297427


   This is a null array represented in parquet:
   ```
   <ParquetColumnSchema>
     name: x
     path: x
     max_definition_level: 1
     max_repetition_level: 0
     physical_type: INT32
     logical_type: Null
     converted_type (legacy): NONE
   ```
   
   The original post was not a null array:
   ```
   <ParquetColumnSchema>
     name: col13
     path: col13
     max_definition_level: 1
     max_repetition_level: 0
     physical_type: INT96
     logical_type: None
     converted_type (legacy): NONE
   ```
   
   In particular the `logical_type` is different between the two.  I'm not 
actually sure what data type it is (maybe timestamp)?  What is different 
between how the data is generated?  Do all of your files look like the latter 
(`logical_type: None`) or is it possible that some of your files look like the 
former?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to