tustvold commented on issue #7040:
URL: https://github.com/apache/arrow-rs/issues/7040#issuecomment-2624074413
For reference this is the output from pyarrow
```
>>> pq.read_table('/home/raphael/Downloads/alltypes_extended_plain.parquet')
pyarrow.Table
_1: bool
_2: int8
_3: int16
_4: int32
_5: int64
_6: float
_7: double
_8: string
_9: uint8
_10: uint16
_11: uint32
_12: uint64
_13: binary
_14: fixed_size_binary[3]
_15: decimal128(5, 2)
_16: decimal128(18, 10)
_17: decimal128(38, 37)
_18: timestamp[ms, tz=UTC]
_19: timestamp[us, tz=UTC]
_20: date32[day]
----
_1: [[null,null,true,null,true,null,null,null,true,null]]
_2: [[null,null,18,null,20,null,null,null,24,null]]
_3: [[null,null,10002,null,10004,null,null,null,10008,null]]
_4: [[null,null,10002,null,10004,null,null,null,10008,null]]
_5: [[null,null,10002,null,10004,null,null,null,10008,null]]
_6: [[null,null,10002,null,10004,null,null,null,10008,null]]
_7: [[null,null,10002,null,10004,null,null,null,10008,null]]
_8:
[[null,null,"100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002",null,"100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004",null,null,null,"100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008",null]]
_9: [[null,null,238,null,236,null,null,null,232,null]]
_10: [[null,null,55534,null,55532,null,null,null,55528,null]]
```
IMO if the column contains an out of range Int32 for a UInt8 or UInt16
returning null or an error **is** the correct behaviour. I am not sure what
exactly you are proposing should be different?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]