tustvold commented on issue #7040:
URL: https://github.com/apache/arrow-rs/issues/7040#issuecomment-2624074413

   For reference this is the output from pyarrow
   
   ```
   >>> pq.read_table('/home/raphael/Downloads/alltypes_extended_plain.parquet')
   pyarrow.Table
   _1: bool
   _2: int8
   _3: int16
   _4: int32
   _5: int64
   _6: float
   _7: double
   _8: string
   _9: uint8
   _10: uint16
   _11: uint32
   _12: uint64
   _13: binary
   _14: fixed_size_binary[3]
   _15: decimal128(5, 2)
   _16: decimal128(18, 10)
   _17: decimal128(38, 37)
   _18: timestamp[ms, tz=UTC]
   _19: timestamp[us, tz=UTC]
   _20: date32[day]
   ----
   _1: [[null,null,true,null,true,null,null,null,true,null]]
   _2: [[null,null,18,null,20,null,null,null,24,null]]
   _3: [[null,null,10002,null,10004,null,null,null,10008,null]]
   _4: [[null,null,10002,null,10004,null,null,null,10008,null]]
   _5: [[null,null,10002,null,10004,null,null,null,10008,null]]
   _6: [[null,null,10002,null,10004,null,null,null,10008,null]]
   _7: [[null,null,10002,null,10004,null,null,null,10008,null]]
   _8: 
[[null,null,"100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002100021000210002",null,"100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004100041000410004",null,null,null,"100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008100081000810008",null]]
   _9: [[null,null,238,null,236,null,null,null,232,null]]
   _10: [[null,null,55534,null,55532,null,null,null,55528,null]]
   ```
   
   IMO if the column contains an out of range Int32 for a UInt8 or UInt16 
returning null or an error **is** the correct behaviour. I am not sure what 
exactly you are proposing should be different?
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to