westonpace commented on issue #5332:
URL: https://github.com/apache/arrow-rs/issues/5332#issuecomment-2551596562

   > With regards to parquet, I can't help feeling the format is sufficiently 
complex that supporting untrusted input is essentially a fools errand though...
   
   FWIW, I have always assumed proper handling of untrusted inputs to be an 
intentional feature of Parquet.  Many data services accept Parquet as input.  
Being an open format it has become a de-facto interchange format between 
systems.
   
   I do agree with Rafael though that infinite loops and memory overflows are 
in a more severe category than panics and it would be acceptable (though mildly 
unfortunate) to clearly state that invalid input may cause panics and users 
should catch unwind parquet routines.
   
   In a way, this is actually similar to C++ where Parquet throws exceptions 
while the rest of the Arrow library uses Result.  Arrow has "catch unwinds" at 
all the boundaries.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to