rtpsw commented on PR #14386:
URL: https://github.com/apache/arrow/pull/14386#issuecomment-1276201896

   > Same problem as #14347: this is basically adding a partial, incomplete 
guard for a condition the caller is supposed to check for themselves.
   
   > If we wanted to check that the ExecBatch values corresponding to the 
Schema, we should check for schema equality on each of the Datum values. That 
can unfortunately be expensive.
   
   To be sure, I understand that by partial incomplete guard you mean not 
checking for full schema equality. My goal here (and in the PR you linked) is 
not to fully guard, and I understand the cost of trying to do so. My goal is to 
avoid runtime crashes that are hard to debug when the cost of doing so is 
small. IMHO, this PR is a good tradeoff because it uses a cheap check to 
transform a crash failure to a raised error, which is easier to debug. I run 
into such crash failures while developing test cases. I agree that when a test 
case is correct then the failure cases do not occur, yet during development 
test cases are frequently not correct. Another reason is that developers of 
large apps would generally prefer an error the app can handle, and perhaps 
raise to its user, over crashing the app.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to