alamb commented on PR #10:
URL: 
https://github.com/apache/datafusion-testing/pull/10#issuecomment-3179812794

   > I went through the first commit completely and most of the second commit. 
The first commit is fine, my main concern is really not with the changes in the 
2nd commit but I guess the original PR which seems to allow queries that I 
would really have expected to fail (avg(distinct), divide by zero). Assuming 
that is ok than the changes here are solid.
   
   In my mind the "do we restrict optimizations so that runtime errors always 
occur" is a classic question in analytic systems. 
   
   There were already simplification rules that avoided these errors for some 
operators. The PR in question is
   - https://github.com/apache/datafusion/pull/17088
   
   simply expanded the rules to apply to more operators so I think it is a good 
improvement
   
   What do we think about adding a note to the upgrade guide explaining that 
some queries may now succeed (b/c they are optimized away)? That might help 
anyone upgrading and seeing different results?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to