alamb commented on issue #147: URL: https://github.com/apache/arrow-datafusion/issues/147#issuecomment-1001181171
> I think for the more advanced group by queries we'll need to add median, standard dev, and correlation functions. ive created an issue for adding those as well - but hopefully we can submit benchmark without those and add them when the functionality is added. FWIW standard deviation and correlation can be calculated using the existing aggregation functions (aka `AVG(X)` and `AVG(X^2)`), numerical precision issues not withstanding Median is harder -- I think it will need special casing as it can't be calculated using partial aggregates -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
