ktmud commented on pull request #13434: URL: https://github.com/apache/superset/pull/13434#issuecomment-791641028
I'm trying to imagine what would be a useful case in exposing `SELECT DISTINCT abc FROM tbl` in the API and why would the user (or client query builder) choose one way or another---since it generally produces the same results as group by (`SELECT abc FROM tbl GROUP BY abc`). Note that in some data engines (e.g. Presto), there are performance implications in using distinct vs group by to get distinct values and [`GROUP BY` is normally faster](https://www.qubole.com/blog/presto-optimizes-aggregations-over-distinct-values/)). I'm OK with either way, but would hope we can keep the interface simple and avoid the possibility of getting the same results with different query configs. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
