ktmud commented on pull request #13434:
URL: https://github.com/apache/superset/pull/13434#issuecomment-791641028


   I'm trying to imagine a useful case for exposing `SELECT 
DISTINCT abc FROM tbl` in the API, and why the user (or client query 
builder) would choose one form over the other, since it generally produces 
the same results as a group by (`SELECT abc FROM tbl GROUP BY abc`).
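   
   To make the equivalence concrete, here is a minimal sketch using Python's 
built-in `sqlite3` module. The table contents are made up for illustration, and 
SQLite stands in for whatever engine is actually used; this is not Superset 
code and says nothing about query plans or performance.

```python
import sqlite3

# Hypothetical in-memory table; the rows are illustrative only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tbl (abc TEXT)")
conn.executemany(
    "INSERT INTO tbl (abc) VALUES (?)",
    [("x",), ("y",), ("x",), (None,), ("y",), (None,)],
)

# The two query shapes discussed in this comment.
distinct_rows = conn.execute("SELECT DISTINCT abc FROM tbl").fetchall()
group_by_rows = conn.execute("SELECT abc FROM tbl GROUP BY abc").fetchall()

# Row order is unspecified without ORDER BY, so compare as sets.
same_values = set(distinct_rows) == set(group_by_rows)
print(same_values)

conn.close()
```

   Both queries collapse duplicates (including repeated NULLs) into one row per 
value, so the only difference a client could observe is row order or execution 
cost.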
   
   Note that in some data engines (e.g. Presto), there are performance 
implications in using distinct vs. group by to get distinct values, and [`GROUP 
BY` is normally 
faster](https://www.qubole.com/blog/presto-optimizes-aggregations-over-distinct-values/).
   
   I'm OK with either approach, but I hope we can keep the interface simple and 
avoid the possibility of getting the same results from different query configs.




