shahidki31 commented on pull request #32659:
URL: https://github.com/apache/spark/pull/32659#issuecomment-851056661


   Hi @srowen ,
     I checked the code and I ran all the TPCDS queries to find the potential 
cases. Following are the observations.
   1. Aggregate expressions like sum, avg, count etc. -> We can't estimate 
stats for this.
   2. Literal Expressions -> We can estimate stats. (q5 and q14b of TPCDS)
   3.  String repeat -> Opposite of substring. We can estimate the stats.
   
   I have added support for 2 and 3. I am not sure we can find a supercase, as 
each one's estimation will be different. 
   Please review. Thanks


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to