alamb commented on issue #7781: URL: https://github.com/apache/arrow-datafusion/issues/7781#issuecomment-1754871075
> Of course it cannot know that. But it should keep scanning the data only if it didnt find 10 values yet, i.e. My point is that I am not sure how DataFusion would know there are only 10 unique values. > I'm willing to work on a fix if you guys are interested. If you have ways to improve things, that would be most appreciated. I am still not quite sure what you would do in this case that would work generally, but there may be some cleverness I am missing. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
