sandugood commented on issue #4723: URL: https://github.com/apache/datafusion-comet/issues/4723#issuecomment-4801083170
Additional context - screenshots from Spark WEB-UI Spark's regular BatchScan <img width="1272" height="1484" alt="Image" src="https://github.com/user-attachments/assets/43ea51f8-f526-425b-8bea-7e0befbdd5a1" /> Comet native Iceberg scan:  As we can see - after Comet scans the table we get almost 42% more rows. And there is a problem with stats gathering, although couldn't make it work. Seems like it can't scan all of the metadata. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
