LuciferYang commented on pull request #30663: URL: https://github.com/apache/spark/pull/30663#issuecomment-745239990
> If you expect some perf improvements, it makes sense to re-run benchmarks ... or write new benchmarks if we don't have benchmarks for multiple Parquet/ORC files.

Let me try this, although I think the improvement is obvious: we should only read fileMeta from DFS when the filters are not empty.
