vinothchandar commented on issue #1694: URL: https://github.com/apache/hudi/issues/1694#issuecomment-644113731
Sorry.. slipped off my radar./ > After setting hoodie.parquet.small.file.limit=0 This suggests that a lot of records were getting packed into existing files before.. I think you'd want to consider having the old behavior for real production use-cases.. we absorb that cost on the writer side, to queries will have well sized files for reading.. >countByKey at WorkloadProfile.java started getting more time. In the screenshot, all I see is indexing dominating the cost.. ? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
