guysherman commented on issue #8189: URL: https://github.com/apache/hudi/issues/8189#issuecomment-1471555541
Just to add: _most_ of the time the performance is fine, and then occasionally it is really bad. So I'm mainly wondering whether there are characteristics of the input data, or of the existing Parquet files in the partition, that are expected to make the performance worse, and whether we can do anything to avoid those characteristics, e.g. pre-sorting.
