[GitHub] [hudi] vinothchandar commented on issue #1694: Slow Write into Hudi Dataset(MOR)

GitBox Mon, 15 Jun 2020 05:48:26 -0700


vinothchandar commented on issue #1694:
URL: https://github.com/apache/hudi/issues/1694#issuecomment-644113731



   Sorry.. slipped off my radar./ 
   
   > After setting hoodie.parquet.small.file.limit=0 
   This suggests that a lot of records were getting packed into existing files 
before.. I think you'd want to consider having the old behavior for real 
production use-cases.. we absorb that cost on the writer side, to queries will 
have well sized files for reading..
   
   >countByKey at WorkloadProfile.java started getting more time.
   In the screenshot, all I see is indexing dominating the cost.. ?
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [hudi] vinothchandar commented on issue #1694: Slow Write into Hudi Dataset(MOR)

Reply via email to