Raghvendradubey commented on issue #1694:
URL: https://github.com/apache/hudi/issues/1694#issuecomment-638773847


   On job countByKey at HoodieBloomindex, stage mapToPair at 
HoodieWriteCLient.java:977 is taking longer time more than a minute, and stage  
countByKey at HoodieBloomindex is executed within seconds.
   yes there is skew in count at HoodieSparkSqlWriter, all partitions are 
getting 200 to 500KB data and one partition is getting 100mb+ data.
   
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to