lamber-ken commented on issue #1469: [HUDI-686] Implement BloomIndexV2 that does not depend on memory caching URL: https://github.com/apache/incubator-hudi/pull/1469#issuecomment-612546567 > understood :).. but I am saying the difference you see is outside the indexing piece.. Can you post the stages UI? Got, let me show you the spark stage page, amazing 👍. From the ui, we can see that time-consuming, shuffle write, shuffle read are better. BTW, env: 4core, 6GB driver, local mode, don't forget `"hoodie.index.type" -> "BLOOM_V2"` #### `HoodieBloomIndex`: cost about 20min  #### `HoodieBloomIndexV2`: cost about 3min 
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
