bvaradar commented on issue #2240: URL: https://github.com/apache/hudi/issues/2240#issuecomment-724872433
Does your record key has any natural ordering ? If not, you can disable "hoodie.bloom.index.prune.by.ranges=false". Also, you are using GLOBAL_BLOOM which is expected to scan all the partitions and will be more time consuming. Does your record move from one partition to another as part of updates ? If not, you should use BLOOM. Another point : Bloom Index is very effective when the number of records to be updated <<< the number of records in the table. With 0.6.0, You can try changing the index type "hoodie.index.type=SIMPLE" (GLOBAL_SIMPLE for global) to Simple which would perform a join. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
