bvaradar commented on issue #2240:
URL: https://github.com/apache/hudi/issues/2240#issuecomment-724872433


   Does your record key has any natural ordering ? If not, you can disable 
"hoodie.bloom.index.prune.by.ranges=false".
   
   Also, you are using GLOBAL_BLOOM which is expected to scan all the 
partitions and will be more time consuming. Does your record move from one 
partition to another as part of updates ? If not, you should use BLOOM.
   
   Another point : Bloom Index is very effective when the number of records to 
be updated <<< the number of records in the table. With 0.6.0, You can try 
changing the index type "hoodie.index.type=SIMPLE"  (GLOBAL_SIMPLE for global) 
to Simple which would perform a join.
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to