danny0405 commented on code in PR #12451:
URL: https://github.com/apache/hudi/pull/12451#discussion_r1877709074


##########
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/ConsistentHashingBucketInsertPartitioner.java:
##########
@@ -29,5 +29,14 @@ public interface ConsistentHashingBucketInsertPartitioner {
    * @param partition partition to set Consistent Hashing nodes
    * @param nodes     nodes from clustering plan
    */
+  /**
+   * Set pending consistent hashing for partition, only used in executing 
clustering
+   * When call this method, the bulk insert will directly use the pending 
metadata as the consistent hash metadata for writing data to after-resizing 
buckets.
+   * Used in the case of executing bulk insert.
+   * NOTE: This method should be called before the bulk insert operation, and 
will skip building identifiers from records, just use the pending metadata.
+   * For which not calling this method, the bulk insert will use the committed 
metadata as the bucket metadata and disallow writing data to the 
pending-resizing buckets.
+   * @param partition partition to set Consistent Hashing nodes
+   * @param nodes     nodes from clustering plan
+   */

Review Comment:
   Does one partitioner instance works for only one partition during the whole 
life-cycle?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to