[GitHub] [hudi] xiarixiaoyao commented on a change in pull request #4060: [HUDI-2814] Addressing issues w/ Z-order Layout Optimization

GitBox Thu, 25 Nov 2021 17:42:25 -0800


xiarixiaoyao commented on a change in pull request #4060:
URL: https://github.com/apache/hudi/pull/4060#discussion_r757182187




##########
File path: 
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/client/SparkRDDWriteClient.java
##########
@@ -389,15 +389,16 @@ private void 
completeClustering(HoodieReplaceCommitMetadata metadata, JavaRDD<Wr
     writeTableMetadata(table, metadata, new 
HoodieInstant(HoodieInstant.State.INFLIGHT, 
HoodieTimeline.REPLACE_COMMIT_ACTION, clusteringCommitTime));
     finalizeWrite(table, clusteringCommitTime, writeStats);
     try {
-      // try to save statistics info to hudi
-      if (config.isDataSkippingEnabled() && 
config.isLayoutOptimizationEnabled() && 
!config.getClusteringSortColumns().isEmpty()) {
-        table.updateStatistics(context, writeStats, clusteringCommitTime, 
true);
+      // Update outstanding metadata indexes
+      if (config.isLayoutOptimizationEnabled()

Review comment:
       If you really want to modify it in this way, pls remove layout_ 
OPTIMIZE_ DATA_ SKIPPING_ Enable this configuration item.
   This configuration item was introduced for fear of unstable data skipping， 
let's remove  this config.
   
   by the way, it is suggested to directly remove this judgment condition. In 
this way, the cluster + sort operation can also generate indexes for query
   




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [hudi] xiarixiaoyao commented on a change in pull request #4060: [HUDI-2814] Addressing issues w/ Z-order Layout Optimization

Reply via email to