n3nash commented on a change in pull request #2611:
URL: https://github.com/apache/hudi/pull/2611#discussion_r584465066



##########
File path: hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/utils/HoodieHiveUtils.java
##########
@@ -62,6 +67,7 @@
   public static final String HOODIE_STOP_AT_COMPACTION_PATTERN = "hoodie.%s.ro.stop.at.compaction";
   public static final String INCREMENTAL_SCAN_MODE = "INCREMENTAL";
   public static final String SNAPSHOT_SCAN_MODE = "SNAPSHOT";
+  public static final String VALIDATE_SCAN_MODE = "VALIDATE"; //used for pre-commit validation

Review comment:
       @satishkotha On thinking about this a little more, I feel one should
be able to "validate" in both modes, `SNAPSHOT` and `INCREMENTAL`. Essentially,
what you want is a `SNAPSHOT @ commitTime`, which is what time travel allows,
although time travel ensures that we read only committed data. To keep the
concepts this way, you may want to just add a flag,
`hoodie.%s.consume.uncommitted`, whose default value is false. When it is
false, you always fall back to the `HoodieTableFileSystem` with the current
behavior; when it is set to true, you do what you are currently doing in the
"VALIDATE" scan mode. What do you think?
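
To make the suggestion concrete, here is a minimal sketch of how such a flag could be resolved per table. It is only an illustration: the class name, the constant name, and both helper methods are hypothetical, and a plain `Map` stands in for the job configuration so that the snippet runs on its own. Only the key pattern `hoodie.%s.consume.uncommitted` and its default of false come from the comment above.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch, not existing Hudi code.
final class ConsumeUncommittedSketch {

    // Key pattern proposed in the review; %s is replaced by the table name,
    // mirroring the other per-table patterns in HoodieHiveUtils.
    static final String HOODIE_CONSUME_UNCOMMITTED_PATTERN = "hoodie.%s.consume.uncommitted";

    // Returns true only when the flag is explicitly set to "true" for this table.
    // A missing or unparsable value falls back to false (the proposed default).
    static boolean shouldConsumeUncommitted(Map<String, String> conf, String tableName) {
        String key = String.format(HOODIE_CONSUME_UNCOMMITTED_PATTERN, tableName);
        return Boolean.parseBoolean(conf.getOrDefault(key, "false"));
    }

    // Describes which read path would be taken; the real branches would build
    // the file-system view (current behavior) or include uncommitted files.
    static String describeReadPath(Map<String, String> conf, String tableName) {
        return shouldConsumeUncommitted(conf, tableName)
            ? "include uncommitted data (what VALIDATE scan mode does today)"
            : "committed data only (current behavior)";
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        System.out.println("default: " + describeReadPath(conf, "trips"));

        conf.put("hoodie.trips.consume.uncommitted", "true");
        System.out.println("flag on: " + describeReadPath(conf, "trips"));
    }
}
```

With this shape the scan mode (`SNAPSHOT` or `INCREMENTAL`) stays independent of whether uncommitted data is visible, so a separate "VALIDATE" mode would not be needed.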




