nsivabalan commented on code in PR #13305:
URL: https://github.com/apache/hudi/pull/13305#discussion_r2115101556


##########
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/WriteStatus.java:
##########
@@ -61,6 +61,8 @@ public class WriteStatus implements Serializable {
 
   private final List<Pair<HoodieRecordDelegate, Throwable>> failedRecords = 
new ArrayList<>();
 
+  // true if this WriteStatus refers to a write happening in metadata table.
+  private boolean isMetadataTable;

Review Comment:
   actually, we need to know whether a writestatus is coming from a data table 
or metadata table. 
   there are chances that data table write status may not have any stats to add 
only. for eg, a log file added to data table containing just delete block may 
not have col stats or Secondary index stats for instance. 
   
   So, we need a way to differentiate these and categorize them accordingly. 
   eventually, we will route data table's write status to writeclient.commit() 
pertaining to data table. and metadata table's write status pertaining to 
metadata table to writeClient.commit() pertaining to metadata table. 
   
   Not sure, why adding an additional isMetadataTable() is not acceptable in 
this case. 
   
   for instance, we do use HoodieBaseFile to represent a base file as well as a 
bootstrap file. 
   
   I am w/ you on this `we can add optional component named IndexStats`. and 
only set it for data table. 
   but we need a deterministic way to differentiate data table write status 
from metadata table write status. 
   
   let me know if you have any other good suggestions. 
   
   



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to