[ https://issues.apache.org/jira/browse/GOBBLIN-2204?focusedWorklogId=977070&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-977070 ]
ASF GitHub Bot logged work on GOBBLIN-2204: ------------------------------------------- Author: ASF GitHub Bot Created on: 31/Jul/25 09:22 Start Date: 31/Jul/25 09:22 Worklog Time Spent: 10m Work Description: vsinghal85 commented on code in PR #4113: URL: https://github.com/apache/gobblin/pull/4113#discussion_r2244838065 ########## gobblin-runtime/src/main/java/org/apache/gobblin/runtime/SafeDatasetCommit.java: ########## @@ -90,6 +94,14 @@ public Void call() metricContext = Instrumented.getMetricContext(datasetState, SafeDatasetCommit.class); finalizeDatasetStateBeforeCommit(this.datasetState); + // evaluate data quality at the dataset commit level, only when commit source is CommitActivityImpl + if(SafeDatasetCommit.COMMIT_SRC_COMMIT_ACTIVITY_IMPL.equals(this.datasetCommitSrc)){ + log.info("Evaluating data quality for commit activity for dataset {}.", this.datasetUrn); + evaluateAndEmitDatasetQuality(); + } else { + log.warn("Skipping data quality evaluation for dataset {} as commit source is {}", this.datasetUrn, + this.datasetCommitSrc); Review Comment: when SafeDatasetCommit is invoked via jobContext.commit(), it would go to else, and as per current existing implementation, this is expected case, changed the log to info. Issue Time Tracking ------------------- Worklog Id: (was: 977070) Time Spent: 2h 10m (was: 2h) > FileSize Data Quality implementation for FileBasedCopy > ------------------------------------------------------ > > Key: GOBBLIN-2204 > URL: https://issues.apache.org/jira/browse/GOBBLIN-2204 > Project: Apache Gobblin > Issue Type: Task > Reporter: Vaibhav Singhal > Priority: Major > Time Spent: 2h 10m > Remaining Estimate: 0h > -- This message was sent by Atlassian Jira (v8.20.10#820010)