[ https://issues.apache.org/jira/browse/GOBBLIN-2204?focusedWorklogId=970746&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-970746 ]
ASF GitHub Bot logged work on GOBBLIN-2204: ------------------------------------------- Author: ASF GitHub Bot Created on: 27/May/25 07:25 Start Date: 27/May/25 07:25 Worklog Time Spent: 10m Work Description: vsinghal85 commented on code in PR #4113: URL: https://github.com/apache/gobblin/pull/4113#discussion_r2108403326 ########## gobblin-runtime/src/main/java/org/apache/gobblin/runtime/SafeDatasetCommit.java: ########## @@ -90,6 +90,7 @@ public Void call() metricContext = Instrumented.getMetricContext(datasetState, SafeDatasetCommit.class); finalizeDatasetStateBeforeCommit(this.datasetState); + this.datasetState.computeAndStoreQualityStatus(this.jobContext.getJobState()); Class<? extends DataPublisher> dataPublisherClass; Review Comment: Work unit is at individual task level, and if individual task data quality fails, it does fail that task as well. Here in this method specifically we are computing overall data quality of the dataset, based on data quality of all individual tasks. Issue Time Tracking ------------------- Worklog Id: (was: 970746) Remaining Estimate: 0h Time Spent: 10m > FileSize Data Quality implementation for FileBasedCopy > ------------------------------------------------------ > > Key: GOBBLIN-2204 > URL: https://issues.apache.org/jira/browse/GOBBLIN-2204 > Project: Apache Gobblin > Issue Type: Task > Reporter: Vaibhav Singhal > Priority: Major > Time Spent: 10m > Remaining Estimate: 0h > -- This message was sent by Atlassian Jira (v8.20.10#820010)