[ https://issues.apache.org/jira/browse/GOBBLIN-2204?focusedWorklogId=971582&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-971582 ]
ASF GitHub Bot logged work on GOBBLIN-2204: ------------------------------------------- Author: ASF GitHub Bot Created on: 03/Jun/25 03:43 Start Date: 03/Jun/25 03:43 Worklog Time Spent: 10m Work Description: vsinghal85 commented on code in PR #4113: URL: https://github.com/apache/gobblin/pull/4113#discussion_r2122580840 ########## gobblin-core/src/main/java/org/apache/gobblin/policies/size/FileSizePolicy.java: ########## @@ -0,0 +1,63 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements. See the NOTICE file distributed with + * this work for additional information regarding copyright ownership. + * The ASF licenses this file to You under the Apache License, Version 2.0 + * (the "License"); you may not use this file except in compliance with + * the License. You may obtain a copy of the License at + * + * http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, software + * distributed under the License is distributed on an "AS IS" BASIS, + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + * See the License for the specific language governing permissions and + * limitations under the License. + */ + +package org.apache.gobblin.policies.size; + +import org.slf4j.Logger; +import org.slf4j.LoggerFactory; + +import org.apache.gobblin.configuration.State; +import org.apache.gobblin.qualitychecker.task.TaskLevelPolicy; + +/** + * A task-level policy that checks if the bytes read matches the bytes written for a file copy operation. + */ +public class FileSizePolicy extends TaskLevelPolicy { + private static final Logger LOG = LoggerFactory.getLogger(FileSizePolicy.class); + + public static final String BYTES_READ_KEY = "gobblin.copy.bytesRead"; + public static final String BYTES_WRITTEN_KEY = "gobblin.copy.bytesWritten"; Review Comment: CopyConfiguration is not accessible within this module and including a dependency goblin-data-management, will cause circular dependency, hence created separate constants Issue Time Tracking ------------------- Worklog Id: (was: 971582) Time Spent: 0.5h (was: 20m) > FileSize Data Quality implementation for FileBasedCopy > ------------------------------------------------------ > > Key: GOBBLIN-2204 > URL: https://issues.apache.org/jira/browse/GOBBLIN-2204 > Project: Apache Gobblin > Issue Type: Task > Reporter: Vaibhav Singhal > Priority: Major > Time Spent: 0.5h > Remaining Estimate: 0h > -- This message was sent by Atlassian Jira (v8.20.10#820010)