liugddx opened a new issue, #3002:
URL: https://github.com/apache/incubator-seatunnel/issues/3002

   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   
   
   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/incubator-seatunnel/issues?q=is%3Aissue+label%3A%22bug%22)
 and found no similar issues.
   
   
   ### Describe the proposal
   
   Dirty data is data that is not meaningful to the business, has an illegal 
format, or is out of sync. A single piece of data is dirty if an exception 
occurs during writing to the target data source. So any data that fails to 
write is classified as dirty. For example, data of type VARCHAR on the source 
side is written to the target column of type INT, resulting in data that can 
not be written because of improper conversion. You can control whether dirty 
data is allowed during synchronization when task configuration is synchronized, 
and support for controlling the number of dirty data bars, that is, when the 
dirty data exceeds the specified number, the task fails to exit.
   
   ### Task list
   
   Support for defining dirty data and its impact on tasks
   
   - [ ]  When do not allow the dirty data, if produce dirty data 
synchronization task execution process, the task will fail
   - [ ] When allowing dirty data and set its threshold, synchronization task 
will ignore the dirty data (that is, won't write to the target side), and 
normal execution.
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to