NookVoive opened a new issue, #8268:
URL: https://github.com/apache/seatunnel/issues/8268

   ### Search before asking
   
   - [X] I had searched in the 
[feature](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22Feature%22)
 and found no similar feature requirement.
   
   
   ### Description
   
   In the current implementation of full database synchronization, a list of 
existing tables on the source side is required. If a new table is added to the 
data source later, it is still necessary to modify the task's configuration 
file and restart the task. Can we further enhance this full database 
synchronization feature so that only the database name that needs to be 
synchronized needs to be configured to complete the full database 
synchronization?
   
   Here's my idea:
   1. Based on the configured database name, query which tables are under the 
database and generate the configuration file.
   2. Users can customize exclusion rules to determine which tables not to 
synchronize.
   3. The sink end automatically creates new tables in full.
   4. When a new table is created on the source side, update the obtained list 
of tables.
   5. The source side updates the source table list, automatically completes 
the synchronization of new tables, and the sink side automatically completes 
the table creation action.
   
   在当前的整库同步实现中,需要给source端已有表清单。
   如果后续数据源新增了一张表,依然需要修改任务的配置文件并重新启动任务。
   是否可以继续增强下这个整库同步的功能,任务配置时仅需配置需要整库同步的数据库名,即可完成整库同步?
   
   我的想法是可以这样:
   1. 根据配置好的数据库名,查询数据库下有哪些表,并生成配置文件。
   2. 用户可以自定义排除规则不同步哪些表
   3. sink端完整自动新增创建表
   4. 当source端新创建一张表后,更新获取到的表清单
   5. source端更新源表清单,自动完成新增同步表,且sink端自动完成创建表动作
   
   ### Usage Scenario
   
   In the process of constructing a data warehouse, it is often necessary to 
perform full synchronization of the entire library, but newly added tables on 
the source end cannot be automatically added to the synchronization.
   
   If synchronization of newly added tables could be completed automatically, 
it would reduce the workload of data synchronization tasks, eliminating the 
need for manual modification of synchronization tasks.
   
   ### Related issues
   
   _No response_
   
   ### Are you willing to submit a PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to