GitHub user robintian001 created a discussion: About S3ToSQLOperator proposal

Hi ALL 

In my daily work, I use S3ToSqlOperator to synchronize S3 files to the 
database. However, when using it, I found some not very convenient things. For 
example, first of all s3_key must be a particularly clear path and cannot use a 
prefix for scanning, which means I have to unload the data from the data 
warehouse to a specific file in order to synchronize. Therefore, we implemented 
this function by ourselves through hook, but I'm not sure if this function is 
what the Airflow community expects or accepts.

Second: The execution performance of inserting databases in this Operator is 
very poor. The reason for this is that multiple insert statements are used for 
batch submission instead of using values (),(), and () for submission. 
Therefore, I have also carried out some custom development in this area

 

GitHub link: https://github.com/apache/airflow/discussions/58249

----
This is an automatically sent email for [email protected].
To unsubscribe, please send an email to: [email protected]

Reply via email to