Hi All, Please find the raised issue: https://github.com/apache/airflow/issues/56846
Overview I would like to contribute a comprehensive MariaDB provider to Apache Airflow that includes: Native MariaDB connector support ColumnStore engine integration Bulk data loading with cpimport utility S3 integration for data transfer SSH-based remote execution Why a Separate MariaDB Provider? While MySQL provider exists, MariaDB has unique features that justify a dedicated provider: Native MariaDB Connector: Uses mariadb library instead of generic MySQL connector It is seen that using Native Mariadb as a python connector performs 2-3x better for operations like insert_many, json handling and select queries. cpimport Utility: Bulk loading functionality specific to MariaDB ColumnStore ColumnStore Engine Support: Validation and optimization for ColumnStore tables S3 Integration: Direct S3 load/dump operations SSH Integration: Remote execution capabilities Components Included MariaDBHook: Main hook with native connector support MariaDBOperator: SQL execution operator MariaDBCpImportOperator: Bulk data loading for ColumnStore MariaDBS3LoadOperator: Load data from S3 to MariaDB MariaDBS3DumpOperator: Export MariaDB data to S3 Comprehensive test suite with 95%+ coverage Example DAGs for all features Complete documentation Dependencies mariadb>=1.1.0 (native connector) boto3>=1.26.0 (S3 integration) paramiko>=2.7.0 (SSH support) Please let me know your thoughts before I submit my PR. Thanks, Pratush Maheshwari
