4chicat opened a new pull request, #5745:
URL: https://github.com/apache/seatunnel/pull/5745

   ### Purpose of this pull request
   We use S3 as iceberg warehouse storage.
   If `catalog_type` is `hadoop`, only hdfs is supported, so it is being 
customized and used.
   I want to contribute my customized code, so I push a pull request.
   
   ### Does this PR introduce _any_ user-facing change?
   we can use the iceberg table saved in s3.
   
   config example
   ```yml
   source {
     iceberg {
       catalog_name = "seatunnel"
       catalog_type = "hadoop"
       warehouse = "s3a://your_bucket/spark/warehouse"
       namespace = "your_iceberg_database"
       table = "your_iceberg_table"
       result_table_name = "iceberg_test"
       fs.s3a.endpoint = "xxx"
       fs.s3a.aws.credentials.provider = 
"org.apache.hadoop.fs.s3a.SimpleAWSCredentialsProvider"
       access_key = "xxx"
       secret_key = "xxx"
       bucket = "s3a://xxx"
       hadoop_s3_properties {
         "fs.s3a.path.style.access" = "true"
         ...
         "fs.s3a.connection.timeout" = "3000"
       }
     }
   }
   ```
   
   ### How was this patch tested?
   After downloading the seatunnel 2.3.3 binary file, I created an 
`iceberg_to_console.conf` config file and tested running the script below.
   ```sh
   cd apache-seatunnel-2.3.3
   ./bin/start-seatunnel-spark-3-connector-v2.sh \
   --master "local[4]" \
   --deploy-mode client \
   --config ./config/iceberg_to_console.conf
   ```
   
   I don't know how to do the e2e test.
   So, I ran a maven test as shown below, but an error message occurred and I 
could not proceed.
   ```sh
   mvn -pl :connector-iceberg-e2e -Dtest=IcebergSourceIT test
   ```
   error message
   ```java
   Could not resolve dependencies for project 
org.apache.seatunnel:connector-iceberg-e2e:jar:2.3.4-SNAPSHOT: Failed to 
collect dependencies at org.apache.seatunnel:connector-assert:jar:2.3.4-SNAPSHOT
   ```
   
   ### Check list
   
   * [ ] If any new Jar binary package adding in your PR, please add License 
Notice according
     [New License 
Guide](https://github.com/apache/seatunnel/blob/dev/docs/en/contribution/new-license.md)
   * [ ] If necessary, please update the documentation to describe the new 
feature. https://github.com/apache/seatunnel/tree/dev/docs
   * [ ] If you are contributing the connector code, please check that the 
following files are updated:
     1. Update change log that in connector document. For more details you can 
refer to 
[connector-v2](https://github.com/apache/seatunnel/tree/dev/docs/en/connector-v2)
     2. Update 
[plugin-mapping.properties](https://github.com/apache/seatunnel/blob/dev/plugin-mapping.properties)
 and add new connector information in it
     3. Update the pom file of 
[seatunnel-dist](https://github.com/apache/seatunnel/blob/dev/seatunnel-dist/pom.xml)
   * [ ] Update the 
[`release-note`](https://github.com/apache/seatunnel/blob/dev/release-note.md).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to