soumilshah1995 commented on issue #8309:
URL: https://github.com/apache/hudi/issues/8309#issuecomment-1489442038

   ```
     spark-submit \
       --class org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer  \
       --conf spark.serializer=org.apache.spark.serializer.KryoSerializer \
       --conf 
spark.sql.extensions=org.apache.spark.sql.hudi.HoodieSparkSessionExtension  \
       --conf 
spark.sql.catalog.spark_catalog=org.apache.spark.sql.hudi.catalog.HoodieCatalog 
\
       --conf spark.sql.hive.convertMetastoreParquet=false \
       --conf 
spark.hadoop.hive.metastore.client.factory.class=com.amazonaws.glue.catalog.metastore.AWSGlueDataCatalogHiveClientFactory
 \
       --master yarn \
       --deploy-mode client \
       --deploy-mode cluster \
       --executor-memory 1g \
       --driver-memory 2g \
        /usr/lib/hudi/hudi-utilities-bundle.jar \
       --table-type COPY_ON_WRITE \
       --op UPSERT \
       --source-ordering-field replicadmstimestamp  \
       --source-class org.apache.hudi.utilities.sources.ParquetDFSSource \
       --target-base-path s3://sql-server-dms-demo/hudi/public/sales \
       --target-table invoice \
       --hoodie-conf 
hoodie.datasource.write.keygenerator.class=org.apache.hudi.keygen.SimpleKeyGenerator
 \
       --hoodie-conf hoodie.datasource.write.recordkey.field=invoiceid \
       --hoodie-conf 
hoodie.datasource.write.partitionpath.field=destinationstate \
       --hoodie-conf 
hoodie.deltastreamer.source.dfs.root=s3://sql-server-dms-demo/raw/public/sales \
       --hoodie-conf  
hoodie.datasource.write.precombine.field=replicadmstimestamp
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to