soumilshah1995 opened a new issue, #11252:
URL: https://github.com/apache/hudi/issues/11252

   I'm trying out Hudi error tables, but I can't find documentation on what 
value to set for `hoodie.errortable.write.class`. Could you please assist me?
   
   # sample config 
   ```
   hoodie.datasource.write.keygenerator.class=org.apache.hudi.keygen.SimpleKeyGenerator
   hoodie.datasource.write.recordkey.field=invoiceid
   hoodie.datasource.write.partitionpath.field=destinationstate
   hoodie.streamer.source.dfs.root=file:///Users/soumilshah/IdeaProjects/SparkProject/apache-hudi-delta-streamer-labs/E1/sampledata/
   hoodie.datasource.write.precombine.field=replicadmstimestamp
   hoodie.streamer.transformer.sql=SELECT * FROM <SRC> a where sas
   hoodie.errortable.base.path=file:///Users/soumilshah/IdeaProjects/SparkProject/apache-hudi-delta-streamer-labs/E1/error/
   hoodie.errortable.target.table.name=error_invoice
   hoodie.errortable.enable=true
   hoodie.errortable.write.class=
   ```
   
   # Job
   ```
   spark-submit \
     --class org.apache.hudi.utilities.streamer.HoodieStreamer \
     --packages org.apache.hudi:hudi-spark3.4-bundle_2.12:0.14.0 \
     --properties-file spark-config.properties \
     --master 'local[*]' \
     --executor-memory 1g \
     /Users/soumilshah/IdeaProjects/SparkProject/apache-hudi-delta-streamer-labs/E1/jar/hudi-utilities-slim-bundle_2.12-0.14.0.jar \
     --table-type COPY_ON_WRITE \
     --op UPSERT \
     --transformer-class org.apache.hudi.utilities.transform.SqlQueryBasedTransformer \
     --source-ordering-field replicadmstimestamp \
     --source-class org.apache.hudi.utilities.sources.ParquetDFSSource \
     --target-base-path file:///Users/soumilshah/IdeaProjects/SparkProject/apache-hudi-delta-streamer-labs/E1/silver/ \
     --target-table invoice \
     --props hudi_tbl.props
   ```
   
   I want to make the job fail on purpose so that I can see the error table 
being created.
   
   

