soumilshah1995 opened a new issue, #10784:
URL: https://github.com/apache/hudi/issues/10784

   # Description
   As a user of Hudi Delta Streamer, I appreciate the concept of OneTable, but 
I find it cumbersome to run a separate job for transaction processing. I'm 
seeking guidance on whether there's a way to directly write data from Delta 
Streamer into OneTable format without the need for additional jobs.
   
   # Details
   I've explored the available documentation but haven't found a comprehensive 
guide on achieving this. I'm eager to experiment with this functionality and 
share my findings with the community through videos and blogs.
   
   # Questions
   Is there a method to configure Hudi Delta Streamer to write data directly 
into OneTable format?
   Could you provide guidance or point me to relevant documentation for 
implementing this functionality?
   Additionally, I'm looking for the hudi-extensions-0.1.0-SNAPSHOT-bundled 
JAR. Is this available on Maven repositories, and if so, under which 
coordinates?
   
   Spark Submit Configuration (Working with Hudi)
   
   ```
   spark-submit \
       --class org.apache.hudi.utilities.streamer.HoodieStreamer \
       --packages 'org.apache.hudi:hudi-spark3.4-bundle_2.12:0.14.0'\
       --properties-file spark-config.properties \
       --master 'local[*]' \
       --executor-memory 1g \
       
/Users/soumilshah/IdeaProjects/SparkProject/DeltaStreamer/jar/hudi-utilities-slim-bundle_2.12-0.14.0.jar
 \
       --table-type COPY_ON_WRITE \
       --op UPSERT \
       --source-limit 4000000 \
       --source-ordering-field ts \
       --source-class org.apache.hudi.utilities.sources.CsvDFSSource \
       --target-base-path 
'file:///Users/soumilshah/IdeaProjects/SparkProject/DeltaStreamer/hudi/bronze_orders'
  \
       --target-table bronze_orders \
       --props hudi_tbl.props
   ```
   
   Desired Changes
   I'm looking for guidance on what modifications are necessary in the above 
Spark Submit configuration to enable writing into OneTable format directly.
   
   References
   [Syncing to OneTable Format - Hudi 
Documentation](https://hudi.apache.org/docs/next/syncing_onetable)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to