soumilshah1995 opened a new issue, #10896:
URL: https://github.com/apache/hudi/issues/10896

   Hello,
   
   
![image](https://github.com/apache/hudi/assets/39345855/9efbcee9-8503-4740-9a79-572b74653b87)
   
   I recently created a video tutorial on backfilling with Hudi, and during my 
experimentation, I encountered a challenge regarding the insert_overwrite 
method while using the Delta Streamer. I've been primarily working with PySpark 
for these tasks.
   
   Video Link: [Video on Backfilling with Hudi](https://www.example.com/)
   
   Code Base: [GitHub 
Repository](https://github.com/soumilshah1995?tab=repositories)
   
   In my workflow, I intended to perform an insert_overwrite on an entire 
partition. However, when attempting to execute the insert_overwrite job, I 
encountered an error indicating that the method was not found.
   
   I understand that I have successfully executed similar tasks using PySpark. 
My question is: does Delta Streamer support the insert_overwrite operation? If 
it does not currently support this operation, I would like to request adding 
this feature to enhance the functionality of Delta Streamer.
   
   I believe that supporting insert_overwrite would greatly benefit users who 
rely on Delta Streamer for data backfilling and other data management tasks.
   
   Thank you for your attention to this matter. I look forward to hearing from 
the community regarding the feasibility of adding this feature.
   
   Slack Thread 
https://apache-hudi.slack.com/archives/C4D716NPQ/p1710788154289249


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to