soumilshah1995 opened a new issue, #10896: URL: https://github.com/apache/hudi/issues/10896
Hello,

I recently created a video tutorial on backfilling with Hudi, and while experimenting I ran into a problem with the insert_overwrite operation when using the Delta Streamer. I have been working primarily with PySpark for these tasks.

Video Link: [Video on Backfilling with Hudi](https://www.example.com/)
Code Base: [GitHub Repository](https://github.com/soumilshah1995?tab=repositories)

In my workflow, I wanted to perform an insert_overwrite on an entire partition. However, when I ran the insert_overwrite job with Delta Streamer, it failed with an error indicating that the method was not found. The same kind of insert_overwrite write works for me when I run it through PySpark.

My question is: does Delta Streamer support the insert_overwrite operation? If it does not, I would like to request this feature. Supporting insert_overwrite would benefit users who rely on Delta Streamer for data backfilling and other data management tasks.

Thank you for your attention to this matter. I look forward to hearing from the community about the feasibility of adding this feature.

Slack Thread: https://apache-hudi.slack.com/archives/C4D716NPQ/p1710788154289249
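For reference, the PySpark path mentioned above can be sketched roughly as follows. This is a minimal sketch and not the code from the video or the linked repository: the helper names `insert_overwrite_options` and `overwrite_partitions`, and the table and column names (`orders`, `order_id`, `order_date`, `updated_at`), are assumptions made up for illustration. The option keys are the standard Hudi Spark datasource write settings, and the actual write needs a Spark session with the Hudi bundle on the classpath.

```python
from typing import Dict


def insert_overwrite_options(
    table_name: str,
    record_key: str,
    partition_field: str,
    precombine_field: str,
) -> Dict[str, str]:
    """Build Hudi datasource write options for an insert_overwrite write."""
    return {
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.recordkey.field": record_key,
        "hoodie.datasource.write.partitionpath.field": partition_field,
        "hoodie.datasource.write.precombine.field": precombine_field,
        # insert_overwrite replaces the partitions present in the incoming batch.
        "hoodie.datasource.write.operation": "insert_overwrite",
    }


def overwrite_partitions(df, base_path: str, options: Dict[str, str]) -> None:
    """Write a Spark DataFrame to the Hudi table at base_path.

    Assumes `df` is a pyspark.sql.DataFrame and that the Hudi Spark bundle
    is available to the running Spark session.
    """
    df.write.format("hudi").options(**options).mode("append").save(base_path)


# Hypothetical table and column names, for illustration only.
opts = insert_overwrite_options("orders", "order_id", "order_date", "updated_at")
```

With a DataFrame holding the backfill data for one partition, `overwrite_partitions(df, base_path, opts)` would perform the write. Whether the Delta Streamer accepts the same operation is the open question of this issue.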
