jayesh2424 opened a new issue, #10852: URL: https://github.com/apache/hudi/issues/10852
I have a Hudi datalake in my AWS. Currently to have a ETL operation I usually use the full load of Hudi Datalake for my operations. I want to know how Can I have a particular set of data only from the Hudi datalake. What I really want to achieve is a method like create_dynamic_frame.from_options(). Where we sent a samplequery to the database and fetch particular set of data only. Just like that I want to send a SQL query to the Hudi Datalake. The main goal is rather loading datalake and then filter out. I want to filter out the datalake and then load the particular part of datalake only. It will be great if I am able to load this particular data with help of spark sql. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
