the-other-tim-brown opened a new issue, #18115:
URL: https://github.com/apache/hudi/issues/18115

   ### Task Description
   
   **What needs to be done:**
   Analyze the query plan when deserializing the data and make sure that this 
happens after any filtering on the structured data columns and after any joins 
or other shuffle steps. 
   
   **Why this task is needed:**
   This will help reduce the cost of jobs that deal with unstructured data.
   
   ### Task Type
   
   Code improvement/refactoring
   
   ### Related Issues
   
   **Parent feature issue:** (if applicable )
   **Related issues:**
   NOTE: Use `Relationships` button to add parent/blocking issues after issue 
is created.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to