majian1998 opened a new pull request, #9652: URL: https://github.com/apache/hudi/pull/9652
…ed by table with wrong path.

### Change Logs

Filter out tables with incorrect paths in the multi table service.

### Impact

_Describe any public API or user-facing feature change or any performance impact._

### Risk level (write none, low medium or high below)

_If medium or high, explain what verification was done to mitigate the risks._

### Documentation Update

The multi table service currently runs table services for multiple tables specified through `hoodie.tableservice.tablesToServe`. When a table is given an incorrect path, or a table has been deleted but the asynchronous table service is not yet aware of it, the current implementation fails the entire Spark job, which blocks the table services of every other table in the batch. A single bad path should not affect tables whose paths are configured correctly.

To address this, I will introduce a check that skips such tables and logs the details of any table that does not conform to Hudi's requirements using Log.error.

A question for discussion: which kinds of exceptions should impact the table service of other tables in the same batch, and which should not?

### Contributor's checklist

- [ ] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute)
- [ ] Change Logs and Impact were stated clearly
- [ ] Adequate tests were added if applicable
- [ ] CI passed
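For discussion, the skip-and-log check described above could look roughly like the sketch below. It is not the code in this PR: the class and method names (`TablePathFilter`, `isHudiTable`, `filterValidTables`) are hypothetical, it uses `java.nio.file` on a local file system instead of Hudi's own file system abstractions, and it assumes that a valid table is one whose base path contains a `.hoodie` metadata directory and that the configured value is a comma-separated list of base paths.

```java
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Hudi: keeps only base paths that look like Hudi tables.
class TablePathFilter {

    // Assumption: a Hudi table stores its metadata in a ".hoodie" directory under the base path.
    private static final String METAFOLDER_NAME = ".hoodie";

    // Returns true when basePath contains a ".hoodie" directory; any malformed path counts as invalid.
    static boolean isHudiTable(String basePath) {
        try {
            return Files.isDirectory(Paths.get(basePath, METAFOLDER_NAME));
        } catch (RuntimeException e) {
            // Covers null or syntactically invalid paths (NullPointerException, InvalidPathException).
            return false;
        }
    }

    // Splits the comma-separated config value and drops entries that are not Hudi tables,
    // logging each skipped entry instead of failing the whole batch.
    static List<String> filterValidTables(String tablesToServe) {
        List<String> valid = new ArrayList<>();
        for (String raw : tablesToServe.split(",")) {
            String basePath = raw.trim();
            if (basePath.isEmpty()) {
                continue;
            }
            if (isHudiTable(basePath)) {
                valid.add(basePath);
            } else {
                // Stand-in for the error-level log call mentioned in the description.
                System.err.println("Skipping table with invalid path: " + basePath);
            }
        }
        return valid;
    }
}
```

With a filter like this applied before the per-table services are scheduled, a missing or deleted table is reported and skipped while the remaining tables are still served. Which other failure types should be isolated in the same way is the open question raised above.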
