danny0405 commented on PR #10048: URL: https://github.com/apache/hudi/pull/10048#issuecomment-1807443686
> both build metaclient repeatedly for each table, causing additional overhead. To reduce this overhead, we extract the construction of metaclient and only construct it once for each table In my opinion, the main overhead of instantiating meta client is the loading and decoding of the hoodie instants which include file listing and deciphering of the commit metadata. The refreshing of hoodie instants for each round of table service scheduling is necessary becase the latest metadata is required. So I don't think we gains much by resuing the instance. > At the same time, when running multi tables services, the write config lacks some information from the table config, such as the table name. This leads to empty strings when retrieving the table name in certain situations. If there are some configuration inconsistencies, we can just fix it. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
