xiarixiaoyao commented on issue #4135:
URL: https://github.com/apache/hudi/issues/4135#issuecomment-983254085
@rubenssoto @vinothchandar I have run the test.
Suggestion: if you use z-order/hilbert in clustering, please set
option("hoodie.clustering.plan.strategy.max.bytes.per.group",
Long.MaxValue.toString)
We need as many files as possible to participate in the sort. That way the
sorting effect will be the best, and there is no parallelism problem.
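
A minimal sketch of how that option could be passed, assuming a Spark DataFrame write to Hudi. Only the config key and its value come from the suggestion above; the object name, `df`, and `basePath` are hypothetical placeholders, and the rest of the Hudi write config (table name, record key, clustering strategy and sort columns) is assumed to be set elsewhere.

```scala
// Sketch only: collects the suggested clustering option in a plain Map so it
// can be handed to a DataFrameWriter via .options(...).
object ClusteringOptionsSketch {
  // Long.MaxValue.toString is "9223372036854775807", so the per-group byte cap
  // is effectively unlimited and files are not split across groups by size.
  val clusteringOpts: Map[String, String] = Map(
    "hoodie.clustering.plan.strategy.max.bytes.per.group" -> Long.MaxValue.toString
  )

  def main(args: Array[String]): Unit = {
    // Print the option to check what would be passed to the writer.
    clusteringOpts.foreach { case (k, v) => println(s"$k=$v") }

    // Assumed usage with an existing DataFrame `df` and table path `basePath`
    // (both hypothetical here, other Hudi options omitted):
    // df.write.format("hudi").options(clusteringOpts).mode("append").save(basePath)
  }
}
```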
On the issue of parallelism: the current clustering mechanism itself makes
it impossible to do the z-order sort in parallel.
Let me submit a PR to solve this problem.