Hi,

The numbers you see are from production at Uber, which I no longer have access to, so they are not synthetic. I use my own little script for testing write performance, since TPC-DS does not really have good support for update/delete workloads. I am happy to throw it up, but I think we could invest in a `hudi-perf-suite` module wrapping some of these things, at least for Spark/Flink to begin with.
I am happy to assist anyone willing to take a stab at it.

Thanks
Vinoth

On Thu, Sep 23, 2021 at 2:45 AM Danny Chan <[email protected]> wrote:

> +1, a benchmark that can reproduce is important for user testing then
> choose their final product.
>
> Best,
> Danny Chan
>
> On Tue, Sep 14, 2021 at 9:38 PM casel.chen <[email protected]> wrote:
>
> > Hello, everyone!
> >
> > I want to know how to do apache hudi performance test like
> > https://hudi.apache.org/docs/performance/? How to monitor those metrics
> > and any replay steps is appreciate. Thanks!
> >
> > Shuai
