klesh commented on issue #4188: URL: https://github.com/apache/incubator-devlake/issues/4188#issuecomment-1484715170
> > Well, it has been a while since we tested it. Maybe we should do another round, there were two things that impacted the performance IIRC: > > ``` > > 1. Iterates all commits > > > > 2. Calculates diff between 2 commits. > > ``` > > > > > > > > > > > > > > > > > > > > > > > > Can we write Benchmark PoC with those actions covered to test on a couple of medium size repositories, like `clickhouse` and `pingcap`. So we can assess how slow it will be in the real-world. > > Benchmark is necessary. +1 But I am doubt about whether pingcap or clikckhouse is appropriate. Does devlake team has any statistics about the pct90 size of repos it will analyse? I think this data varies greatly from scenario to scenario. But I have no idea on how to get a appropriate test repository. Nah, not really. There are not enough resources to conduct such a massive investigation and gitext has never been a bottleneck compared to other departments, such as collecting data from API, Chart loading speed, etc. I would say randomly picking a couple of famous and long-lived open-source projects is sufficient at this point. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
