Re: [FEEDBACK REQUEST] Re: [ANNOUNCEMENT] Nexmark included to the CI

Etienne Chauchot Fri, 30 Nov 2018 05:03:21 -0800

No problem, always glad to help

Etienne


On Thursday, 29 November 2018 at 09:20 -0800, Alex Amato wrote:
> Thanks Etienne, appreciate the info. This will help me a lot :)
> On Wed, Nov 28, 2018 at 1:02 AM Etienne Chauchot <echauc...@apache.org> wrote:
> > Hi Alex,
> > Exporting results to the dashboards is as easy as writing to a BigQuery 
> > table and then configuring the dashboard's SQL
> > query to display it. Here is an example:
> > - exporting: 
> > https://github.com/apache/beam/blob/ad150c1d654aac5720975727d8c6981c5382b449/sdks/java/testing/nexmark/src/main/java/org/apache/beam/sdk/nexmark/Main.java#L163
> > - displaying:
> > 
> > SELECT
> >     DATE(timestamp) as date,
> >     runtimeSec
> > FROM
> >     [apache-beam-testing:nexmark.nexmark_0_DirectRunner_batch]
> > WHERE
> >     timestamp >= TIMESTAMP_TO_SEC(DATE_ADD(CURRENT_TIMESTAMP(), -2, "WEEK"))
> > ORDER BY
> >     date;
> > 
> > Best
> > Etienne
> > 
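
For readers who want to try the export step described above, here is a minimal Java sketch of shaping one run as a row and deriving the table name. The class PerfResultRow and its methods are hypothetical (they are not part of Beam or Nexmark). The column names timestamp, runtimeSec and numResults and the table-name pattern come from this thread; the column types (epoch seconds, double, long) are assumptions. The actual write to BigQuery is left out; the linked Main.java shows how Nexmark does it.

```java
import java.time.Instant;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper: shapes one benchmark run as a row for a per-query table. */
final class PerfResultRow {
    private PerfResultRow() {}

    /** Builds a name following the pattern seen in the query above, e.g. nexmark_0_DirectRunner_batch. */
    static String tableName(String suite, int query, String runner, String mode) {
        return suite + "_" + query + "_" + runner + "_" + mode;
    }

    /** Column names come from the thread; the value types are assumptions. */
    static Map<String, Object> toRow(Instant when, double runtimeSec, long numResults) {
        Map<String, Object> row = new LinkedHashMap<>();
        row.put("timestamp", when.getEpochSecond()); // assumed: stored as epoch seconds
        row.put("runtimeSec", runtimeSec);
        row.put("numResults", numResults);
        return row;
    }

    public static void main(String[] args) {
        System.out.println(tableName("nexmark", 0, "DirectRunner", "batch"));
        System.out.println(toRow(Instant.now(), 2.1, 100_000L));
    }
}
```

A row built this way could then be handed to whatever BigQuery client the benchmark already uses; one table per query, runner and mode keeps the dashboard SQL as simple as the example above.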
> > On Tuesday, 27 November 2018 at 17:34 -0800, Alex Amato wrote:
> > > It would be great to add some lower-level benchmark tests for the Java 
> > > SDK. I was thinking of using OpenCensus
> > > for collecting benchmarks; it looks easy to use and should be license 
> > > compatible. I'm just not sure how to
> > > export the results so that we can display them on the perfkit dashboard 
> > > for everyone to see.
> > > 
> > > Is there an example PR for this part? Can we easily write to the data 
> > > store behind this perfkit dashboard?
> > > 
> > > https://github.com/census-instrumentation/opencensus-java
> > > https://github.com/census-instrumentation/opencensus-java/tree/master/exporters/trace/zipkin#quickstart
> > > 
> > > 
> > > 
> > > 
> > > On Thu, Jul 19, 2018 at 1:28 PM Andrew Pilloud <apill...@google.com> 
> > > wrote:
> > > > The doc changes look good to me; I'll add Dataflow once it is ready. 
> > > > Thanks for opening the issue on the
> > > > DirectRunner. I'll try to make some progress on a dedicated perf node 
> > > > while you are gone. We can talk about
> > > > increasing the size of the Nexmark input collection for the runs once 
> > > > we know what the utilization on that looks
> > > > like.
> > > > Enjoy your time off!
> > > > 
> > > > Andrew
> > > > On Thu, Jul 19, 2018 at 9:00 AM Etienne Chauchot <echauc...@apache.org> 
> > > > wrote:
> > > > > Hi guys,
> > > > > As suggested by Anton below, I opened a PR on the website to 
> > > > > reference the Nexmark dashboards. As I
> > > > > did not want users to take them for proper neutral benchmarks of the 
> > > > > runners / engines, but rather for a CI
> > > > > piece of software, I added a disclaimer.
> > > > > Please:
> > > > > - tell me if you agree with the publication of such performance results
> > > > > - comment on the PR for the disclaimer.
> > > > > PR: https://github.com/apache/beam-site/pull/500
> > > > > 
> > > > > Thanks
> > > > > Etienne
> > > > > 
> > > > > On Thursday, 19 July 2018 at 12:30 +0200, Etienne Chauchot wrote:
> > > > > > Hi Anton, 
> > > > > > Yes, good idea, I'll update the Nexmark website page.
> > > > > > Etienne
> > > > > > On Wednesday, 18 July 2018 at 10:17 -0700, Anton Kedin wrote:
> > > > > > > These dashboards look great!
> > > > > > > 
> > > > > > > Can we publish the links to the dashboards somewhere, for better 
> > > > > > > visibility? E.g. on the Jenkins website /
> > > > > > > emails, or the wiki.
> > > > > > > 
> > > > > > > Regards,
> > > > > > > Anton
> > > > > > > On Wed, Jul 18, 2018 at 10:08 AM Andrew Pilloud 
> > > > > > > <apill...@google.com> wrote:
> > > > > > > > Hi Etienne,
> > > > > > > > 
> > > > > > > > I've been asking around and it sounds like we should be able to 
> > > > > > > > get a dedicated Jenkins node for
> > > > > > > > performance tests. Another thing that might help is making the 
> > > > > > > > runs a few times longer. They are
> > > > > > > > currently running around 2 seconds each, so the time spent 
> > > > > > > > building probably exceeds the time spent testing.
> > > > > > > > Internally at Google we are running them with 2000x as many 
> > > > > > > > events on Dataflow, but a job of that size
> > > > > > > > won't even complete on the Direct Runner.
> > > > > > > > I didn't see the query 3 issues, but now that you point it out 
> > > > > > > > it looks like a bug to me too.
> > > > > > > > 
> > > > > > > > Andrew
> > > > > > > > On Wed, Jul 18, 2018 at 1:13 AM Etienne Chauchot 
> > > > > > > > <echauc...@apache.org> wrote:
> > > > > > > > > Hi Andrew,
> > > > > > > > > Yes, I saw that. Apart from dedicating Jenkins nodes to Nexmark, I 
> > > > > > > > > see no other way.
> > > > > > > > > Also, did you see the query 3 output size on the direct runner? 
> > > > > > > > > It should be a straight line and it is not; I'm
> > > > > > > > > wondering if there is a problem with the state and timers 
> > > > > > > > > implementation in the direct runner.
> > > > > > > > > Etienne
> > > > > > > > > On Tuesday, 17 July 2018 at 11:38 -0700, Andrew Pilloud wrote:
> > > > > > > > > > I'm noticing the graphs are really noisy. It looks like we 
> > > > > > > > > > are running these on shared Jenkins
> > > > > > > > > > executors, so our perf tests are fighting with other builds 
> > > > > > > > > > for CPU. I've opened an issue 
> > > > > > > > > > https://issues.apache.org/jira/browse/BEAM-4804 and am 
> > > > > > > > > > wondering if anyone knows an easy fix to
> > > > > > > > > > isolate these jobs.
> > > > > > > > > > Andrew
> > > > > > > > > > On Fri, Jul 13, 2018 at 2:39 AM Łukasz Gajowy 
> > > > > > > > > > <lgaj...@apache.org> wrote:
> > > > > > > > > > > @Etienne: Nice to see the graphs! :)
> > > > > > > > > > > 
> > > > > > > > > > > @Ismael: Good idea, there's no document yet. I think we 
> > > > > > > > > > > could create a small google doc with
> > > > > > > > > > > instructions on how to do this.
> > > > > > > > > > > 
> > > > > > > > > > > On Fri, 13 Jul 2018 at 10:46, Etienne Chauchot 
> > > > > > > > > > > <echauc...@apache.org> wrote:
> > > > > > > > > > > > Hi, 
> > > > > > > > > > > > @Andrew, this is because I did not find a way to set 2 
> > > > > > > > > > > > scales on the Y axis on the perfkit
> > > > > > > > > > > > graphs. Indeed, numResults varies from 1 to 100 000 and 
> > > > > > > > > > > > runtimeSec is usually below 10 s.
> > > > > > > > > > > > Etienne
> > > > > > > > > > > > On Thursday, 12 July 2018 at 12:04 -0700, Andrew Pilloud wrote:
> > > > > > > > > > > > > This is great, should make performance work much 
> > > > > > > > > > > > > easier! I'm going to get the Beam SQL Nexmark
> > > > > > > > > > > > > jobs publishing as well. (Opened 
> > > > > > > > > > > > > https://issues.apache.org/jira/browse/BEAM-4774 to 
> > > > > > > > > > > > > track.) I
> > > > > > > > > > > > > might take on the Dataflow runner as well if no one 
> > > > > > > > > > > > > else volunteers.
> > > > > > > > > > > > > 
> > > > > > > > > > > > > I am curious as to why you have two separate graphs 
> > > > > > > > > > > > > for runtime and count rather than graphing
> > > > > > > > > > > > > count/runtime to get the throughput rate for each 
> > > > > > > > > > > > > run. Or should that be a third graph? It looks
> > > > > > > > > > > > > like it would just be a small tweak to the query in 
> > > > > > > > > > > > > perfkit.
> > > > > > > > > > > > > Andrew
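
The throughput idea above can be sketched as the number of results divided by the run time of a single run. This is only an illustration: the class Throughput and the method resultsPerSecond are hypothetical names, and the inputs are assumed to be the numResults and runtimeSec values mentioned elsewhere in this thread.

```java
import java.util.OptionalDouble;

/** Hypothetical helper, not part of Beam or perfkit. */
final class Throughput {
    private Throughput() {}

    /** Results per second for one run; empty when the run time is not a positive number. */
    static OptionalDouble resultsPerSecond(long numResults, double runtimeSec) {
        // !(x > 0.0) rejects zero, negative values and NaN in one check.
        if (!(runtimeSec > 0.0)) {
            return OptionalDouble.empty();
        }
        return OptionalDouble.of(numResults / runtimeSec);
    }

    public static void main(String[] args) {
        System.out.println(resultsPerSecond(100_000L, 2.0));
        System.out.println(resultsPerSecond(10L, 0.0));
    }
}
```

Returning an empty value for a zero run time avoids plotting an infinite rate when a run is too short to be measured.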
> > > > > > > > > > > > > On Thu, Jul 12, 2018 at 11:40 AM Pablo Estrada 
> > > > > > > > > > > > > <pabl...@google.com> wrote:
> > > > > > > > > > > > > > This is really cool Etienne :) Thanks for working 
> > > > > > > > > > > > > > on this. Out of curiosity, do you know how
> > > > > > > > > > > > > > often the tests run on each runner?
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > Best
> > > > > > > > > > > > > > -P.
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > On Thu, Jul 12, 2018 at 2:15 AM Romain Manni-Bucau 
> > > > > > > > > > > > > > <rmannibu...@gmail.com> wrote:
> > > > > > > > > > > > > > > Awesome Etienne, it is really important for the 
> > > > > > > > > > > > > > > (user) community to have that visibility,
> > > > > > > > > > > > > > > since it is one of the most important aspects of 
> > > > > > > > > > > > > > > Beam's quality. Kudos!
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > Romain Manni-Bucau
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > On Thu, 12 Jul 2018 at 10:59, Jean-Baptiste 
> > > > > > > > > > > > > > > Onofré <j...@nanthrax.net> wrote:
> > > > > > > > > > > > > > > > It's really great to have these dashboards and integration in Jenkins!
> > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > Thanks Etienne for driving this!
> > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > Regards
> > > > > > > > > > > > > > > > JB
> > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > On 11/07/2018 15:13, Etienne Chauchot wrote:
> > > > > > > > > > > > > > > > > Hi guys,
> > > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > > I'm glad to announce that the CI of Beam has much improved! Indeed,
> > > > > > > > > > > > > > > > > Nexmark is now included in the perfkit dashboards.
> > > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > > At each commit on master, Nexmark suites are run and plots are created
> > > > > > > > > > > > > > > > > on the graphs.
> > > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > > I've created 2 kinds of dashboards:
> > > > > > > > > > > > > > > > > - one for performance (run times of the queries)
> > > > > > > > > > > > > > > > > - one for the size of the output PCollection (which should be constant)
> > > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > > There are dashboards for these runners:
> > > > > > > > > > > > > > > > > - spark
> > > > > > > > > > > > > > > > > - flink
> > > > > > > > > > > > > > > > > - direct runner
> > > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > > Each dashboard contains:
> > > > > > > > > > > > > > > > > - graphs in batch mode
> > > > > > > > > > > > > > > > > - graphs in streaming mode
> > > > > > > > > > > > > > > > > - graphs for the 13 queries.
> > > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > > That gives more than a hundred graphs (my right finger hurts after so
> > > > > > > > > > > > > > > > > many clicks on the mouse :) ). It is detailed that much so that anyone
> > > > > > > > > > > > > > > > > can focus on the area they have interest in.
> > > > > > > > > > > > > > > > > Feel free to also create new dashboards with more aggregated data.
> > > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > > Thanks to Lukasz and Cham for reviewing my PRs and showing how to use
> > > > > > > > > > > > > > > > > perfkit dashboards.
> > > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > > Dashboards are there:
> > > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > > https://apache-beam-testing.appspot.com/explore?dashboard=5084698770407424
> > > > > > > > > > > > > > > > > https://apache-beam-testing.appspot.com/explore?dashboard=5699257587728384
> > > > > > > > > > > > > > > > > https://apache-beam-testing.appspot.com/explore?dashboard=5138380291571712
> > > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > > https://apache-beam-testing.appspot.com/explore?dashboard=5099379773931520
> > > > > > > > > > > > > > > > > https://apache-beam-testing.appspot.com/explore?dashboard=5731568492478464
> > > > > > > > > > > > > > > > > https://apache-beam-testing.appspot.com/explore?dashboard=5163657986048000
> > > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > > Enjoy,
> > > > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > > > Etienne
> 
>
