Re: [MTT users] Discussion on teleconf yesterday?

Jeff Squyres Fri, 27 Oct 2006 07:39:20 -0400

On Oct 25, 2006, at 10:37 AM, Josh Hursey wrote:

The discussion started with the bug characteristics of v1.2 versus the trunk.


Gotcha.

It seemed from the call that IU was the only institution that can assess this via MTT, as no one else spoke up. Since people were interested in seeing things that were breaking, I suggested that I start forwarding the IU internal MTT reports (run nightly and weekly) to the test...@open-mpi.org. This was met by Brian insisting that it would result in "thousands" of emails to the development list. I clarified that it is only 3 - 4 messages a day from IU. However, if all other institutions do this then it would be a bunch of email (where 'a bunch' would still be less than 'thousands'). That's how we got to a 'we need a single summary presented to the group' comment. It should be noted that we brought up IU sending to the 'test...@open-mpi.org' list as a bandaid until MTT could do it better.


How about sending them to me and Ethan?

This single summary can be email or a webpage that people can check. Rich said that he would prefer a webpage, and no one else really had a comment. That got us talking about the current summary page that MTT generates. Tim M mentioned that with the current website it is difficult to figure out how to get the answers you need. I agree, it is hard [usability] for someone to go to the summary page and answer the question "So what failed from IU last night, and how does that differ from yesterday -- e.g., what regressed and progressed yesterday at IU?". The website is flexible enough to do it, but having a couple of basic summary pages would be nice for basic users. What that should look like we can discuss further.
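To make the "what regressed and progressed" question concrete, here is a minimal Python sketch. The function name and the data shape (a dict mapping test name to a pass/fail boolean for each night) are assumptions for illustration only; they are not MTT's actual data model or reporter code.

```python
def compare_runs(yesterday, today):
    """Compare two nightly runs.

    Both arguments are hypothetical {test_name: passed} dicts, where
    passed is True or False. Only tests present in both runs can be
    classified as regressed or progressed.
    """
    common = yesterday.keys() & today.keys()
    return {
        # Passed yesterday, fails today.
        "regressed": sorted(t for t in common if yesterday[t] and not today[t]),
        # Failed yesterday, passes today.
        "progressed": sorted(t for t in common if not yesterday[t] and today[t]),
        # Failed on both nights.
        "still_failing": sorted(
            t for t in common if not yesterday[t] and not today[t]
        ),
        # Present today but not run yesterday.
        "new_tests": sorted(today.keys() - yesterday.keys()),
    }
```

A basic summary page could show only the "regressed" and "still_failing" lists per institution and leave the full query interface for advanced use.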

Agreed; we aren't super-fond of the current web page, either. Do you guys want to have a teleconf to go over the current status of MTT, where you want it to go, etc.? I consider IU's input here quite important, since you're the ones pushing the boundaries, flexing MTT's muscles, etc.

The IU group really likes the emails that we currently generate: a plain-text summary of the previous run. I posted copies on the MTT bug tracker here:
http://svn.open-mpi.org/trac/mtt/ticket/61
Currently we have not put the work in to aggregate the runs, so for each ini file that we run we get 1 email to the IU group. This is fine for the moment, but as we add the rest of the clusters and dimensions in the testing matrix we will need MTT to aggregate the results for us and generate such an email.

Ok.

We created another ticket yesterday to make a new MTT Reporter (our internal plugins) that duplicates this output format. It actually shouldn't be that hard -- we don't have to do parsing to get the numbers that you're reporting; we have access to the actual data. So it's mostly caching the data, calculating the totals that you're calculating, and printing in your output format.
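As a rough sketch of the "cache the data, calculate the totals, print" idea, something like the following Python would do. The record fields (`ini`, `phase`, `passed`, `failed`) and the column layout are assumptions made up for illustration; the real IU format is the one attached to ticket 61, and the real reporter would be an MTT plugin rather than this code.

```python
from collections import defaultdict


def aggregate(results):
    """Sum pass/fail counts per phase across all ini-file runs.

    `results` is a hypothetical list of dicts such as
    {"ini": "odin.ini", "phase": "Test run", "passed": 10, "failed": 2}.
    """
    totals = defaultdict(lambda: {"passed": 0, "failed": 0})
    for record in results:
        totals[record["phase"]]["passed"] += record["passed"]
        totals[record["phase"]]["failed"] += record["failed"]
    return dict(totals)


def format_summary(totals):
    """Render the per-phase totals as a small plain-text table."""
    lines = [f"{'Phase':<14}{'Pass':>6}{'Fail':>6}"]
    for phase in sorted(totals):
        counts = totals[phase]
        lines.append(f"{phase:<14}{counts['passed']:>6}{counts['failed']:>6}")
    return "\n".join(lines)
```

Because the totals are computed from the stored records, one email can cover every ini file from the previous night instead of one email per ini file.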

Ethan has some other short tasks to do before he gets to this, but it's near the top of the priority list. You can see the current workflow on the wiki (this is a living document; it keeps changing as requirements, etc. change):


    http://svn.open-mpi.org/trac/mtt/wiki/TaskPlan

So I think the general feel of the discussion is that we need the following from MTT:
- A 'basic' summary page providing answers to some general frequently asked queries. The current interface is too advanced for the current users.

We have the summary.php page, but I personally have never found it too useful. :-)

We're getting towards a full revamp of reporter.php (got some other tasks to complete first, but we're definitely starting to think about it) -- got any ideas / input? Our "haven't thought about it much yet" idea is to be more menu/Q-A driven with a few common queries easily available (rather than a huge, complicated single screen).

- A summary email [in plain-text preferably] similar to the one that IU generated showing an aggregation of the previous night's results for (a) all reporters (b) my institution [so I can track them down and file bugs].

For the moment, we don't have the dynamic capability for you to log into the web page, create a report, and say "mail this to me nightly". However, Ethan can make up custom reports on the server quite easily -- if you want some IU-specific reports, just file a ticket and Ethan can Make It So.

- 1 email a day on the previous night's testing results.

That's what we intended for the mails that are coming today, but it seemed to not be sufficient -- we ended up with 4 nightly mails: one for each relevant phase's failures and a 4th showing the stderr of MPI installs.

Some relevant bugs currently in existence:
http://svn.open-mpi.org/trac/mtt/ticket/92
http://svn.open-mpi.org/trac/mtt/ticket/61
http://svn.open-mpi.org/trac/mtt/ticket/94
The other concern is that, given the frequency of testing, as bugs appear from the testing someone needs to make sure the bug tracker is updated. I think the group is unclear about how this is done. Meaning, when MTT identifies a test as failed, who is responsible for putting the bug in the bug tracker?

At the moment, I've been manually examining the mails every day and firing off e-mails to those responsible. However, due to travel last week and this week, I've gotten quite behind. :-(

The obvious solution is the institution that identified the bug. [Warning: My opinion] But then that becomes unwieldy for IU since we have a large testing matrix, and would need to commit someone to doing this every day (and it may take all day to properly track a set of bugs). Also, this kind of punishes an institution for testing more instead of providing an incentive to test.

True. I don't know the proper answer to this, either -- I know the "Jeff look at e-mail" solution doesn't scale well.

------ Page Break -- Context switch ------
In case you all want to know what we are doing here at IU, I attached to this email our planned MTT testing matrix. Currently we have BigRed and Odin running the complete matrix less the BLACS tests. Wotan and Thor will come online as we get more resources to support them.
In order to do such a complex testing matrix we have various .ini files that we use. And since some of the dimensions in the matrix are large, we break some of the tests into a couple of .ini files that are submitted concurrently to have them run in a reasonable time.
<MTT-testing-matrix.txt>
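For readers wondering what breaking a large dimension across a couple of .ini files could look like, here is a minimal Python sketch of the splitting step only. The function name and the idea of generating one .ini file per slice are assumptions for illustration; this is not how IU's files are actually produced, and the sketch does not write MTT .ini syntax.

```python
def split_dimension(values, n_chunks):
    """Split one matrix dimension into n_chunks nearly equal slices.

    Each slice could then go into its own (hypothetical) .ini file so
    the pieces can be submitted concurrently.
    """
    if n_chunks <= 0:
        raise ValueError("n_chunks must be positive")
    size, extra = divmod(len(values), n_chunks)
    chunks = []
    start = 0
    for i in range(n_chunks):
        # The first `extra` slices take one additional element each.
        end = start + size + (1 if i < extra else 0)
        chunks.append(values[start:end])
        start = end
    return chunks
```

Contiguous slices keep related test cases together, which makes it easier to see which file a given failure came from.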


Awesome.

I would like to schedule some phone time with you guys and Ethan and me to talk about what's working, what's not working, etc. One obvious question I have is: is the INI config file format suitable? Do we need to do something more complex that would allow consolidation of your various configurations? ...etc.


--
Jeff Squyres
Server Virtualization Business Unit
Cisco Systems
