Re: [MTT devel] Analysis of hung jobs.

2009-10-08 Thread Ethan Mallove
On Thu, Oct/08/2009 03:18:07PM, Ashley Pittman wrote: > On Thu, 2009-10-08 at 09:51 -0400, Ethan Mallove wrote: > > > $ padb --verbose --debug=all --config-option rmgr=mpirun --full-report=6336 > > ... > > full job report for job 6336 > > > > Attachi

Re: [MTT devel] Analysis of hung jobs.

2009-10-08 Thread Ethan Mallove
On Wed, Oct/07/2009 09:38:07PM, Ashley Pittman wrote: > On Wed, 2009-10-07 at 16:21 -0400, Ethan Mallove wrote: > > > No secret file (/home/em162155/.padb-secret) > > Error: Could not load secret file on this node > > You need to do this once to set a secret key fo

Re: [MTT devel] Analysis of hung jobs.

2009-10-07 Thread Ethan Mallove
On Tue, Oct/06/2009 04:30:52PM, Ashley Pittman wrote: > On Tue, 2009-10-06 at 11:25 -0400, Ethan Mallove wrote: > > On Tue, Oct/06/2009 10:23:48AM, Ashley Pittman wrote: > > > > > > Further to the mail linked below, padb is able to perform diagnostics, > > &g

Re: [MTT devel] Analysis of hung jobs.

2009-10-06 Thread Ethan Mallove
On Tue, Oct/06/2009 10:23:48AM, Ashley Pittman wrote: > > Further to the mail linked below, padb is able to perform diagnostics, > including backtraces on hung jobs and integrates well into automated > testing environments. Can padb get a backtrace from a non-debuggable MPI (e.g., not compiled

Re: [MTT devel] [MTT svn] svn:mtt-svn r1319

2009-09-28 Thread Ethan Mallove
y_exec after_all_exec My thought was that it would make sense for them to use a similar naming scheme and implementation (e.g., use suffix "_exec" and be passed to DoCommand::Cmd()). -Ethan > > On Sep 24, 2009, at 3:09 PM, Ethan Mallove wrote: > >> I think on_stop shou

Re: [MTT devel] [MTT svn] svn:mtt-svn r1314

2009-09-09 Thread Ethan Mallove
will get expanded during "exec" line evaluation. Okay, so we delay the evaluation of "argv" to when "exec" is evaluated. The error case is: argv = _np() Before fix: argv is undefined After fix: argv is the value of _np() -Ethan > >regards >

Re: [MTT devel] [MTT svn] svn:mtt-svn r1314

2009-09-08 Thread Ethan Mallove
Mike, What if argv contains a funclet, e.g., argv = () Won't this change prevent it from getting expanded? -Ethan On Tue, Sep/08/2009 09:43:37AM, mi...@osl.iu.edu wrote: > Author: miked > Date: 2009-09-08 09:43:37 EDT (Tue, 08 Sep 2009) > New Revision: 1314 > URL:

Re: [MTT devel] [MTT svn] svn:mtt-svn r1306

2009-08-11 Thread Ethan Mallove
On Tue, Aug/11/2009 02:53:50PM, Mike Dubman wrote: >Hey Jeff, > >This code acts as a pre-processor during loading of ini file into mtt. >It replaces builtin vars %VAR% with their values, for example: > >... >[Test run: trivial] >my_sect_name=%INI_SECTION_NAME% >... >

Re: [MTT devel] use of ()

2009-07-06 Thread Ethan Mallove
On Mon, Jul/06/2009 10:25:51AM, Jeff Squyres wrote: > I was just trying to use () in an ini file and ran across an annoying > restriction: I had to make the whole thing be one long line: > > max_test_num = < ('open(IN, "./mpi_test_suite -l|") || die("cant open"); while () { > if (m/Num Tests :

Re: [MTT devel] MTT email timeout notification feature

2009-06-25 Thread Ethan Mallove
ional] timeout with the sentinel file? that is, > it'll send a mail, then wait another timeout (e.g., 1 hour) and if the > sentinel file still exists, mtt will remove the file and keep going > > > On Jun 19, 2009, at 2:47 PM, Ethan Mallove wrote: > >> Folks, >> >&

[MTT devel] MTT email timeout notification feature

2009-06-19 Thread Ethan Mallove
Folks, I came up with a feature, which does not seem quite appropriate to go into the MTT trunk, but is still possibly useful for someone other than me. I have posted a note about it on the MTT wiki: http://svn.open-mpi.org/trac/mtt/wiki/EmailTimeoutNotification Here's the text of the Wiki

Re: [MTT devel] Check a v3.0 commit

2009-04-29 Thread Ethan Mallove
On Wed, Apr/29/2009 02:34:54PM, Jeff Squyres wrote: > On Apr 29, 2009, at 2:32 PM, Ethan Mallove wrote: > >>> Can one of you guys sanity https://svn.open-mpi.org/trac/mtt/changeset/1283 >>> before I move it to the 3.0 branch? >>> >>> It should save some

Re: [MTT devel] Check a v3.0 commit

2009-04-29 Thread Ethan Mallove
On Wed, Apr/29/2009 12:30:25PM, Jeff Squyres wrote: > Can one of you guys sanity https://svn.open-mpi.org/trac/mtt/changeset/1283 > before I move it to the 3.0 branch? > > It should save some testing cycles if the OMPI tarball hasn't changed > versions from one day to the next (and you start

Re: [MTT devel] GSOC application

2009-04-22 Thread Ethan Mallove
Dancing bears on slide 1. We're off to a good start. -Ethan On Wed, Apr/22/2009 09:11:57AM, Jeff Squyres wrote: > The slides will also be on webex on the call tomorrow. Use the URL to join > the meeting in the email invite that you got. That URL will launch an > application thingy for the

Re: [MTT devel] GSOC application

2009-04-14 Thread Ethan Mallove
. I was on travel last week and > that always makes me waaay behind on my INBOX. *:-( > > On Mon, Apr 13, 2009 at 5:44 PM, Ethan Mallove <ethan.mall...@sun.com> >wrote: > >Will this translate to something like >lib/MTT/Reporter/GoogleDatabase

Re: [MTT devel] GSOC application

2009-04-13 Thread Ethan Mallove
r its derived sub-classes. > > The attached is archive with a simple test for using datastore for mtt. > Please see models.py file with proposed object model and comment. > I don't see the models.py attachment. Thanks, Ethan > You can run the attached example in t

Re: [MTT devel] [MTT svn] svn:mtt-svn r1273 (Analyze/Performance plug-ins)

2009-03-20 Thread Ethan Mallove
is ready to handle non-latency/bandwidth test data, we can split out the client-side Analyze/Performance directory. -Ethan > >On Thu, Mar 19, 2009 at 8:22 PM, Ethan Mallove <ethan.mall...@sun.com> >wrote: > > Hi Mike, > > Is HPL a latency and/or bandwi

Re: [MTT devel] GSOC application

2009-03-18 Thread Ethan Mallove
On Wed, Mar/18/2009 03:28:48PM, Josh Hursey wrote: > So they posted the list of accepted projects and we are -not- on it > for this year: > > http://socghop.appspot.com/program/accepted_orgs/google/gsoc2009 > > Maybe next year. I don't know if they will be sending around a note > regarding why

Re: [MTT devel] MTT on Windows

2009-03-11 Thread Ethan Mallove
On Wed, Mar/11/2009 03:47:22PM, Jeff Squyres wrote: > Thanks for your patience! Yes, this looks good to me with one minor nit: > >> +if(($sys_type == "Cygwin" || $sys_type == "Msys") && >> +$config->{compiler_name} == "microsoft") { > > should be > >> +if(($sys_type eq "Cygwin" ||

Re: [MTT devel] Time to make the 2009 mtt database partitions?

2008-12-01 Thread Ethan Mallove
> 2009-test-run.sql >> ./create-partition-indexes.pl 2009 XX > 2009-indexes.sql >> >> Then just run the generated SQL scripts into the database to create the >> tables. >> >> I can do this today, and send an email when it is done. I'll also add some

Re: [MTT devel] mpi_details section with different scenarios for command line params

2008-11-03 Thread Ethan Mallove
On Mon, Nov/03/2008 09:34:07AM, Mike Dubman wrote: >Hello Guys, > >Please suggest the proper way to handle the following: > >Is there any way to run "test run" section with a list >of "mpi_details" sections? Mike, There is currently no way to iterate over multiple mpi_details

Re: [MTT devel] [MTT svn] svn:mtt-svn r1216

2008-07-23 Thread Ethan Mallove
Nice! Which device(s) does DiskFree.pm check for space on? I occasionally run out of room in /tmp, so I would want it to check my swap space. Can we remove these? (All but the .pm and README files.) trunk/lib/Filesys/Changes trunk/lib/Filesys/MANIFEST trunk/lib/Filesys/Makefile.PL

Re: [MTT devel] bogus timestamps in database

2008-07-17 Thread Ethan Mallove
On Thu, Jul/17/2008 04:35:38PM, Jeff Squyres wrote: > Here's a fun report (as of 17 July 2008): > > http://www.open-mpi.org/mtt/index.php?do_redir=775 > > Note that two of the rows are in the future. :-) (Absoft has since fixed > the problem; ntp accidentally got turned off) > > Ethan and I

Re: [MTT devel] Tracking process stats

2008-04-22 Thread Ethan Mallove
; > what we would want to exercise in it. > > > > -josh > > > > On Apr 22, 2008, at 1:45 PM, Ethan Mallove wrote: > > > >> On Tue, Apr/22/2008 01:35:06PM, Josh Hursey wrote: > >>> On the Open MPI teleconf this morning Rich mentioned that > >&g

Re: [MTT devel] Tracking process stats

2008-04-22 Thread Ethan Mallove
On Tue, Apr/22/2008 01:35:06PM, Josh Hursey wrote: > On the Open MPI teleconf this morning Rich mentioned that > he was noticing odd memory usage. It got me thinking, > would it be useful for MTT to track important aspects of > the process such as memory use? > > Just a thought. I'm not exactly

Re: [MTT devel] [MTT bugs] [MTT] #355: tooltips for reporter

2008-04-21 Thread Ethan Mallove
Do these work for you? http://tinyurl.com/49m2n4 They work for me in IE (Windows) and Mozilla (Solaris), but not in Firefox and Opera. The joys of JavaScript :-) -Ethan On Mon, Apr/21/2008 01:32:49PM, MTT wrote: > #355: tooltips for reporter >

Re: [MTT devel] [MTT svn] svn:mtt-svn r1176

2008-04-04 Thread Ethan Mallove
I like the "all" keyword. Are these no longer needed? _mpi_get_names() _mpi_install_names() _test_get_names() _test_build_names() -Ethan On Fri, Apr/04/2008 03:31:07PM, jsquy...@osl.iu.edu wrote: > Author: jsquyres > Date: 2008-04-04 15:31:07 EDT (Fri, 04 Apr 2008) > New Revision: 1176

Re: [MTT devel] [MTT svn] svn:mtt-svn r1164

2008-03-20 Thread Ethan Mallove
I sense this "do_not_run" stuff could be useful to me, but I'm not sure. Can you give a simple use case for "do_not_run"? Could "do_not_run" be achieved by just commenting out the INI lines that pertain to a certain group? E.g., do these "#" comments ... # simple_really_slow:tests =

Re: [MTT devel] [MTT svn] svn:mtt-svn r1163

2008-03-17 Thread Ethan Mallove
On Mon, Mar/17/2008 11:01:11AM, Jeff Squyres wrote: > Ethan -- > > Was there a case where the trivial plugin was not correctly detecting > what language bindings to compile against? > The motivation is different than wanting to override the language binding detection. Sometimes I *only* want

Re: [MTT devel] Weird MTT test names

2008-03-06 Thread Ethan Mallove
You can go ahead and delete those rows. (I believe they are from the ORTE test suite I was working on.) -Ethan On Thu, Mar/06/2008 09:39:16AM, Josh Hursey wrote: > I'm trying to cleanup the test_suite/test_name tables in the database, > removing some test names that are invalid and

Re: [MTT devel] two recent commits

2008-02-13 Thread Ethan Mallove
On Wed, Feb/13/2008 10:35:51AM, Josh Hursey wrote: > I just committed to the trunk two revisions that I want to push to the > Open MPI version of MTT: > https://svn.open-mpi.org/trac/mtt/changeset/1154 > https://svn.open-mpi.org/trac/mtt/changeset/1155 > > r1154 is a performance fix which

Re: [MTT devel] Extracting transparent data from OMPI

2008-02-06 Thread Ethan Mallove
tions via ompi_info, if possible. -Ethan > > On Feb 6, 2008, at 1:44 PM, Josh Hursey wrote: > > > > > On Feb 6, 2008, at 11:32 AM, Ethan Mallove wrote: > > > >>>> > >>>> > >>>>> For the configure options we *could* p

[MTT devel] Crazy SkaMPI graph rendering

2008-01-28 Thread Ethan Mallove
Jeff, Do you have a link to a human-readable SkaMPI graph? I'm not sure what to make of this: http://www.open-mpi.org/mtt/index.php?do_redir=515 I'm not sure what is lacking here, the client-side parsing module, the server-side graph generation, or both. -Ethan

Re: [MTT devel] MTT Visualization

2008-01-11 Thread Ethan Mallove
On Fri, Jan/11/2008 12:49:50PM, Jeff Squyres wrote: > On Jan 10, 2008, at 10:29 AM, Josh Hursey wrote: > > > I met with Joseph Cottam (Grad student in my lab at IU) yesterday > > about MTT visualization. He is working on some new visualization > > techniques and wants to apply them to the MTT

Re: [MTT devel] MTT Visualization

2008-01-10 Thread Ethan Mallove
Woo hoo! The reporter has been much much more useful since the DB optimizations, though I wonder if in the next batch of changes we could also have #296 (a few more columns are needed)? (Not really related to visibility, but I thought I'd speak my mind while there seems to be another round of

Re: [MTT devel] Database Notice

2008-01-09 Thread Ethan Mallove
"pg_dump -s" seems to show that we are set until 2009? (Just put a note in my calendar about this for late December 2008 :-)) On Wed, Jan/09/2008 04:07:01PM, Josh Hursey wrote: > I was showing MTT to someone today and noticed that it was performing a > bit slower than it should. After taking a

Re: [MTT devel] [MTT svn] svn:mtt-svn r1094 (for review)

2007-11-01 Thread Ethan Mallove
Josh, Before this gets committed to the live submit.php, can you look at this? It should only effect the "environment" field. Thanks, Ethan On Thu, Nov/01/2007 03:30:26PM, emall...@osl.iu.edu wrote: > Author: emallove > Date: 2007-11-01 15:30:26 EDT (Thu, 01 Nov 2007) > New Revision: 1094 >

Re: [MTT devel] Handling "Interrupted system call" with MTT

2007-10-17 Thread Ethan Mallove
On Wed, Oct/17/2007 07:45:53AM, Jeff Squyres wrote: > On Oct 16, 2007, at 6:36 PM, Ethan Mallove wrote: > > >>> The bail is that "make" will eventually succeed or fail > >>> with something other than "interrupted system call". Do > >

[MTT devel] Handling "Interrupted system call" with MTT

2007-10-16 Thread Ethan Mallove
On certain NFS servers, I run into the error message "Interrupted system call" when executing long running commands such as "make all". One solution I've been able to use is to setup an NFS mount point solely for the cluster I'm using, but this is not always an option. The below link advises to

Re: [MTT devel] MTT server error (user: sun)

2007-09-19 Thread Ethan Mallove
On Mon, Sep/17/2007 10:46:45AM, Josh Hursey wrote: > I've been getting quite a few errors from submit.php of > the below form. It seems that the MPI Install that is > being referenced is not valid. Could this is an 'already > installed' issue where the user is trying to submit > results to the

Re: [MTT devel] sanity check a reporter commit

2007-09-18 Thread Ethan Mallove
I don't see the problem from the permalink posted in the ticket. Pak and I have also been looking at performance results this week without running into the jpgraph error. -Ethan On Thu, Sep/13/2007 04:44:20PM, Jeff Squyres wrote: > Ethan -- > > Can you sanity check a commit I just made in the

Re: [MTT devel] First cut at MTT web pages

2007-09-06 Thread Ethan Mallove
On Thu, Sep/06/2007 08:35:01AM, Jeff Squyres wrote: > I put up a skeleton of the MTT web pages on the OMPI web site, but > didn't link to them from anywhere. This actually involved changing a > bunch of infrastructure because we published the name /projects/mtt/ > in the paper but PLPA was

Re: [MTT devel] No nightly summary last night?

2007-08-30 Thread Ethan Mallove
cron/alerts.php said this last night: Sorry, this page is not mirrored. Please see the http://www.open-mpi.org/;>original version of this page on the main Open MPI web site. Is "curl" confused by deny_mirror()'s new location (see r949)? -Ethan On Thu, Aug/30/2007 04:34:34PM, Josh Hursey

Re: [MTT devel] [MTT svn] svn:mtt-svn r964

2007-08-30 Thread Ethan Mallove
Pak, we're scratching our heads on this new "developer results into central database" use case. Could you give more details? -Ethan On Thu, Aug/30/2007 01:32:26PM, Jeff Squyres wrote: > Ethan -- > > Did Pak really mean to submit to the database? > > On Aug 30, 2007, at 1:22 PM,