I hope you'll understand if I don't run that test while on the road...one
battery yank per week is my limit :-)


On Fri, Sep 24, 2010 at 4:40 AM, Jeff Squyres (jsquyres) <jsquy...@cisco.com
> wrote:

> Also to clarify:
>
> - did autogen set am-jobs to 2 in your case?  (it should do that if lstopo
> is not found - it also limits itself to 4 at max)
>
> - in the same scenario, what happens if you manually set am-jobs to 1 and
> run autogen?  Ie do you get the same heat/sluggishness?  I have experienced
> vms causing this kind of behavior just because they are running - causing
> CPU and memory pressure.
>
> Sent from my PDA. No type good.
>
> On Sep 24, 2010, at 12:49 AM, "Ralph Castain" <r...@open-mpi.org> wrote:
>
> Sent to both for reference (see below)
>
> Just to clarify. It wasn't a deadlock situation, but rather that the
> machine was overloaded and running so hard that the response to keystrokes
> was multiple seconds. Thus, there was no way to shut it down from the
> keyboard or screen. Even a ctrl-c was just getting ignored for a very long
> time due to the overload.
>
> I was running vmware on my machine, and doing a heavy compile/build in it.
> On top of this, I had email, editor, and browsers running - and then kicked
> off a fresh build in a terminal window. With Jeff's default settings, this
> latter build thought it would be running alone on the machine, and promptly
> generated a number of threads equal to all the processors. Since they were
> already loaded, this drove the machine into the ground.
>
> My point is just that it is unwise to assume that the OMPI build can
> utilize all available processors. I'm sure it's fine for the MTT runs,
> especially on Jeff's machines as they are dedicated to that purpose - just
> not a good general assumption.
>
>
> HTH
> Ralph
>
> ====================================
> Output of "perl -V":
>
> Summary of my perl5 (revision 5 version 8 subversion 9) configuration:
>   Platform:
>     osname=darwin, osvers=10.2.0, archname=darwin-2level
>     uname='darwin <http://sjc-rcastain-87111.cisco.com>
> sjc-rcastain-87111.cisco.com 10.2.0 darwin kernel version 10.2.0: tue nov
> 3 10:37:10 pst 2009; root:xnu-1486.2.11~1release_i386 i386 '
>     config_args='-des -D prefix=/opt/local -D scriptdir=/opt/local/bin -D
> cppflags=-I/opt/local/include -D ccflags=-O2 -arch x86_64 -D
> ldflags=-L/opt/local/lib -D vendorprefix=/opt/local -D man1ext=1pm -D
> man3ext=3pm -D cc=/usr/bin/gcc-4.2 -D ld=/usr/bin/gcc-4.2 -D
> man1dir=/opt/local/share/man/man1p -D man3dir=/opt/local/share/man/man3p -D
> siteman1dir=/opt/local/share/man/man1 -D
> siteman3dir=/opt/local/share/man/man3 -D
> vendorman1dir=/opt/local/share/man/man1 -D
> vendorman3dir=/opt/local/share/man/man3 -D inc_version_list=5.8.8
> 5.8.8/darwin-2level -U i_bind -U i_gdbm -U i_db'
>     hint=recommended, useposix=true, d_sigaction=define
>     usethreads=undef use5005threads=undef useithreads=undef
> usemultiplicity=undef
>     useperlio=define d_sfio=undef uselargefiles=define usesocks=undef
>     use64bitint=define use64bitall=define uselongdouble=undef
>     usemymalloc=n, bincompat5005=undef
>   Compiler:
>     cc='/usr/bin/gcc-4.2', ccflags ='-O2 -arch x86_64 -fno-common
> -DPERL_DARWIN -I/opt/local/include -no-cpp-precomp -fno-strict-aliasing
> -pipe -I/usr/local/include -I/opt/local/include',
>     optimize='-O3',
>     cppflags='-I/opt/local/include -no-cpp-precomp -O2 -arch x86_64
> -fno-common -DPERL_DARWIN -I/opt/local/include -no-cpp-precomp
> -fno-strict-aliasing -pipe -I/usr/local/include -I/opt/local/include'
>     ccversion='', gccversion='4.2.1 (Apple Inc. build 5646) (dot 1)',
> gccosandvers=''
>     intsize=4, longsize=8, ptrsize=8, doublesize=8, byteorder=12345678
>     d_longlong=define, longlongsize=8, d_longdbl=define, longdblsize=16
>     ivtype='long', ivsize=8, nvtype='double', nvsize=8, Off_t='off_t',
> lseeksize=8
>     alignbytes=8, prototype=define
>   Linker and Libraries:
>     ld='env MACOSX_DEPLOYMENT_TARGET=10.3 /usr/bin/gcc-4.2', ldflags
> ='-L/opt/local/lib -L/usr/local/lib'
>     libpth=/usr/local/lib /opt/local/lib /usr/lib
>     libs=-ldbm -ldl -lm -lutil -lc
>     perllibs=-ldl -lm -lutil -lc
>     libc=/usr/lib/libc.dylib, so=dylib, useshrplib=false, libperl=libperl.a
>     gnulibc_version=''
>   Dynamic Linking:
>     dlsrc=dl_dlopen.xs, dlext=bundle, d_dlsymun=undef, ccdlflags=' '
>     cccdlflags=' ', lddlflags='-L/opt/local/lib -bundle -undefined
> dynamic_lookup -L/usr/local/lib'
>
>
> Characteristics of this binary (from libperl):
>   Compile-time options: PERL_MALLOC_WRAP USE_64_BIT_ALL USE_64_BIT_INT
>                         USE_FAST_STDIO USE_LARGE_FILES USE_PERLIO
>   Built under darwin
>   Compiled at Feb 13 2010 13:19:33
>   @INC:
>     /opt/local/lib/perl5/site_perl/5.8.9/darwin-2level
>     /opt/local/lib/perl5/site_perl/5.8.9
>     /opt/local/lib/perl5/site_perl
>     /opt/local/lib/perl5/vendor_perl/5.8.9/darwin-2level
>     /opt/local/lib/perl5/vendor_perl/5.8.9
>     /opt/local/lib/perl5/vendor_perl
>     /opt/local/lib/perl5/5.8.9/darwin-2level
>     /opt/local/lib/perl5/5.8.9
>     .
>
> On Thu, Sep 23, 2010 at 10:26 PM, Ralf Wildenhues <<ralf.wildenh...@gmx.de>
> ralf.wildenh...@gmx.de> wrote:
>
>> Hello Ralph,
>>
>> wow, that's not good to hear.  I knew the perl ithreads implementation
>> wasn't all that efficient, but causing a deadlock sounds like you have
>> more trouble than just perl; at least I hope so.  For reference, can
>> you send 'perl -V' output (if you like, to the bug-automake at
>> <http://gnu.org>gnu.org
>> list).
>>
>> Thanks,
>> Ralf
>>
>> * Ralph Castain wrote on Fri, Sep 24, 2010 at 03:12:16AM CEST:
>> > I found one major negative to this change - it assumes that my build is
>> > being done in exclusion of anything else on my computer. Unfortunately,
>> this
>> > is never true.
>> >
>> > So my laptop hemorrhaged itself into frozen silence, overheated to the
>> point
>> > of being burning hot, and had to have its battery yanked to stop the
>> runaway
>> > behavior. Not a really good thing.
>> >
>> > I would suggest you default this "heuristic" out, and let someone set it
>> to
>> > use multiple runs if-and-only-if they want it. Hate to cite the lowest
>> > common denominator, but this was a very nasty surprise.
>> >
>> >
>> >
>> > On Wed, Sep 22, 2010 at 7:50 AM, Jeff Squyres < <jsquy...@cisco.com>
>> jsquy...@cisco.com> wrote:
>> >
>> > > Some of you may be unaware that recent versions of automake can run in
>> > > parallel.  That is, automake will run in parallel with a degree of (at
>> most)
>> > > $AUTOMAKE_JOBS.  This can speed up the execution time of autogen.plquite
>> > > a bit on some platforms.  On my cluster at cisco, here's a few quick
>> timings
>> > > of the entire autogen.pl process (of which, automake is the
>> bottleneck):
>> > >
>> > > $AUTOMAKE_JOBS           Total wall time
>> > > value                    of autogen.pl
>> > > 8                        3:01.46
>> > > 4                        2:55.57
>> > > 2                        3:28.09
>> > > 1                        4:38.44
>> > >
>> > > This is an older Xeon machine with 2 sockets, each with 2 cores.
>> > >
>> > > There's a nice performance jump from 1 to 2, and a smaller jump from 2
>> to
>> > > 4.  4 and 8 are close enough to not matter.  YMMV.
>> > >
>> > > I just committed a heuristic to autogen.pl to setenv AUTOMAKE_JOBS if
>> it
>> > > is not already set (<https://svn.open-mpi.org/trac/ompi/changeset/23788>
>> https://svn.open-mpi.org/trac/ompi/changeset/23788):
>> > >
>> > > - If lstopo is found in your $PATH, runs it and count how many PU's
>> > > (processing units) you have.  It'll set AUTOMAKE_JOBS to that number,
>> or a
>> > > maximum of 4 (which is admittedly a further heuristic).
>> > > - If lstopo is not found, it just sets AUTOMAKE_JOBS to 2.
>> > >
>> > > Enjoy.
>> > >
>> > > --
>> > > Jeff Squyres
>> > > <jsquy...@cisco.com>jsquy...@cisco.com
>> > > For corporate legal information go to:
>> > > <http://www.cisco.com/web/about/doing_business/legal/cri/>
>> http://www.cisco.com/web/about/doing_business/legal/cri/
>> _______________________________________________
>> devel mailing list
>>  <de...@open-mpi.org>de...@open-mpi.org
>>  <http://www.open-mpi.org/mailman/listinfo.cgi/devel>
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>
>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>
>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>

Reply via email to