Yes, my point was any testing I do isn't in the form of our usual test
suites. I would like get MTT rolling at some point at LLNL; though I
have (too many :() higher priorities. Also I'm not sure of the 'their
cost' vs. 'our value' ratio of doing runs at much more than 1024 procs
(or even that many).
Andrew
Jeff Squyres wrote:
I think Terry was asking about running at larger scale on a regular
basis for correctness testing (i.e., nightly snapshot tarballs via MTT).
I, for one, would love to see the labs run some of our nightly
tarballs at larger scale so that we have a more consistent datapoint
of what "works" and what "doesn't work" at scale (even if it's a
coarse-grained measurement of the tests we have in ompi-tests). Even
if the tests are not run nightly -- running even a subset of them
even once a week even at "medium" scale would be great. I realize
that even with large clusters, we're all resource-constrained
(needing to let real users run and all that), but any testing on a
regular basis (even if it's sparse) would be really, really great/
useful/good for the code/good for the community/etc.
(yes, this is a not-so-subtle hint :-) )
On Sep 17, 2007, at 11:15 AM, Andrew Friedley wrote:
I won't speak for the labs as a whole, but I generally don't run
things
at scale unless theres something specific I'm after, ie benchmarks or
apps I'm using as a benchmark, rather than test suites.
You might look at some of the purple benchmarks:
http://www.llnl.gov/asci/platforms/purple/rfp/benchmarks/limited/
code_list.html
Andrew
Terry Dontje wrote:
What about Sandia and LANL? Is there anything that is ran on their
large clusters to confirm things seem to work at high np's?
--td
Jeff Squyres wrote:
Cisco is not yet testing that large, but we plan to shortly start
testing at np>=128 (I'm waiting for an internal cluster within Cisco
to be setup properly).
On Sep 11, 2007, at 5:31 PM, rolf.vandeva...@sun.com wrote:
I am curious which tests are being used when running tests on
larger
clusters. And by larger clusters, I mean anything with np > 128.
(Although I realize that is not very large, but it is bigger
than most
of the clusters I assume tests are being run on)
I ask this because I planned on using some of the intel tests, but
they
clearly have limitations starting at np=64.
To avoid mailing list clutter, feel free to just email me and I
will
summarize.
Rolf
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel