[MTT devel] MTT GDS -- one more...

2010-02-11 Thread Jeff Squyres
Heh... even more questions...

(BTW, Ethan and I have asked so many questions that if it helps, I can set up 
a webex and we can all discuss this in person rather than via 1,000,000 
annoying emails from us.  :-)  Webex can call you; no one will need to pay for 
an international call.)

1. It looks like the main benefits of using the Google App Engine -- 
specifically for MTT -- are that we can use the GDS and/or we can host an 
application on their web servers.  Is that correct?

2. In reading through the Google App Engine docs, it looks like we can access 
GDS data mainly through GQL.  I don't see any mention of doing 
map/reduce kinds of computations (Ethan and I were talking on the phone today 
about MTT App Engine possibilities).  I'm new to all this stuff, so it's quite 
possible that a) I missed it, or b) I just don't understand what I'm 
seeing/reading yet.  Or does GQL do map/reduce on the back end to do its magic?  
Is GQL the main/only way we have to access GDS?

3. Is there a reason that MTTGDS.pm doesn't use the Python API to talk directly 
to GDS?  I.e., what is the rationale for using a web app on App Engine?  Is the 
web app doing stuff that we can't do at the client?  Ditto for bquery.pl and 
breport.pl.  (These questions are partially fueled by my curiosity and concern 
about why we're using so much CPU at Google.)

-- 
Jeff Squyres
jsquy...@cisco.com

For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/




[MTT devel] More GDS questions

2010-02-11 Thread Jeff Squyres
Igor et al. -- 

1. I'm not sure you saw Ethan's and my posts from the past day or so about GDS 
on the mtt-devel list; it just occurred to me that I don't know if you're 
members of the list or not.  We've posted a few questions and comments that you 
may not have received if you're not on the list:

http://www.open-mpi.org/community/lists/mtt-devel/2010/02/index.php

2. I'm still looking into the Perl syntax error that caused my Big Submit to 
GDS to fail.  But looking at the Google logs, it looks like at least *some* of 
my test run results made it up to GDS.  There was a BIG spike in CPU usage (3.2 
hours of CPU time!) when it submitted -- see the attached CPU usage graph from 
the apps dashboard.

Does anyone know why it takes so much CPU just to submit data to GDS?  3.2 CPU 
hours is a LOT!

It makes me a bit concerned that only part of a single Cisco MTT run's submit 
chewed through almost half of our daily CPU quota (6.5 CPU hours/day).  Is 
there any way to reduce the amount of CPU necessary just to submit data?
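
For scale, the "almost half" figure can be checked with a quick calculation. This is only a sketch using the two numbers quoted above; the variable names are illustrative and do not come from any MTT or App Engine code.

```python
# Figures quoted in the message above; variable names are illustrative.
submit_cpu_hours = 3.2   # CPU time consumed by the partial submit
daily_quota_hours = 6.5  # daily CPU quota

fraction_used = submit_cpu_hours / daily_quota_hours
remaining_hours = daily_quota_hours - submit_cpu_hours

print(f"{fraction_used:.0%} of the daily quota used, "
      f"{remaining_hours:.1f} CPU hours left")
```

That works out to roughly 49% of the quota for a partial submit, leaving about 3.3 CPU hours for everything else that day.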

-- 
Jeff Squyres
jsquy...@cisco.com



[MTT devel] GDS errors

2010-02-11 Thread Jeff Squyres
Looking in the appspot dashboard, I see a bunch of errors when Cisco tried to 
submit test run data.  There are a few random errors, but a bunch that look like 
what I pasted below.  How do I diagnose this further?  Clearly, some field is 
too long -- how do I find out which one?

-
128.107.241.170 - - [11/Feb/2010:00:51:21 -0800] "POST /client HTTP/1.1" 500 1972 - "MPI Test MTTGDS Reporter,gzip(gfe)" "open-mpi-mtt.appspot.com"
E02-11 12:51AM 21.241
Property data_message_size is 667 bytes long; it must be 500 or less. Consider Text instead, which can store strings of any length.
Traceback (most recent call last):
  File "/base/python_lib/versions/1/google/appengine/ext/webapp/__init__.py", line 509, in __call__
    handler.post(*groups)
  File "/base/data/home/apps/open-mpi-mtt/1.337140739868725607/main.py", line 961, in post
    status = self._submit();
  File "/base/data/home/apps/open-mpi-mtt/1.337140739868725607/main.py", line 485, in _submit
    test_run_phase.put()
  File "/base/python_lib/versions/1/google/appengine/ext/db/__init__.py", line 801, in put
    self._populate_internal_entity()
  File "/base/python_lib/versions/1/google/appengine/ext/db/__init__.py", line 779, in _populate_internal_entity
    self._entity = self._populate_entity(_entity_class=_entity_class)
  File "/base/python_lib/versions/1/google/appengine/ext/db/__init__.py", line 839, in _populate_entity
    self._to_entity(entity)
  File "/base/python_lib/versions/1/google/appengine/ext/db/__init__.py", line 1465, in _to_entity
    entity[key] = value
  File "/base/python_lib/versions/1/google/appengine/api/datastore.py", line 492, in __setitem__
    datastore_types.ValidateProperty(name, value)
  File "/base/python_lib/versions/1/google/appengine/api/datastore_types.py", line 1290, in ValidateProperty
    prop_validator(name, v)
  File "/base/python_lib/versions/1/google/appengine/api/datastore_types.py", line 1181, in ValidatePropertyString
    ValidateStringLength(name, value, max_len=_MAX_STRING_LENGTH)
  File "/base/python_lib/versions/1/google/appengine/api/datastore_types.py", line 1171, in ValidateStringLength
    (name, len(value), max_len))
BadValueError: Property data_message_size is 667 bytes long; it must be 500 or less. Consider Text instead, which can store strings of any length.
-
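
The BadValueError in the log names the offending property itself (data_message_size, 667 bytes against a 500-byte limit). As a rough sketch of how a record could be checked for such fields before it is submitted, the snippet below scans a dictionary of values and reports any string longer than the limit. The function name, the record layout, and the use of modern Python 3 are assumptions for illustration; none of this is taken from MTTGDS.pm or the appspot main.py.

```python
MAX_STRING_BYTES = 500  # limit quoted in the BadValueError above


def find_oversized(fields, limit=MAX_STRING_BYTES):
    """Return (name, byte_length) pairs for string values longer than limit."""
    oversized = []
    for name, value in fields.items():
        if isinstance(value, str):
            size = len(value.encode("utf-8"))
            if size > limit:
                oversized.append((name, size))
    return oversized


# Hypothetical record; only data_message_size and its 667-byte length
# come from the log above.
record = {"data_message_size": "x" * 667, "status": "1"}
for name, size in find_oversized(record):
    print(f"{name}: {size} bytes (limit {MAX_STRING_BYTES})")
```

A check like this only reports the problem; as the error text suggests, the server-side alternative would be to store such values as Text rather than as a length-limited string.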

-- 
Jeff Squyres
jsquy...@cisco.com





Re: [MTT devel] 500 Internal Server Error from open-mpi-mtt.appspot.com

2010-02-11 Thread Jeff Squyres
After looking through the logs, Ethan and I *think* that this was just a query 
that was too large (i.e., it used too much CPU, and therefore it was killed).

Can someone with a little more knowledge than us have a look at the logs and 
let us know if we're right?


On Feb 11, 2010, at 2:05 PM, Ethan Mallove wrote:

> Hi,
> 
> I'm getting a 500 Internal Server Error using bquery.pl.  I can --ping
> successfully:
> 
>   $ client/bquery.pl --ping --server=http://open-mpi-mtt.appspot.com/ \
>       --password=x --username=sun
>   Ping is successful.
> 
> But an actual query gets an error:
> 
>   $ client/bquery.pl --server=http://open-mpi-mtt.appspot.com/ \
>       --password=x --username=sun --query \
>       --gqls="select * from TestRunPhase where status=1" --dir="bquery-test"
>   Error at http://open-mpi-mtt.appspot.com//client
>   500 Internal Server Error
> 
> -Ethan
> 


-- 
Jeff Squyres
jsquy...@cisco.com





[MTT devel] 500 Internal Server Error from open-mpi-mtt.appspot.com

2010-02-11 Thread Ethan Mallove
Hi,

I'm getting a 500 Internal Server Error using bquery.pl.  I can --ping
successfully:

  $ client/bquery.pl --ping --server=http://open-mpi-mtt.appspot.com/ \
      --password=x --username=sun
  Ping is successful.

But an actual query gets an error:

  $ client/bquery.pl --server=http://open-mpi-mtt.appspot.com/ --password=x \
      --username=sun --query \
      --gqls="select * from TestRunPhase where status=1" --dir="bquery-test"
  Error at http://open-mpi-mtt.appspot.com//client
  500 Internal Server Error

-Ethan


[MTT devel] MTTGDS issues

2010-02-11 Thread Jeff Squyres
1. Can you guys describe what MTTGDS expects from the performance analyzer 
modules?

I ran a bunch of NetPipe tests and the MTTGDS performance analyzer failed to run 
-- did you guys change the specifications for the performance analyzer modules?

*** WARNING: Could not run module
MTT::Test::Analyze::Performance::NetPipe:PreReport: Undefined
subroutine &MTT::Test::Analyze::Performance::NetPipe::PreReport called
at (eval 335838) line 1.

2. I ran 24+ hours of MTT tests and the MTTGDS reporter failed to submit the 
results.  :-(

*** ERROR: Module aborted: MTT::Reporter::MTTGDS:Finalize: Nested
quantifiers in regex; marked by <-- HERE in m/\s[\S/\\]*mpi2c++ <--
HERE _test.*/ at /home/jsquyres/svn/mtt/lib/MTT/Reporter/MTTGDS.pm line
498.

Some of my INI section names have special characters in them (e.g., "mpi2c++"); 
it looks like this might be what tripped up some regexp.  I'll have a look at 
this one now...
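
The usual cure for this class of error is to escape the section name before interpolating it into the pattern, so that "+" is matched literally instead of being read as a quantifier. In Perl that is done with quotemeta() or \Q...\E. Line 498 of MTTGDS.pm is not shown here, so whether that is the right fix for it is an assumption. The sketch below shows the same idea in Python with re.escape; the surrounding pattern is copied from the error message, and the sample input line is hypothetical.

```python
import re

section = "mpi2c++"  # INI section name from the error above

# Escape the name so "+" is matched literally rather than read as a quantifier.
pattern = re.compile(r"\s[\S/\\]*" + re.escape(section) + r"_test.*")

sample = "run /tmp/mpi2c++_test --verbose"  # hypothetical input line
match = pattern.search(sample)
print(match.group(0) if match else "no match")
```

Any other section names containing regex metacharacters (".", "(", "[", and so on) would be covered by the same escaping step.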

Is there a way to re-submit my data to GDS?

-- 
Jeff Squyres
jsquy...@cisco.com
