Re: [MTT devel] GSOC application

Jeff Squyres Mon, 23 Mar 2009 08:34:08 -0400

Yes, I think you're right -- making a "schema" for the datastore mightbe quite easy. I'm on travel all this week and likely won't be ableto look into this stuff -- can you guys post a proposal and we candive into it from that angle?


On Mar 22, 2009, at 6:48 AM, Mike Dubman wrote:

Hello guys,
I`m not sure if we should preserve current DB schema, from onesimple reason - datastore is an object oriented storage and havedifferent rules and techniques then rdbms.The basic storage unit in the datastore is an object which can besaved, loaded and queried.
(hadoop is based on the same principles, but open source.)
It seems that DB model for mtt over datastore should not be complexat all. The current mtt db schema is mostly optimized for specificqueries dictated by web UI. Datastore creates indexes automatically,based on submitted queries history.
I suggest we discuss/exchange db layout proposals by emails and whenwe get to some general understanding how it should look like - weswitch to telepresence.
Also, It seems not problem at all to get datastore access forexisting gmail account. You get 500MB quota for storage. It takes5min to start using it.
Here is some short info for datastore API:
- howto submit data model to datastore
- howto save, load, query

http://code.google.com/appengine/docs/python/gettingstarted/usingdatastore.html

please comment.

Thanks

Mike
On Fri, Mar 20, 2009 at 5:38 PM, Jeff Squyres <[email protected]>wrote:
On Mar 20, 2009, at 10:42 AM, Josh Hursey wrote:

Yeah I think this sounds like a good way to move forward with this
work. The database schema is pretty complex. If you need help on the
database side of things let me know.

To get started, would it be useful to have a meeting over the phone/
telepresence to design the datastore layout? This gives us an
opportunity to start from a blank slate with regards to the
datastore, so it may be useful brainstorm a bit beforehand.
Yes, it probably would. My understanding of hadoop (which is veryhighlevel) is that just dump everything in without too much concernabout the structure / "schema". But I could be wrong on that.
The Google Apps account is under my personal Google account, so I'm
reluctant to use it. I think the reason it took so long for me, was
because when I originally signed up it was in limited beta. I think
the approval time is much shorter now (maybe a day?), and we can make
an openmpi or mtt account that we can use.

With regard to Hadoop, I don't think that IU has a set of machines
that would work, but I can ask around. We could always try Hadoop on
a single machine if people wanted to play around with data querying/
storage.

I don't have a strong preference either way, but Google Apps may
provide us with a lower overhead solution for the long run even
though it costs $$.
It looks like there is a set that you can use for free. When you goover one of several metrics (CPU hours/day, storage, bandwidth in,bandwidth out, etc.), then you have to start paying. But even withthat, the costs look *quite* reasonable and should be easily coveredby the combined Open MPI organizations (I'm talking hundreds ofdollars here, not tens of thousands).
--
Jeff Squyres
Cisco Systems

_______________________________________________
mtt-devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/mtt-devel

_______________________________________________
mtt-devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/mtt-devel



--
Jeff Squyres
Cisco Systems

Re: [MTT devel] GSOC application

Reply via email to