Hi Chris,

Thats a good reminder. There were few masters research projects exploring OODT 
integration with Airavata. Let me find their emails and poke them to give 
insights on this thread. 

Suresh

On Mar 2, 2014, at 3:39 PM, Mattmann, Chris A (3980) 
<[email protected]> wrote:

> Guys,
> 
> Has there been any thought to using the Apache OODT file manager
> as the Airavata registry? Would seem to fit the use cases..
> 
> Cheers,
> Chris
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-283, Mailstop: 171-246
> Email: [email protected]
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 
> 
> 
> 
> 
> 
> -----Original Message-----
> From: Eran Chinthaka Withana <[email protected]>
> Reply-To: "[email protected]"
> <[email protected]>
> Date: Sunday, March 2, 2014 12:31 PM
> To: "[email protected]" <[email protected]>
> Subject: Re: Object Database Suggestions for Airavata Registry
> 
>> Hi Suresh,
>> 
>> Sorry for the late reply. I don't think I can make it at 1pm PST today.
>> Can
>> we please re-schedule this to 5pm PST (8pm EST) or later?
>> 
>> Thanks,
>> Eran Chinthaka Withana
>> 
>> 
>> On Sun, Mar 2, 2014 at 6:38 AM, Suresh Marru <[email protected]> wrote:
>> 
>>> Hi All,
>>> 
>>> Great to see we have a good quorum. So how about 4pm EST (1pm PST) today
>>> with a hangout on air. It works best if we start a a hangout then
>>> (previous
>>> attempts to pre-schedules on-air events did not work well. So please
>>> check
>>> this mailing list around 4pm EST for the hangout on air link.
>>> 
>>> Meanwhile, please join the Airavata Google Plus community, that might be
>>> easier to share the link -
>>> https://plus.google.com/communities/100700433662281905708
>>> 
>>> Thanks all for willing to take time on a sunday,
>>> Suresh
>>> 
>>> On Feb 28, 2014, at 9:15 PM, Supun Kamburugamuva <[email protected]>
>>> wrote:
>>> 
>>>> +1 for Sunday afternoon. I can make it after 4 pm EST.
>>>> 
>>>> Thanks,
>>>> Supun..
>>>> 
>>>> 
>>>> On Fri, Feb 28, 2014 at 5:04 PM, Shameera Rathnayaka <
>>> [email protected]
>>>>> wrote:
>>>> 
>>>>> +1
>>>>> 
>>>>> Thanks,
>>>>> Shameera.
>>>>> 
>>>>> 
>>>>> On Sat, Mar 1, 2014 at 3:11 AM, Eran Chinthaka Withana <
>>>>> [email protected]> wrote:
>>>>> 
>>>>>> +1 for Sunday afternoon
>>>>>> 
>>>>>> Thanks,
>>>>>> Eran Chinthaka Withana
>>>>>> 
>>>>>> 
>>>>>> On Fri, Feb 28, 2014 at 5:17 AM, Suresh Marru <[email protected]>
>>> wrote:
>>>>>> 
>>>>>>> Hi Eran,
>>>>>>> 
>>>>>>> This is a great idea. I myself owe few replies on this thread and
>>>>> unable
>>>>>>> to take time to comprehend my thoughts (and realized I should take
>>> time
>>>>>> to
>>>>>>> properly articulate the challenges otherwise we will be discussing
>>>>>>> orthogonal issues).
>>>>>>> 
>>>>>>> A hangout will help us brainstorm more comprehensively. We can
>>> have it
>>>>> on
>>>>>>> air so we can refer back for archival purposes. How is Sunday
>>> afternoon
>>>>>> for
>>>>>>> everyone willing to join and contribute?
>>>>>>> 
>>>>>>> Thanks,
>>>>>>> Suresh
>>>>>>> 
>>>>>>> On Feb 28, 2014, at 1:45 AM, Eran Chinthaka Withana <
>>>>>>> [email protected]> wrote:
>>>>>>> 
>>>>>>>> Hi,
>>>>>>>> 
>>>>>>>> Is there any chance of hosting a google hangout to talk about
>>> this. I
>>>>>>> think
>>>>>>>> with long emails and multiple directions things are getting little
>>>>> bit
>>>>>>>> confusing in thread (I'm partly responsible for this :) ). I can
>>>>> join a
>>>>>>>> video chat during a weekend but lets make sure its convenient for
>>>>> both
>>>>>>> east
>>>>>>>> and west coasts :)
>>>>>>>> 
>>>>>>>> WDYT?
>>>>>>>> 
>>>>>>>> Thanks,
>>>>>>>> Eran Chinthaka Withana
>>>>>>>> 
>>>>>>>> 
>>>>>>>> On Mon, Feb 24, 2014 at 9:32 AM, Suresh Marru <[email protected]>
>>>>>> wrote:
>>>>>>>> 
>>>>>>>>> I could respond to each thread in detail, but I see the general
>>>>> sense
>>>>>> is
>>>>>>>>> inquiring on the use case, so let me try and explain this and
>>> see if
>>>>>> it
>>>>>>>>> comes across. I am fully onboard with perceptions of relational
>>> vs
>>>>>> nosql
>>>>>>>>> and also agree current Airavata needs are not a direct map for
>>> NoSQL
>>>>>>>>> migration. I will summarize the driving motivation:
>>>>>>>>> 
>>>>>>>>> Background: The key problem Airavata needs to solve is getting
>>> the
>>>>> API
>>>>>>> and
>>>>>>>>> associated data model right. The problem is current relational
>>>>>> database
>>>>>>>>> (with OpenJPA overlay) is severely limiting the API evolution.
>>>>> Science
>>>>>>>>> Gateways by nature are very science domain and use-case specific.
>>>>> But
>>>>>>>>> Airavata is tackling this challenging problem of providing a
>>> generic
>>>>>> API
>>>>>>>>> which will meet and enable these use case centric integration.
>>> The
>>>>>> issue
>>>>>>>>> here is, we are designing an API to handle a wide range of known
>>>>> (and
>>>>>>> some
>>>>>>>>> foreseen) use cases. But at the same time trying to keep it
>>> simple
>>>>> and
>>>>>>> yet
>>>>>>>>> flexible. The only way we can get through a reasonable,
>>> normalized
>>>>>>> version
>>>>>>>>> of API is by hands-on programming against the API. Within the
>>>>> Airavata
>>>>>>> PMC
>>>>>>>>> itself, we can solicit a half-a-dozen different ways on how to
>>>>>> visualize
>>>>>>>>> the data model. And we need few hackethon's with real-end users
>>> of
>>>>>>> Airavata
>>>>>>>>> until we find a common ground. All of this needs rapid
>>> prototyping.
>>>>>>>>> Currently a slight change in the data model is taking close to
>>> two
>>>>>>> weeks of
>>>>>>>>> re-arcitecting the Open-JPA based registry. There are many known
>>>>>>> problems
>>>>>>>>> with current draft of data model which have to be put-down in the
>>>>>>> interest
>>>>>>>>> of making over all system progress.
>>>>>>>>> 
>>>>>>>>> So the driving motivation is not certainly any of the classic
>>> NoSQL
>>>>>>> needs.
>>>>>>>>> But a simple one, can we have registry which is schema-agnostic
>>> and
>>>>>> yet
>>>>>>> is
>>>>>>>>> queriable for most of the fields in the model? Can we try 10
>>>>> different
>>>>>>>>> variants of data model (hence API) within the next 3 months with
>>>>>> focused
>>>>>>>>> hackethon's and arrive at a stable 1.0 version of API?
>>>>>>>>> 
>>>>>>>>> Part one is the discussion is successful that it raised every
>>> one's
>>>>>> eye
>>>>>>>>> brows. Now that we have every one's attention, what will be a
>>> good
>>>>>> data
>>>>>>>>> store for Airavata which will meet these needs?
>>>>>>>>> 
>>>>>>>>> P.S: Additional background: The API has been in development for
>>>>> close
>>>>>>> to 3
>>>>>>>>> years and is falling short of pleasing a majority. Many academic
>>>>>>>>> standardization efforts fail terribly trying to pretend to
>>>>> understand
>>>>>>> all
>>>>>>>>> use cases and proposing a standard way (which ends up
>>> unnecessarily
>>>>>>> complex
>>>>>>>>> and not usable). Science by nature is evolutionary, and
>>> restricting
>>>>>> the
>>>>>>>>> capabilities by a known set of use cases prevents the use of
>>>>>> middleware
>>>>>>> for
>>>>>>>>> real-scientific research (and gets limited to proof of concept
>>>>>>>>> demonstrations, papers, educational use). The only way meeting
>>> the
>>>>>>>>> challenges of these evolving needs is to have the framework which
>>>>> can
>>>>>>>>> evolve with minimal disruption.
>>>>>>>>> 
>>>>>>>>> Great thoughts so far, please keep 'em coming until we can find a
>>>>>>> solution
>>>>>>>>> not by the technical fancies but to address the real need.
>>>>>>>>> 
>>>>>>>>> Cheers,
>>>>>>>>> Suresh
>>>>>>>>> 
>>>>>>>>> On Feb 24, 2014, at 11:53 AM, Lahiru Gunathilake
>>> <[email protected]
>>>>>> 
>>>>>>>>> wrote:
>>>>>>>>> 
>>>>>>>>>> On Mon, Feb 24, 2014 at 11:20 AM, Milinda Pathirage <
>>>>>>>>>> [email protected]> wrote:
>>>>>>>>>> 
>>>>>>>>>>> I also think that moving to Cassandra or any other NoSQL will
>>> add
>>>>>>>>>>> unneccessary complexity to your solution. Also designing proper
>>>>>> (easy
>>>>>>> to
>>>>>>>>>>> manage changes, easy to query) NoSQL data models are hard
>>> (AFAIK,
>>>>>>>>> require
>>>>>>>>>>> lots of experience and understanding about data structures and
>>>>>>> queries).
>>>>>>>>>>> Also migrating from one NoSQL technology to other can require
>>>>>> complete
>>>>>>>>>>> re-write. And current relational databases can handle heavy
>>> loads
>>>>>>> except
>>>>>>>>>>> Google, Twitter, Amazon and Facebook like loads. I don't think
>>>>>>> Airavata
>>>>>>>>>>> will see Google and Amazon like loads.
>>>>>>>>>>> 
>>>>>>>>>> +1
>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>>> If the constant changes to the data model is the problem , I
>>> think
>>>>>>> best
>>>>>>>>>>> option is to abstract registry implementation to something like
>>>>>>>>> collections
>>>>>>>>>>> and resources used in WSO2 Registry [1] or something suitable
>>> for
>>>>>>>>> Airavata
>>>>>>>>>>> context. That will make it easy to handle changes in data
>>> model.
>>>>>>>>>>> 
>>>>>>>>>>> Also don't let the technologies drive design decision. Its
>>> always
>>>>>>>>> better to
>>>>>>>>>>> let use cases drive the design decision.
>>>>>>>>>>> 
>>>>>>>>>> +1
>>>>>>>>>> 
>>>>>>>>>> Regards
>>>>>>>>>> Lahiru
>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>>> Thanks
>>>>>>>>>>> Milinda
>>>>>>>>>>> 
>>>>>>>>>>> [1] http://wso2.com/products/governance-registry/
>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>>> On Mon, Feb 24, 2014 at 10:57 AM, Supun Kamburugamuva <
>>>>>>>>> [email protected]
>>>>>>>>>>>> wrote:
>>>>>>>>>>> 
>>>>>>>>>>>> Hi all,
>>>>>>>>>>>> 
>>>>>>>>>>>> I'm not trying to discourage you on your exploration to NoSQL
>>>>>>>>> databases.
>>>>>>>>>>> I
>>>>>>>>>>>> have the following concern.
>>>>>>>>>>>> 
>>>>>>>>>>>> Your database schema is moderately complex - even for a RDBMS
>>> it
>>>>>>> seems
>>>>>>>>>>>> complex and the data size is relatively small. I'm not sure
>>> about
>>>>>> the
>>>>>>>>>>>> current tools available but I think you will need to write
>>> more
>>>>>> code
>>>>>>> to
>>>>>>>>>>>> support all your requirements in a NoSQL database. So writing
>>>>> more
>>>>>>> code
>>>>>>>>>>> and
>>>>>>>>>>>> allow redundancy to support *relatively small* and *structured
>>>>>>>>>>>> data*doesn't seem right to me. May be I'm wrong and there are
>>>>>> better
>>>>>>>>>>>> tools in
>>>>>>>>>>>> NoSQL than RDBMS, which I doubt.
>>>>>>>>>>>> 
>>>>>>>>>>>> Thanks,
>>>>>>>>>>>> Supun..
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> On Sun, Feb 23, 2014 at 5:20 PM, Suresh Marru
>>> <[email protected]
>>>>>> 
>>>>>>>>> wrote:
>>>>>>>>>>>> 
>>>>>>>>>>>>> Hi All,
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Airavata is actively migrating to use Thrift API for the
>>>>> RESTless
>>>>>>>>>>> design
>>>>>>>>>>>>> and to facilitate various language bindings from client
>>>>> gateways.
>>>>>>> The
>>>>>>>>>>>>> programming language support in thrift has been so far very
>>>>>>>>>>> encouraging.
>>>>>>>>>>>>> The current architecture is looking like Figure 1 at [1].
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Language specific clients will be released as thrift SDK's
>>>>>> (similar
>>>>>>> to
>>>>>>>>>>>>> evernote sdk's [1]). These clients will be integrated into
>>>>> gateway
>>>>>>>>>>>> portals
>>>>>>>>>>>>> which connect to the API Server. The API operations brokers
>>> he
>>>>>>> simple
>>>>>>>>>>>> calls
>>>>>>>>>>>>> into one or more backend CPI calls (Airavata internal
>>> component
>>>>>>>>>>>>> interfaces).  An example set of mappings are illustrated in
>>>>>> Figure 2
>>>>>>>>> at
>>>>>>>>>>>>> [1]. The current draft of thrift API for version 0.12 is at
>>> [3],
>>>>>>>>> please
>>>>>>>>>>>> pay
>>>>>>>>>>>>> attention to experiment model at [4].
>>>>>>>>>>>>> 
>>>>>>>>>>>>> For the persistent store, we had few iterations of Airavata
>>>>>> Registry
>>>>>>>>>>>>> shifting from a legacy XRegistry to JackRabbit to now a
>>> OpenJPA
>>>>>>> based
>>>>>>>>>>>>> registry. To allow the API and the associated data models to
>>>>>> evolve,
>>>>>>>>> it
>>>>>>>>>>>>> will be useful to explore object databases so we can store
>>> the
>>>>>>>>>>> serialized
>>>>>>>>>>>>> version of thrift objects directly. But it will be nice to
>>> have
>>>>>> all
>>>>>>>>> (or
>>>>>>>>>>>>> most) of the fields queriable. This calls for a more
>>>>> column-family
>>>>>>>>>>> design
>>>>>>>>>>>>> of any NoSQL approaches.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Any recommendations for a registry architecture?
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Quickly hacking through I find the following approach a
>>> viable
>>>>>> one:
>>>>>>>>>>>>> ZombieDB[5] over astyanax[6] which talks to Cassandra.
>>> Airavata
>>>>>> can
>>>>>>>>>>>> benefit
>>>>>>>>>>>>> immediately from the replication and reliability of cassandra
>>>>> and
>>>>>>>>>>>>> scalability in near future. Some of the model objects like
>>>>>>> experiment
>>>>>>>>>>>>> creation will need to have strong consistency and most of the
>>>>>>>>>>> monitoring
>>>>>>>>>>>>> can live with eventual consistency.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Critical comments please?
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Thanks for your time,
>>>>>>>>>>>>> Suresh
>>>>>>>>>>>>> 
>>>>>>>>>>>>> [1] -
>>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>> 
>>>>>> 
>>>>> 
>>> 
>>> https://cwiki.apache.org/confluence/display/AIRAVATA/2014/02/23/Brainstor
>>> ming+Diagrams
>>>>>>>>>>>>> [2] - https://dev.evernote.com/doc/
>>>>>>>>>>>>> [3] -
>>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>> 
>>>>>> 
>>>>> 
>>> 
>>> https://git-wip-us.apache.org/repos/asf?p=airavata.git;a=tree;f=airavata-
>>> api/thrift-interface-descriptions;hb=HEAD
>>>>>>>>>>>>> [4] -
>>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>> 
>>>>>> 
>>>>> 
>>> 
>>> https://git-wip-us.apache.org/repos/asf?p=airavata.git;a=blob_plain;f=air
>>> avata-api/thrift-interface-descriptions/experimentModel.thrift;hb=HEAD
>>>>>>>>>>>>> [5] - https://github.com/MisterTea/ZombieDB
>>>>>>>>>>>>> [6] - https://github.com/Netflix/astyanax
>>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> --
>>>>>>>>>>>> Supun Kamburugamuva
>>>>>>>>>>>> Member, Apache Software Foundation; http://www.apache.org
>>>>>>>>>>>> E-mail: [email protected];  Mobile: +1 812 369 6762
>>>>>>>>>>>> Blog: http://supunk.blogspot.com
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>>> --
>>>>>>>>>>> Milinda Pathirage
>>>>>>>>>>> PhD Student Indiana University, Bloomington;
>>>>>>>>>>> E-mail: [email protected]
>>>>>>>>>>> Web: http://mpathirage.com
>>>>>>>>>>> Blog: http://blog.mpathirage.com
>>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> --
>>>>>>>>>> System Analyst Programmer
>>>>>>>>>> PTI Lab
>>>>>>>>>> Indiana University
>>>>>>>>> 
>>>>>>>>> 
>>>>>>> 
>>>>>>> 
>>>>>> 
>>>>> 
>>>>> 
>>>>> 
>>>>> --
>>>>> Best Regards,
>>>>> Shameera Rathnayaka.
>>>>> 
>>>>> email: shameera AT apache.org , shameerainfo AT gmail.com
>>>>> Blog : http://shameerarathnayaka.blogspot.com/
>>>>> 
>>>> 
>>>> 
>>>> 
>>>> --
>>>> Supun Kamburugamuva
>>>> Member, Apache Software Foundation; http://www.apache.org
>>>> E-mail: [email protected];  Mobile: +1 812 369 6762
>>>> Blog: http://supunk.blogspot.com
>>> 
>>> 
> 

Reply via email to