Cracking answer Ressh. If we can make this more like the de-facto response to these queries aI think we will be doing ourselves justice. Lewis
On Wed, Aug 6, 2014 at 3:25 PM, Verma, Rishi (398J) < [email protected]> wrote: > Hi Roger, > > Great to hear from you, and thanks for considering Apache OODT! > > I would say, at a high-level, Apache OODT is a project centered around > three themes: > 1. Data management and archival > 2. Data processing > 3. Data sharing > > Depending on your use case, it would make sense to first identify which of > those you are interested in, and then investigate the relevant modules > in-depth. OODT is a component-based architecture, so one can just use > modules independently of one another if so desired (or use a packaged > bundle, like mentioned in the Quick Start section below). Much of our > documentation is currently on our wiki [1], so that is a good place to > start. > > Here are some resources: > > Quick-start with OODT: > * Vagrant Virtual Machine with latest OODT (all components) pre-installed: > https://cwiki.apache.org/confluence/display/OODT/Vagrant+Powered+OODT > * RADiX (i.e. OODT, all components, packaged together through a single > Maven build): > https://cwiki.apache.org/confluence/display/OODT/RADiX+Powered+By+OODT > > Data Management and Archival (i.e. taking raw products, extracting > metadata, archiving metadata and products) > * File Manager Developer Guide: > http://oodt.apache.org/components/maven/filemgr/development/developer.html > * File Manager Policy (i.e. describing the nature of your products for > archival): > https://cwiki.apache.org/confluence/display/OODT/Everything+you+want+to+know+about+File+Manager+Policy > * Crawler (i.e. how to get your products into File Manager): > https://cwiki.apache.org/confluence/display/OODT/OODT+Crawler+Help > > Data Processing (i.e. transforming data already archived or to-be > archived): > * Workflow Manager Developer Guide: > http://oodt.apache.org/components/maven/workflow/development/developer.html > * CAS-PGE Learn By Example (i.e. how to wrap your external algorithms into > workflows): > https://cwiki.apache.org/confluence/display/OODT/CAS-PGE+Learn+by+Example > > Data Sharing (i.e. sharing and accessing your archive between machines) > * Web-grid overview: > http://oodt.apache.org/components/maven/grid/slides.pdf > > > To answer your second question, of experiences dealing with OODT, I've > personally been using it for archival management for at least two climate > science projects at NASA Jet Propulsion Laboratory, and for data processing > needs for two other projects. I think the integration of the Solr catalog > makes OODT an attractive choice for metadata cataloging and the workflow > manager makes the creation and execution of batch jobs involving external > algorithms easier. There's sort of a high-learning curve for OODT (we are > working on improving documentation!), but once you get the hang of the > components, its definitely a useful software package. > > Hope that helps! > > Rishi > > -- > [1] https://cwiki.apache.org/confluence/display/OODT/Home > > On Aug 6, 2014, at 2:01 PM, Roger Carter wrote: > > Hi Everyone, > > I'm new to the apache scene; I have experience with Matlab and minimal > experience with Python. This seems like a powerful tool and I'd like to > learn more. If anyone is willing to provide reccomendations for resources > or detail their experiences in learning Apache OODT, I would be most > grateful. > > Thanks, > Roger > > -- *Lewis*
