[jira] [Commented] (AIRAVATA-357) Provide cloud bursting like capabilities to Airavata computational workflows integrating with Apache Whirr

Suresh Marru (Commented) (JIRA) Thu, 05 Apr 2012 21:55:45 -0700

    [ 
https://issues.apache.org/jira/browse/AIRAVATA-357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13248051#comment-13248051
 ]


Suresh Marru commented on AIRAVATA-357:
---------------------------------------

Thank you for quickly grabbing attention to this task. Yes, what you suggest on 
moving part of the job is very appealing and will have large community uptake. 
This is so useful for check pointable scientific applications. Airavata 
framework should consider providing a capability to plug in application logic 
which evaluates the long running execution, and trigger job migration 
requesting increased resources or also request a change in data partitioning or 
user inputs. Any study on understanding the associated over heads will be a big 
plus. 

As for the logic to when to move to cloud, its a complex beast, as you suggest, 
you can do heuristics or CBR or other adaptive learning techniques, but really 
the challenge comes in effectively mapping the current problem at hand with 
previous data. The challenge being there may be semantic differences among 
these executions which needs elaborative metadata and provenance description. 
But glad to see you thinking on these aspects. 

 As for level of integration, all these suggestions are arguable and there are 
pros and cons of each. I personally favor lose coupling and not having any 
dependency on critical path of basic airavata functionality, so I second your 
later approach as well. You can also make the cloud provider API's themselves 
can be integrated into GFac. GFac chain of responsibility based architecture 
make this integration seamless. If you are planning a GSoc project based on 
this task, we can help elaborate in more detail. But in short, these 
capabilities will require changes to XBaya, GFac, and possibly WS-Messenger 
(adding publishers). 
                
> Provide cloud bursting like capabilities to Airavata computational workflows 
> integrating with Apache Whirr
> ----------------------------------------------------------------------------------------------------------
>
>                 Key: AIRAVATA-357
>                 URL: https://issues.apache.org/jira/browse/AIRAVATA-357
>             Project: Airavata
>          Issue Type: New Feature
>          Components: GFac, Workflow Interpreter, XBaya
>            Reporter: Suresh Marru
>              Labels: gsoc2012, mentor
>             Fix For: WISHLIST
>
>
> Apache Airavata provides capabilities to construct execute and monitor 
> computational workflows with built in providers to execute applications on 
> compute intensive resources. The users of Airavata constantly have the need 
> to execute workflows which have tasks with a hybrid combination of 
> computational grids and computational clouds. More over, some applications 
> can be executed on multiple types of resources. The selection will depend on 
> the distribution of data to be processed, the tolerable latency on shared 
> batch queued grid clusters, and on the nature and size of the problem to be 
> solved and data analysis tasks. Especially for applications which reduce the 
> data as they proceed in the graph, they better fit for local map-reduce 
> executions residing on hadoop based file systems.
> This idea has two implementation paths:
> Firstly extend airavata workflow providers to integrate with cloud based run 
> times. Integrating with higher level API's like Apache Whirr provides a great 
> value addition to this project to encompass multiple cloud services. 
> Secondly, extending Airavata enactment called Workflow Interpreter so a 
> component in the workflow is capable of initially running on local clusters 
> deploying hadoop file systems and if the execution is saturating the local 
> resources, the executions can scale out to commercial on-demand clouds or 
> leverage high speed low latency interconnects on computational grids. This 
> scaling technique is often referred to as Cloud Bursting. This project will 
> not only enable this capability to Airavata workflow. 
> User community & Impact of the software: Airavata is primarily targeted to 
> build science gateways using computational resources from various 
> disciplines. The initial targeted set of gateways include projects supporting 
> research and education in chemistry, life sciences, biophysics, environmental 
> sciences, geosciences astronomy and nuclear physics. The goal of airavata is 
> to enhance productivity of these gateways to utilize cyberinfrastructure of 
> resources (e.g., local lab resources, the Extreme Science and Engineering 
> Discovery Environment (XSEDE), the Open Science Grid (OSG), University 
> Clusters, Academic and Commercial Computational Clouds like FutureGrid & 
> Amazon EC2). By using open community based software components and services 
> like Airavata, gateways will be able to focus on providing additional 
> scientific capabilities and to expanding the number of supported users. The 
> capabilities of these gateways will offer clear benefits to society.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (AIRAVATA-357) Provide cloud bursting like capabilities to Airavata computational workflows integrating with Apache Whirr

Reply via email to