[ 
https://issues.apache.org/jira/browse/AIRAVATA-3965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suresh Marru updated AIRAVATA-3965:
-----------------------------------
    Labels: gsoc gsoc2025 mentor  (was: )

> Facilitating computational experiment generation in AIRAVATA
> ------------------------------------------------------------
>
>                 Key: AIRAVATA-3965
>                 URL: https://issues.apache.org/jira/browse/AIRAVATA-3965
>             Project: Airavata
>          Issue Type: New Feature
>            Reporter: Giri Krishnan
>            Priority: Major
>              Labels: gsoc, gsoc2025, mentor
>
> Computational sciences involve extensive experimentation which often involves 
> searching over space of parameters, variables, functions and workflows. 
> Individual researchers and groups often perform a large number of such 
> searches to identify critical functional forms and workflows for any 
> particular study. The goal of this work is to provide a tool that facilitates 
> this search process. This will enable visualization, identifying or learning 
> templates and generate potential experiments based on past experiments using 
> LLM and neurosymbolic methods.
>  
> This task requires the following specific goals for this work :
>  # Provide visualization of past computational experiments: Tracking various 
> computational experiments with various variations is often a challenging 
> problem for individuals and groups of researchers. Often various adhoc 
> approaches (directories, git etc) are used to track these changes, but often 
> it is very difficult to provide an entire overview of past experiments. The 
> goal of this work is to develop a visualization approach that allows to 
> examine all the past experiments. This will require dimensionality reduction 
> on the embeddings from LLMs which have been tested on its code generation 
> abilities (eg. codellama, Llama 4 Maverick) for generating visualizations. 
> Further comparison in the performance with standard code cloning and 
> similarity measures will be required.
>  # Identify template based on past experiment database: It is common for 
> several computational experiments to share a common structure, in such cases 
> identifying the 'template' allows for identifying common approaches in past 
> experiments and to generate new ones. This work will need software engineer 
> approach and AI based approaches to identify such templates. The templates 
> will also be integrated with the visualization (in addition to embedding 
> based visualization) allowing for examining the collections of experiments 
> that belong to each template.
>  # Generate new suggested experiments using templates and visualization 
> guided search. Generation of new experiments is a key component of 
> computational science work. To facilate this process, will require a visual 
> interactive way to generate experiments based within the regions of previous 
> experiments and also in the space where it was not previously explored. In 
> addition, this will require generation of new experiments based on templates 
> that were identified from the previous step. Template based generation could 
> also provide a verifiable way to generate experiments.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to