GitHub user xintongsong edited a discussion: Planning Flink Agents 0.3

# Time Plan
- Target Feature Freeze: May 31
- Target Release: Jun 15

# Feature Plan

## Must
- Highlight Features
  - Support for agent skills
  - [Memory] Integrate with mature AI storage frameworks (Mem0)
  - [Event log] Support per-event-type configurable log levels
  - Support Python 3.12
- Demands from Production
  - Support auto-arguments for tool using
  - Support cross-language actions & events
- Important Experience Improvements
  - Enhance the quickstart experience
  - [Event log] Enable by default and display in UI
  - Support async execution for cross-language resource access

## If possible
- Important feature, but lack concrete plan or capacity
  - [Event log] Support querying and filtering
  - Durable execution enhancement
  - Eval
  - Multimodal capabilities
- Nice to have features & improvements
  - Loading MCP Tools & Prompts in runtime
  - Support searching tools and prompts from the MCP server
  - [Memory] MemoryRef auto resolving
  - Structured information extraction
- User Unaware
  - Refactor ActionExecutionOperator
  - Refactor AgentPlan & resource cache
  - Unify long-term memory management for Java & Python
  - CI improvements

## Not likely (not sure about the necessity in event-driven agent scenarios)
- Support for multi-agent system & A2A
- Human-in-the-loop
- Serve as a service
- Sandbox
- Basic agent tools implementations like web search
- [Memory] Support for knowledge

# Brainstorm (previous)

## From @xintongsong 

- Installation & Deployment
  - Enhance the quickstart experience. - Currently, there are too many manual 
steps, thus chances for users to make mistakes and run into problems.
- Agent Development
  - Support Python 3.12
  - Support cross-language actions & events
  - Memory
    - Support for knowledge. - Not entirely sure about this. What is the 
relationship between Knowledge and ContextRetrievalAction?
    - MemoryRef auto resolving - Using MemoryRef for Events and Resource 
arguments just like using the referenced value
    - Integrate with mature AI storage frameworks - E.g., mem0, reme, agents, 
etc. Why build from scratch?
  - Support auto-arguments for tool using - Not all arguments should be decided 
by LLM. Avoid unnecessary randomness.
  - Support searching tools and prompts from the MCP server
  - Support for agent skills
  - Support for multi-agent system
  - Multimodal capabilities
  - Human-in-the-loop
  - Structured information extraction
- Agent Execution and Operation
  - Durable execution enhancement - What if the job fails when the durable 
execution code block is executed halfway? 2PC? User defined handling?
  - Serve as a service - A service that accepts the request, put it into the 
source, collect the output from the sink, and send the response.
  - Event log
    - Enable by default and display in UI
    - Support per-event-type configurable log levels
    - Support querying and filtering
  - Eval
    - How to quantitatively measure agent performance?
    - How to compare performance of two different versions (prompt, model, 
parameter, ...)?
- Tech Debt
  - Refactor ActionExecutionOperator
  - Refactor AgentPlan & resource cache
  - Loading MCP Tools & Prompts in runtime - in case of updates on the MCP 
server side
  - Support async execution for cross-language resource access
  - Unify long-term memory management for Java & Python
  - CI
    - Run the examples
    - Submit jobs to a standalone flink cluster - in addition to the 
mini-cluster
    - Fault tolerant test - kill the TM and see if it recovers as expected

## From @alnzng 
- Skills
- Multimodal
- Sandbox
- Human-in-the-Loop

## From @yanand0909 
- A2A (Multi-Agent System)
- Basic agent tools implementations like web search

## From @JinkunLiu 
- Improving the quick-start experience


GitHub link: https://github.com/apache/flink-agents/discussions/516

----
This is an automatically sent email for [email protected].
To unsubscribe, please send an email to: [email protected]

Reply via email to