[FEATURE] Deep Research #3650

pyek-bot · 2025-03-13T23:26:58Z

Is your feature request related to a problem?
Agent framework in ml-commons is not capable of solving complex tasks that require multiple steps and tools to execute. Moreover, the execution is synchronous and not suitable for long running tasks. Tools execute sequentially and do not offer the capability to search the web preventing the ability to solve ambiguous tasks that require more context.

Currently we have three types of agent (doc):

Flow agent
Conversational flow agent
Conversational agent

These existing agent types have limitations in supporting complex logic. Specifically, they lack the ability to:

Implement conditional workflows
Execute tools in parallel
Handle branching and merging of execution paths
Manage dependencies between tasks
Asynchronous execution

This limitation restricts the creation of more sophisticated and efficient workflows within OpenSearch.

With the introduction of deep research in service providers like OpenAI, Gemini, Perplexity, etc, there is a need for such an agent capable of breaking down a task into simple steps and executing them with the help of the provided tools asynchronously.

It can also help save costs by invoking cheaper & faster LLMs for smaller tasks and reducing the number of inferences required.

Examples:

Root Cause Analysis (RCA) for an Error:

Identify the source of recurring 500 Internal Server Error logs.
Retrieve related warnings, deployment changes, or traffic anomalies.
Generate a summary with potential causes and recommended actions.

Log Anomaly Detection:

Analyze the last 24 hours of logs to detect spikes.
Identify affected endpoints, services, or patterns.
Correlate findings with recent system changes.

What solution would you like?
A new Deep Research Agent in OpenSearch ML Commons that can:

Break down complex tasks into simpler steps
Use appropriate tools dynamically for each step
Execute tasks serially or in parallel, depending on dependencies
Support function calling to leverage external services efficiently
Ref: https://docs.anthropic.com/en/docs/build-with-claude/tool-use/overview
Re-evaluate progress and refine execution based on intermediate results
Handle failures intelligently, including retries, fallbacks, and logging
Execute the task asynchronously and update the status accordingly
Incorporate web search capabilities where external context is required

There is a feature request for a graph agent already that tries to address the problem: #3309

However, this feature focusses more on the automatic breakdown of complex tasks and tool execution rather than only a DAG style execution of tools.

ylwu-amzn · 2025-03-14T04:01:41Z

Suggestion:

Add architecture and workflow diagram
Split into phases for faster delivery

pyek-bot added enhancement New feature or request untriaged labels Mar 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Deep Research #3650

[FEATURE] Deep Research #3650

pyek-bot commented Mar 13, 2025

ylwu-amzn commented Mar 14, 2025

[FEATURE] Deep Research #3650

[FEATURE] Deep Research #3650

Comments

pyek-bot commented Mar 13, 2025

ylwu-amzn commented Mar 14, 2025