ROMA – a multi-agent framework for Sentient AGI, open-sourced

AI Tools updated 2d ago dongdong
23 0

What is ROMA?

ROMA (Recursive Open Meta-Agent) is an open-source multi-agent system framework developed by the Sentient AGI team. It decomposes complex tasks into parallelizable subtasks through a recursive hierarchical structure, where parent nodes allocate tasks to child nodes, which then execute and return results for aggregation. ROMA supports multimodal input and output, and comes with built-in agents such as a general task solver, a deep research agent, and a financial analysis agent, making it applicable to a wide range of scenarios from research and analysis to financial decision-making. With a transparent execution process that facilitates debugging and optimization, ROMA has shown excellent performance across multiple benchmarks, positioning it as a powerful open-source tool for DeepResearch.

ROMA – a multi-agent framework for Sentient AGI, open-sourced


Key Features

  • Recursive task decomposition: Automatically breaks down complex tasks into parallelizable subtasks, integrates results after step-by-step resolution.

  • Multimodal support: Handles diverse data types including text, images, and code, adaptable to various application needs.

  • Tool integration: Supports MCP protocol and API integration, enabling calls to external tools and models.

  • Transparent debugging: Provides clear visibility into every execution step, simplifying debugging and optimization.

  • Built-in specialized agents: Includes a general task solver, deep research agent, and financial analysis agent to meet diverse requirements.


Technical Principles of ROMA

  • Recursive hierarchical structure: Uses a tree structure where parent nodes decompose tasks into subtasks, which are executed by child nodes and returned to the parent.

  • Core components:

    • Atomizer: Determines whether a task is atomic; if not, triggers decomposition.

    • Planner: Breaks down complex tasks into subtasks and recursively assigns them.

    • Executor: Executes atomic tasks by invoking LLMs, APIs, or other agents.

    • Aggregator: Integrates subtask results and returns them to the parent node.

  • Context flow management: Top-down task decomposition and bottom-up result aggregation ensure clear information flow.

  • Modular design: Allows insertion of any agent, tool, or model at the node level, offering high extensibility.


Project Links


Application Scenarios for ROMA

  • Research & analysis: The deep research agent automatically decomposes complex academic or market research tasks, integrates multi-source information, and generates reports.

  • Financial decision-making: The financial analysis agent monitors cryptocurrency markets in real time and integrates multiple data sources to produce investment analysis reports.

  • Project planning: The general task solver decomposes project tasks, assigns them, and tracks progress to support efficient project management.

  • Enterprise automation: Builds multi-agent workflows to automate internal business processes, enhancing operational efficiency.

  • Educational tools: Students can use natural language to create research agents that automatically collect and integrate information into research reports.

© Copyright Notice

Related Posts

No comments yet...

none
No comments yet...