What is ROMA?
ROMA (Recursive Open Meta-Agent) is an open-source multi-agent system framework developed by the Sentient AGI team. It decomposes complex tasks into parallelizable subtasks through a recursive hierarchical structure, where parent nodes allocate tasks to child nodes, which then execute and return results for aggregation. ROMA supports multimodal input and output, and comes with built-in agents such as a general task solver, a deep research agent, and a financial analysis agent, making it applicable to a wide range of scenarios from research and analysis to financial decision-making. With a transparent execution process that facilitates debugging and optimization, ROMA has shown excellent performance across multiple benchmarks, positioning it as a powerful open-source tool for DeepResearch.
Key Features
-
Recursive task decomposition: Automatically breaks down complex tasks into parallelizable subtasks, integrates results after step-by-step resolution.
-
Multimodal support: Handles diverse data types including text, images, and code, adaptable to various application needs.
-
Tool integration: Supports MCP protocol and API integration, enabling calls to external tools and models.
-
Transparent debugging: Provides clear visibility into every execution step, simplifying debugging and optimization.
-
Built-in specialized agents: Includes a general task solver, deep research agent, and financial analysis agent to meet diverse requirements.
Technical Principles of ROMA
-
Recursive hierarchical structure: Uses a tree structure where parent nodes decompose tasks into subtasks, which are executed by child nodes and returned to the parent.
-
Core components:
-
Atomizer: Determines whether a task is atomic; if not, triggers decomposition.
-
Planner: Breaks down complex tasks into subtasks and recursively assigns them.
-
Executor: Executes atomic tasks by invoking LLMs, APIs, or other agents.
-
Aggregator: Integrates subtask results and returns them to the parent node.
-
-
Context flow management: Top-down task decomposition and bottom-up result aggregation ensure clear information flow.
-
Modular design: Allows insertion of any agent, tool, or model at the node level, offering high extensibility.
Project Links
-
Official site: https://blog.sentient.xyz/posts/recursive-open-meta-agent
-
GitHub repository: https://github.com/sentient-agi/ROMA
Application Scenarios for ROMA
-
Research & analysis: The deep research agent automatically decomposes complex academic or market research tasks, integrates multi-source information, and generates reports.
-
Financial decision-making: The financial analysis agent monitors cryptocurrency markets in real time and integrates multiple data sources to produce investment analysis reports.
-
Project planning: The general task solver decomposes project tasks, assigns them, and tracks progress to support efficient project management.
-
Enterprise automation: Builds multi-agent workflows to automate internal business processes, enhancing operational efficiency.
-
Educational tools: Students can use natural language to create research agents that automatically collect and integrate information into research reports.