WOWService – Technical Report on Meituan’s Large Model Interaction System

What is WOWService？

WOWService is an intelligent interaction system developed by Meituan’s LongCat team. By integrating large language models (LLMs) with a multi-agent architecture, WOWService optimizes user experience while reducing operational costs. The system addresses five major challenges in intelligent interaction: cold-start data construction, multi-turn dialogue performance, adaptation to changing business rules, limitations of a single LLM, and quantitative evaluation of multi-turn conversations. WOWService introduces innovative solutions through a four-stage training pipeline—Continuous Pre-Training (CPT), Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Reinforcement Learning (RL)—significantly enhancing model performance in real-world business scenarios. Its multi-agent architecture allows a main agent to collaborate with multiple specialized sub-agents for efficient handling of complex tasks. Additionally, WOWService incorporates a dual-driven mechanism of data and knowledge through Self-Refinement Training (SRT) and hybrid data-knowledge strategies, further improving overall system capability.

Key Features of WOWService

Data Construction and Optimization:
WOWService adopts a dual-driven strategy combining data and knowledge. It enhances model training efficiency and performance through adaptive data-mixing optimization and high-quality data filtering, reducing the cost of data labeling.

Multi-Turn Dialogue Enhancement:
Through the four-stage training pipeline—CPT, SFT, DPO, and RL—the system significantly improves intent understanding, rule adherence, and solution extraction during multi-turn conversations.

Business Rule Adaptation and Self-Evolution:
Equipped with self-optimization training, WOWService can rapidly adapt to evolving business rules, achieving continuous model evolution and optimization while reducing business expansion costs.

Multi-Agent Collaboration:
The architecture features a main agent responsible for global dialogue control and multiple specialized sub-agents handling specific tasks, improving flexibility and service quality in complex scenarios.

Automated Evaluation and Feedback:
A comprehensive evaluation framework measures both fundamental model performance and end-to-end agent effectiveness, with data-driven feedback loops supporting continuous model improvement.

Multimodal Interaction Support:
WOWService supports multimodal inputs such as speech and images. By integrating multimodal understanding agents, it enhances comprehension of non-text inputs and broadens application scenarios.

Technical Principles of WOWService

Data and Knowledge Dual-Drive:
WOWService combines data-driven and knowledge-driven approaches, strengthening the model’s ability to follow business rules and apply structured knowledge effectively in complex business environments.

Four-Stage Training Pipeline:
Its pipeline consists of Continuous Pre-Training (CPT), Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Reinforcement Learning (RL), progressively improving generalization, business adaptability, and reasoning capabilities.

Multi-Agent Architecture:
The system introduces a main agent and several specialized sub-agents. The main agent manages overall dialogue flow, while sub-agents handle domain-specific tasks. Through dynamic invocation and information integration, WOWService efficiently manages complex workflows.

Self-Refinement Training:
By automatically selecting high-quality service cases and rewriting low-quality dialogues, WOWService builds a self-improving data loop, enabling continuous optimization and adaptation to new business needs.

Hybrid Data-Knowledge Approach:
Combining the flexibility of data-driven methods with the rule compliance of knowledge-driven approaches, WOWService employs dynamic knowledge retrieval and reasoning enhancement to improve accuracy and reliability in complex business processes.

Reinforcement Learning and Preference Optimization:
By leveraging reinforcement learning and DPO, WOWService adjusts its outputs based on human feedback, generating responses more aligned with user preferences and improving user experience.

Automated Evaluation Framework:
A comprehensive evaluation system—from core model capabilities to full agent performance—supports automated assessment and continuous optimization, ensuring stability and efficiency in real-world deployment.

Project Links

arXiv Technical Paper: https://arxiv.org/pdf/2510.13291

Application Scenarios of WOWService

1. Meituan Intelligent Customer Service System:
WOWService has been successfully deployed in Meituan’s app-based intelligent customer service system, handling user inquiries 24/7 and significantly improving satisfaction and issue resolution efficiency.

2. Multi-Business Scenario Coverage:
Widely applied across dozens of Meituan business areas—such as food delivery, hotel booking, and travel—WOWService provides personalized services tailored to each domain.

3. Complex Business Process Management:
In scenarios involving multiple roles (e.g., users, merchants, couriers), WOWService dynamically allocates and coordinates resources to ensure efficient information flow and service execution.

4. Multimodal Interaction Support:
By supporting input via voice, images, and other modalities, WOWService enables more natural, convenient interactions, suitable for scenarios requiring visual or auditory assistance.

5. Proactive Collaboration and Recommendation:
WOWService proactively identifies user needs, offering personalized recommendations and solutions that enhance engagement and satisfaction.

6. Intelligent Outbound Services:
It supports automated outbound calls—for example, order confirmations and satisfaction surveys—improving operational efficiency and user response rates.