
HELM
A large model evaluation system launched by Stanford University.
An open large model evaluation system launched by Shanghai AI Laboratory.
OpenCompass is an open evaluation system for large models, officially launched by Shanghai AI Laboratory in August 2023. It provides a complete, open-source, and reproducible evaluation framework that supports one-stop evaluation of large language models, multimodal models, and other model types, and it regularly publishes leaderboards of evaluation results.