Open LLM Leaderboard The open-source large model ranking list launched by Hugging Face. 0580 AI Metrics
FlagEval The FlagEval (Tiancheng) large model evaluation platform launched by the Beijing Academy of Artificial Intelligence (BAAI). 0460 AI Metrics
OpenCompass The large model open evaluation system launched by Shanghai AI Laboratory 0470 AI Metrics
H2O Eval Studio A large-scale evaluation system based on the Elo rating method launched by H2O.ai 0490 AI Metrics