BriefGPT.xyz
June 2024
MobileAgentBench: An Efficient and User-Friendly Benchmark for Mobile LLM Agents
Luyuan Wang, Yongyu Deng, Yiwei Zha, Guodong Mao, Qinmin Wang...
TL;DR
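Proposes MobileAgentBench, an efficient and user-friendly benchmark, to enable a thorough and systematic performance comparison of existing mobile agents, addressing the challenges posed by the endless number of application states and the ill-defined set of feasible action sequences.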
Abstract
Large language model (LLM)-based mobile agents are increasingly popular due to their capability to interact directly with mobile phone Graphic User Interfaces (GUIs) and their potential to autonomously manage dai