BriefGPT.xyz
Apr, 2024
LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Automation Task Evaluation
Li Zhang, Shihe Wang, Xianqing Jia, Zhihan Zheng, Yunhe Yan...
TL;DR
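LlamaTouch is a testbed for on-device agent execution and for faithful, scalable agent evaluation. Based on the observation that task execution only transfers UI states, it uses a novel evaluation method that checks whether an agent traverses all manually annotated application/system states.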
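The state-traversal check described above can be illustrated with a minimal sketch. This is not LlamaTouch's implementation: the function name, the string state labels, and the `ordered` flag are all hypothetical, and whether the annotated states must be visited in order is an assumption made here for illustration.

```python
from typing import Hashable, Sequence


def traverses_all_states(
    observed: Sequence[Hashable],
    essential: Sequence[Hashable],
    ordered: bool = True,
) -> bool:
    """Return True if every annotated essential state appears in the observed trace.

    Hypothetical helper: states are reduced to hashable labels, whereas a real
    testbed would compare richer UI representations.
    """
    if not ordered:
        # Unordered variant: each essential state must appear somewhere in the trace.
        return set(essential) <= set(observed)
    # Ordered variant (an assumption): essential states must form a subsequence
    # of the trace. The shared iterator is consumed as matches are found.
    trace = iter(observed)
    return all(any(state == seen for seen in trace) for state in essential)


# Hypothetical state labels, for illustration only.
essential_states = ["settings_home", "wifi_page", "wifi_enabled"]
good_trace = ["launcher", "settings_home", "wifi_page", "wifi_enabled"]
bad_trace = ["launcher", "settings_home", "bluetooth_page"]

print(traverses_all_states(good_trace, essential_states))
print(traverses_all_states(bad_trace, essential_states))
```

Under these assumptions, an agent may take any number of extra steps between annotated states and still pass, which is the point of comparing against essential states rather than against one exact action sequence.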
Abstract
The emergent large language/multimodal models facilitate the evolution of mobile agents, especially in the task of mobile UI automation. However, existing