BriefGPT.xyz
Jul, 2023
基于项目反应理论的算法综合评估
Comprehensive Algorithm Portfolio Evaluation using Item Response Theory
HTML
PDF
Sevvandi Kandanaarachchi, Kate Smith-Miles
TL;DR
在本文中,我们提出了一个基于修改过的IRT模型的框架,用于评估算法组合在数据集存储库中的性能,同时揭示算法性能的重要方面,例如一致性和异常性。我们测试了这个框架在广泛应用的算法组合上,展示了这种方法作为一种具有洞察力的算法评估工具的广泛适用性,并且IRT参数的可解释性提供了对算法组合的更深入理解。
Abstract
Item Response Theory (IRT) has been proposed within the field of
educational psychometrics
to assess student ability as well as test question difficulty and discrimination power. More recently, IRT has been applied to evaluate
→