BriefGPT.xyz
May, 2015
Should we really use post-hoc tests based on mean-ranks?
Alessio Benavoli, Giorgio Corani, Francesca Mangili
TL;DR
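This paper examines methods for comparing algorithms in machine learning. It shows that the mean-ranks post-hoc test is inconsistent and can lead to paradoxical outcomes, and it recommends pairwise tests such as the sign test or the Wilcoxon signed-rank test to avoid these problems.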
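As a rough illustration of the recommended kind of pairwise test, the sketch below implements a two-sided exact sign test in plain Python. The function name `sign_test` and the accuracy values are hypothetical and are not taken from the paper; the sketch assumes one paired score per data set and drops ties.

```python
from math import comb


def sign_test(scores_a, scores_b):
    """Two-sided exact sign test on paired scores (ties are dropped).

    Returns the p-value for the null hypothesis that neither algorithm
    wins more often than the other across data sets.
    """
    wins = sum(a > b for a, b in zip(scores_a, scores_b))
    losses = sum(a < b for a, b in zip(scores_a, scores_b))
    n = wins + losses
    if n == 0:
        return 1.0  # all ties: no evidence either way
    k = min(wins, losses)
    # Probability of k or fewer successes under Binomial(n, 0.5)
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


# Hypothetical accuracies of two algorithms on the same 10 data sets
acc_a = [0.81, 0.77, 0.90, 0.68, 0.85, 0.79, 0.92, 0.74, 0.88, 0.83]
acc_b = [0.78, 0.75, 0.87, 0.66, 0.80, 0.78, 0.90, 0.70, 0.85, 0.81]
p_value = sign_test(acc_a, acc_b)
```

The p-value here is computed from the paired scores of the two algorithms alone, so adding or removing other algorithms from the study cannot change it.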
Abstract
The statistical comparison of multiple algorithms over multiple data sets is fundamental in machine learning. This is typically carried out by the …