AbstractLimited diversity in standardized benchmarks for evaluating audio representation learning (ARL) methods may hinder systematic comparison of current methods' capabilities. We present ARCH, a
comprehensive benchmark for evaluating ARL methods on
→