研究机器学习回归中最小化训练集填充距离

Jul, 2023

研究机器学习回归中最小化训练集填充距离

Investigating minimizing the training set fill distance in machine learning regression

Paolo Climaco, Jochen Garcke

TL;DR研究了一种抽样方法，旨在最小化填充距离，通过选择最小填充距离的训练集，实验证明该方法显著降低了各种回归模型的最大预测误差，远远优于现有的抽样方法。

Abstract

Many machine learning regression methods leverage large datasets for training predictive models. However, using large datasets may not be feasible due to computational limitations or high labelling costs. Therefore, sampling small training sets from large pools of unlabelled data point