语义相似性分类任务中模型与评估数据集策划的界限定位

Nov, 2023

Contextualizing the Limits of Model & Evaluation Dataset Curation on Semantic Similarity Classification Tasks

Daniel Theron

TL;DR该研究展示了预训练模型和开放评估数据集的局限性对于评估二元语义相似性分类任务的性能的影响，强调了数据的收集方式的重要性，同时强调了不同数据集、嵌入技术和距离度量之间的性能差异。

Abstract

This paper demonstrates how the limitations of pre-trained models and open evaluation datasets factor into assessing the performance of binary se