BriefGPT.xyz
Jun, 2024
用于优化西班牙大型语言模型的17世纪西班牙美洲公证记录
Seventeenth-Century Spanish American Notary Records for Fine-Tuning Spanish Large Language Models
HTML
PDF
Shraboni Sarker, Ahmad Tamim Hamad, Hulayyil Alshammari, Viviana Grieco, Praveen Rao
TL;DR
展示了如何利用17世纪阿根廷国家档案馆的手写公证记录来优化西班牙语言模型,以进行分类、遮蔽语言建模等任务,证明这个资源在历史文本分析领域非常有价值。
Abstract
large language models
have gained tremendous popularity in domains such as e-commerce, finance, healthcare, and education.
fine-tuning
is a common approach to customize an LLM on a domain-specific dataset for a d
→