BriefGPT.xyz
Feb, 2024
CodeS: Towards Building Open-source Language Models for Text-to-SQL
Haoyang Li, Jing Zhang, Hanbing Liu, Ju Fan, Xiaokang Zhang...
TL;DR
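This study introduces CodeS, an open-source language model designed to address existing limitations in the Text-to-SQL task. Using incremental pre-training, schema construction, and bi-directional data augmentation, CodeS improves its SQL generation ability and achieves new state-of-the-art accuracy and robustness on multiple datasets.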
Abstract
Language models have shown promising performance on the task of translating natural language questions into SQL queries (Text-to-SQL). However, most of the state-of-the-art (SOTA) approaches rely on powerful yet