BriefGPT.xyz
Nov, 2024
Crystal: Illuminating LLM Abilities on Language and Code
Tianhua Tao, Junbo Li, Bowen Tan, Hongyi Wang, William Marshall...
TL;DR
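This study addresses a shortcoming of code-generation large language models (LLMs): their weak integration of natural-language and coding abilities. It proposes a pretraining strategy that improves how the two abilities combine. The results show that the proposed model, Crystal, performs comparably to Llama 2 and Code Llama on natural language and code generation while being more data-efficient, which points to a more effective training approach and potential for broad application.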
Abstract
Large Language Models (LLMs) specializing in code generation (which are also often referred to as code LLMs), e.g., StarCoder and Code Llama, play increasingly critical roles in various software development scenarios...