BriefGPT.xyz
May, 2021
CoDesc: 一个大型代码-描述平行数据集
CoDesc: A Large Code-Description Parallel Dataset
HTML
PDF
Masum Hasan, Tanveer Muttaqueen, Abdullah Al Ishtiaq, Kazi Sajeed Mehrab, Md. Mahim Anjum Haque...
TL;DR
本文提出了CoDesc数据集,该数据集包含420万个Java方法和自然语言描述,其有效地提高了24%的代码搜索能力,并实现了代码总结的新的最先进水平。
Abstract
Translation between
natural language
and
source code
can help software development by enabling developers to comprehend, ideate, search, and write computer programs in
→