BriefGPT.xyz
Feb, 2021
DOBF: 面向编程语言的反混淆预训练目标
DOBF: A Deobfuscation Pre-Training Objective for Programming Languages
HTML
PDF
Baptiste Roziere, Marie-Anne Lachaux, Marc Szafraniec, Guillaume Lample
TL;DR
本文介绍了一种新的预训练方法 DOBF,它利用编程语言的结构特性,对模型进行预训练,以恢复混淆的源代码的原始版本,表明使用 DOBF 预训练的模型在多种下游任务上具有明显的性能优势,例如在无监督代码翻译和自然语言代码搜索方面分别提供了多达 13% 和 24% 的相对改进。
Abstract
Recent advances in
self-supervised learning
have dramatically improved the state of the art on a wide variety of tasks. However, research in
language model pre-training
has mostly focused on natural languages, an
→