BriefGPT.xyz
Jun, 2021
ChineseBERT: 利用字形和拼音信息加强的中文预训练模型
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information
HTML
PDF
Zijun Sun, Xiaoya Li, Xiaofei Sun, Yuxian Meng, Xiang Ao...
TL;DR
本文提出了一种新的预训练语言模型 ChineseBERT,将汉字的字形、拼音信息融合到语言模型预训练中,该模型在多个汉语自然语言处理任务上取得了新的最佳表现。
Abstract
Recent
pretraining models
in Chinese neglect two important aspects specific to the Chinese language:
glyph
and
pinyin
, which carry signifi
→