Pre-trained Language Models Incorporated with Non-preference Attention Spans (融合非偏好注意力跨度的预训练语言模型)
ZOU Ao (邹傲), HAO Wenning (郝文宁), CHEN Gang (陈刚), JIN Dawei (靳大尉)
Journal of Army Engineering University of PLA (陆军工程大学学报). 2023, (3): 49-57. DOI: 10.12018/j.issn.2097-0730.20220714001