关键词transformer-based large language model
搜索结果 - 3
  • ACL多语言微调中语言特定类别不平衡的影响理解
    PDF5 months ago
  • ConSmax:硬件友好的可学习参数替代 Softmax
    PDF5 months ago
  • DeepSpeed Ulysses:极长序列 Transformer 模型训练的系统优化
    PDF9 months ago
Prev
Next