关键词attention-aware post-training mixed-precision quantization
搜索结果 - 1
  • APTQ:针对大型语言模型的注意力感知后训练混合精度量化
    PDF5 months ago
Prev
Next