Feb, 2024

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

TL;DR: 1-bit Large Language Models (LLMs) with ternary weights, such as BitNet b1.58, define a new scaling law and offer a high-performance, cost-effective way to train new generations of LLMs, while enabling the design of hardware optimized for 1-bit LLMs.
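The "1.58" in the name follows from the ternary weights: a symbol that can take one of three values carries log2(3) ≈ 1.58 bits of information. The short sketch below just checks that arithmetic. It assumes the three weight values are -1, 0, and 1 and that they are equally likely; the helper name `bits_per_weight` is illustrative and not taken from any library.

```python
import math

# Assumption: the three ternary weight values are -1, 0, and 1.
TERNARY_VALUES = (-1, 0, 1)


def bits_per_weight(num_values: int) -> float:
    """Bits of information in one symbol drawn uniformly from num_values values."""
    return math.log2(num_values)


bits = bits_per_weight(len(TERNARY_VALUES))
print(f"{bits:.2f} bits per ternary weight")  # prints "1.58 bits per ternary weight"
```

For comparison, a strictly binary weight would give `bits_per_weight(2) == 1.0`, which is why a ternary model is described as 1.58-bit rather than 1-bit in the strict sense.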