PaperCodex

Bitnet.cpp: Run 1.58-Bit LLMs at the Edge with Lossless Speed and Efficiency

Large language models (LLMs) are becoming increasingly central to real-world applications, but their computational demands remain a major barrier for edge…

12/22/2025 · Edge Inference, Low-bit LLMs, On-Device AI
Copyright © 2026 PaperCodex.