PaperCodex

GLM-130B: A Truly Open, Bilingual 130B-Language Model That Runs on Consumer GPUs 7680

If you’re evaluating large language models (LLMs) for real-world deployment—especially in multilingual settings—you’ve likely hit a wall: most top-performing models…

01/05/2026Bilingual Language Modeling, Text Generation, Zero-shot Inference

SmoothQuant: Accurate 8-Bit LLM Inference Without Retraining – Slash Memory and Boost Speed 1576

Deploying large language models (LLMs) in production is expensive—not just in dollars, but in compute and memory. While models like…

01/05/2026Efficient LLM Deployment, Large Language Model Inference, Post-training Quantization

Magika: AI-Powered File Type Detection with 99% Accuracy and Millisecond Speed 9991

Identifying what kind of data is inside a file seems simple—until you’re dealing with corrupted headers, obfuscated malware, or ambiguous…

01/05/2026AI-powered Security, Content-type Detection, File Classification

Self-Instruct: Bootstrap High-Quality Instruction Data Without Human Annotations 4557

For teams building or fine-tuning large language models (LLMs), one of the biggest bottlenecks is the scarcity of high-quality, diverse…

01/05/2026Instruction Tuning, Synthetic Data Generation, Zero-shot Generalization

TTRL: Boost LLM Reasoning Without Labels Using Test-Time Reinforcement Learning 836

Imagine being able to improve a large language model’s (LLM) reasoning capabilities after deployment, using only unlabeled test data—no ground-truth…

01/05/2026Reasoning, Reinforcement Learning, Test-time Scaling

RecAI: Prevent Out-of-Domain Recommendations with Plug-and-Play LLM Integration for Accurate, Explainable, and Interactive Systems 913

Large Language Models (LLMs) have unlocked new possibilities for recommender systems—enabling natural-language interactions, personalized explanations, and dynamic user control. Yet…

01/05/2026Constrained Generation, Explainable AI, Recommender Systems