Skip to content

PaperCodex

Subscribe
VerlTool: A Unified Framework for Scalable, Multi-Turn Tool-Using Agents with Reinforcement Learning

VerlTool: A Unified Framework for Scalable, Multi-Turn Tool-Using Agents with Reinforcement Learning 561

Building intelligent agents that can reason, interact with external tools, and learn from experience is a cornerstone of next-generation AI…

01/09/2026Multi-modal Tool Integration, Multi-turn Agentic Reasoning, Tool-augmented Reinforcement Learning
GLM-130B: A Truly Open, Bilingual 130B-Language Model That Runs on Consumer GPUs

GLM-130B: A Truly Open, Bilingual 130B-Language Model That Runs on Consumer GPUs 7680

If you’re evaluating large language models (LLMs) for real-world deployment—especially in multilingual settings—you’ve likely hit a wall: most top-performing models…

01/05/2026Bilingual Language Modeling, Text Generation, Zero-shot Inference
SmoothQuant: Accurate 8-Bit LLM Inference Without Retraining – Slash Memory and Boost Speed

SmoothQuant: Accurate 8-Bit LLM Inference Without Retraining – Slash Memory and Boost Speed 1576

Deploying large language models (LLMs) in production is expensive—not just in dollars, but in compute and memory. While models like…

01/05/2026Efficient LLM Deployment, Large Language Model Inference, Post-training Quantization
Magika: AI-Powered File Type Detection with 99% Accuracy and Millisecond Speed

Magika: AI-Powered File Type Detection with 99% Accuracy and Millisecond Speed 9991

Identifying what kind of data is inside a file seems simple—until you’re dealing with corrupted headers, obfuscated malware, or ambiguous…

01/05/2026AI-powered Security, Content-type Detection, File Classification
Self-Instruct: Bootstrap High-Quality Instruction Data Without Human Annotations

Self-Instruct: Bootstrap High-Quality Instruction Data Without Human Annotations 4557

For teams building or fine-tuning large language models (LLMs), one of the biggest bottlenecks is the scarcity of high-quality, diverse…

01/05/2026Instruction Tuning, Synthetic Data Generation, Zero-shot Generalization
TTRL: Boost LLM Reasoning Without Labels Using Test-Time Reinforcement Learning

TTRL: Boost LLM Reasoning Without Labels Using Test-Time Reinforcement Learning 836

Imagine being able to improve a large language model’s (LLM) reasoning capabilities after deployment, using only unlabeled test data—no ground-truth…

01/05/2026Reasoning, Reinforcement Learning, Test-time Scaling
RecAI: Prevent Out-of-Domain Recommendations with Plug-and-Play LLM Integration for Accurate, Explainable, and Interactive Systems

RecAI: Prevent Out-of-Domain Recommendations with Plug-and-Play LLM Integration for Accurate, Explainable, and Interactive Systems 913

Large Language Models (LLMs) have unlocked new possibilities for recommender systems—enabling natural-language interactions, personalized explanations, and dynamic user control. Yet…

01/05/2026Constrained Generation, Explainable AI, Recommender Systems
FinTeam: A Multi-Agent Financial Intelligence System That Generates Human-Accepted Reports and Outperforms GPT-4o

FinTeam: A Multi-Agent Financial Intelligence System That Generates Human-Accepted Reports and Outperforms GPT-4o 779

Financial analysis is rarely a solo endeavor. In real-world institutions—from investment banks to asset management firms—complex tasks like producing quarterly…

01/05/2026Financial Reasoning, Multi-agent Systems, Retrieval-Augmented Generation
MAGI-1: Autoregressive Video Generation at Scale with Constant Memory and Real-Time Streaming

MAGI-1: Autoregressive Video Generation at Scale with Constant Memory and Real-Time Streaming 530

MAGI-1 is a breakthrough world model designed for autoregressive video generation at scale. Unlike conventional video diffusion or transformer-based approaches…

01/05/2026Autoregressive Modeling, Image-to-video, Video Generation
HiDream-I1: Generate and Edit High-Quality Images in Seconds with Sparse Diffusion Transformer

HiDream-I1: Generate and Edit High-Quality Images in Seconds with Sparse Diffusion Transformer 777

The rapid evolution of AI-driven image generation has unlocked incredible creative potential—but often at a steep cost: slow inference, massive…

01/05/2026Instruction-based Image Editing, Multimodal Generative Modeling, Text-to-Image Generation

Posts pagination

Previous 1 … 15 16 17 … 53 Next
Copyright © 2026 PaperCodex.
  • Facebook
  • YouTube
  • Twitter

PaperCodex