PaperCodex

vLLM: High-Throughput, Memory-Efficient LLM Serving for Real-World Applications

If you’re building or scaling a system that relies on large language models (LLMs)—whether for chatbots, embeddings, multimodal reasoning, or…

01/04/2026 | Efficient Attention, LLM Inference, Model Serving

Uni-Mol: High-Accuracy 3D Molecular Modeling for Realistic Drug Discovery and Virtual Screening

In the rapidly evolving field of computational drug discovery, one of the most persistent challenges is accurately predicting how small…

01/04/2026 | 3D Molecular Representation Learning, Protein-ligand Docking, Quantum Chemical Property Prediction

UniRepLKNet: A Universal Large-Kernel ConvNet for Faster, Stronger, and Truly Multimodal AI

In the era of Vision Transformers and increasingly complex multimodal architectures, convolutional neural networks (ConvNets) have often been written off…

01/04/2026 | Image Classification, Multimodal Perception, Time-series Forecasting

FluxMusic: High-Quality Text-to-Music Generation with Faster, More Controllable Rectified Flow Transformers

FluxMusic represents a significant step forward in the field of AI-driven audio synthesis—specifically for generating music directly from natural language…

01/04/2026 | Audio Synthesis, Rectified Flow Models, Text-to-music Generation

Mini-Gemini: Close the Gap with GPT-4V and Gemini Using Open, High-Performance Vision-Language Models

In today’s AI landscape, multimodal systems that understand both images and language are no longer a luxury—they’re a necessity. Yet,…

12/31/2025 | Document Understanding, Multimodal Reasoning, Vision-Language Modeling

BM25S: Ultrafast Lexical Search in Pure Python—No Java, No PyTorch, Just Speed

In today’s world of AI-powered search and retrieval, speed, simplicity, and low resource usage are non-negotiable—especially during prototyping, research, or…

12/31/2025 | Document Ranking, Information Retrieval, Lexical Search

EasyNLP: Rapidly Build, Train, and Deploy Production-Ready NLP Models—Even with Minimal Labeled Data

Natural Language Processing (NLP) has been revolutionized by pre-trained language models (PLMs), but turning these powerful models into real-world applications…

12/27/2025 | Few-shot Learning, Multimodal NLP, Text Classification

CodeGeeX: Open-Source Multilingual Code Generation That Boosts Developer Productivity Across 23 Languages

For software teams working across multiple programming languages—or developers tired of vendor lock-in with proprietary AI coding tools—CodeGeeX offers a…

12/27/2025 | Code Generation, Code Translation, Multilingual Programming

Qwen-Audio: Unified Audio-Language Understanding for Speech, Music, and Environmental Sounds Without Task-Specific Tuning

Audio is one of the richest yet most fragmented modalities in artificial intelligence. Traditional systems often require separate models for…

12/27/2025 | Audio-Language Modeling, Multimodal Understanding, Universal Audio Recognition

Chinese-BERT-wwm: High-Performance Chinese Language Understanding with Whole Word Masking for Better Word-Level Context

Chinese-BERT-wwm is a family of pre-trained language models specifically engineered to overcome a key limitation of the original BERT when…

12/27/2025 | Chinese Natural Language Inference, Chinese Reading Comprehension, Chinese Text Classification

Copyright © 2026 PaperCodex.