PaperCodex

EasyNLP: Rapidly Build, Train, and Deploy Production-Ready NLP Models—Even with Minimal Labeled Data 2179

Natural Language Processing (NLP) has been revolutionized by pre-trained language models (PLMs), but turning these powerful models into real-world applications…

12/27/2025Few-shot Learning, Multimodal NLP, Text Classification

CodeGeeX: Open-Source Multilingual Code Generation That Boosts Developer Productivity Across 23 Languages 8713

For software teams working across multiple programming languages—or developers tired of vendor lock-in with proprietary AI coding tools—CodeGeeX offers a…

12/27/2025Code Generation, Code Translation, Multilingual Programming

Qwen-Audio: Unified Audio-Language Understanding for Speech, Music, and Environmental Sounds Without Task-Specific Tuning 1848

Audio is one of the richest yet most fragmented modalities in artificial intelligence. Traditional systems often require separate models for…

12/27/2025Audio-Language Modeling, Multimodal Understanding, Universal Audio Recognition

Chinese-BERT-wwm: High-Performance Chinese Language Understanding with Whole Word Masking for Better Word-Level Context 10147

Chinese-BERT-wwm is a family of pre-trained language models specifically engineered to overcome a key limitation of the original BERT when…

12/27/2025Chinese Natural Language Inference, Chinese Reading Comprehension, Chinese Text Classification

PaperCodex

EasyNLP: Rapidly Build, Train, and Deploy Production-Ready NLP Models—Even with Minimal Labeled Data 2179

CodeGeeX: Open-Source Multilingual Code Generation That Boosts Developer Productivity Across 23 Languages 8713

Qwen-Audio: Unified Audio-Language Understanding for Speech, Music, and Environmental Sounds Without Task-Specific Tuning 1848

Chinese-BERT-wwm: High-Performance Chinese Language Understanding with Whole Word Masking for Better Word-Level Context 10147

XLNet: Bidirectional Language Understanding Without Masked Input Limitations 6180

Qwen-VL: Open-Source Vision-Language AI for Text Reading, Object Grounding, and Multimodal Reasoning 6422

NeMo: Build Production-Grade Speech, LLM, and Multimodal AI Faster with NVIDIA’s Optimized Framework 16305

MetaCLIP: Superior Vision-Language Models Through Transparent, High-Quality Data Curation 1692

SPIN: Boost Your LLM’s Performance Without New Human Annotations—Just Use Self-Play Fine-Tuning 1206