If you’re building or scaling a system that relies on large language models (LLMs)—whether for chatbots, embeddings, multimodal reasoning, or…
Uni-Mol: High-Accuracy 3D Molecular Modeling for Realistic Drug Discovery and Virtual Screening 1003
In the rapidly evolving field of computational drug discovery, one of the most persistent challenges is accurately predicting how small…
UniRepLKNet: A Universal Large-Kernel ConvNet for Faster, Stronger, and Truly Multimodal AI 1053
In the era of Vision Transformers and increasingly complex multimodal architectures, convolutional neural networks (ConvNets) have often been written off…
FluxMusic: High-Quality Text-to-Music Generation with Faster, More Controllable Rectified Flow Transformers 1713
FluxMusic represents a significant step forward in the field of AI-driven audio synthesis—specifically for generating music directly from natural language…
Mini-Gemini: Close the Gap with GPT-4V and Gemini Using Open, High-Performance Vision-Language Models 3323
In today’s AI landscape, multimodal systems that understand both images and language are no longer a luxury—they’re a necessity. Yet,…
BM25S: Ultrafast Lexical Search in Pure Python—No Java, No PyTorch, Just Speed 1354
In today’s world of AI-powered search and retrieval, speed, simplicity, and low resource usage are non-negotiable—especially during prototyping, research, or…
EasyNLP: Rapidly Build, Train, and Deploy Production-Ready NLP Models—Even with Minimal Labeled Data 2179
Natural Language Processing (NLP) has been revolutionized by pre-trained language models (PLMs), but turning these powerful models into real-world applications…
CodeGeeX: Open-Source Multilingual Code Generation That Boosts Developer Productivity Across 23 Languages 8713
For software teams working across multiple programming languages—or developers tired of vendor lock-in with proprietary AI coding tools—CodeGeeX offers a…
Qwen-Audio: Unified Audio-Language Understanding for Speech, Music, and Environmental Sounds Without Task-Specific Tuning 1848
Audio is one of the richest yet most fragmented modalities in artificial intelligence. Traditional systems often require separate models for…
Chinese-BERT-wwm: High-Performance Chinese Language Understanding with Whole Word Masking for Better Word-Level Context 10147
Chinese-BERT-wwm is a family of pre-trained language models specifically engineered to overcome a key limitation of the original BERT when…