GLM-130B: A Truly Open, Bilingual 130B-Parameter Language Model That Runs on Consumer GPUs

SmoothQuant: Accurate 8-Bit LLM Inference Without Retraining – Slash Memory and Boost Speed

Magika: AI-Powered File Type Detection with 99% Accuracy and Millisecond Speed

Self-Instruct: Bootstrap High-Quality Instruction Data Without Human Annotations

TTRL: Boost LLM Reasoning Without Labels Using Test-Time Reinforcement Learning
