In real-world computer vision systems—whether for autonomous vehicles, remote sensing, or robotic inspection—images rarely come from a single type of…
Zero-shot Generalization
Stereo Anything: Zero-Shot Stereo Matching That Works Across Any Domain Without Retraining 803
Stereo matching—the task of finding corresponding pixels between left and right images to infer depth—is foundational to 3D vision systems…
Self-Instruct: Bootstrap High-Quality Instruction Data Without Human Annotations 4557
For teams building or fine-tuning large language models (LLMs), one of the biggest bottlenecks is the scarcity of high-quality, diverse…