Imagine describing a “mechanical lobster with tank treads” in plain English and instantly getting a usable 3D model—no Blender expertise,…
Multimodal AI
AudioGPT: Build Spoken AI Experiences with Speech, Music, Sound, and Talking Head Generation in One Unified System 10209
AudioGPT is a multimodal AI system that bridges the gap between large language models (LLMs) like ChatGPT and the rich…
MNN: Run Large Language Models and Vision AI Offline on Mobile with a Lightweight, High-Performance Inference Engine 13694
Mobile Neural Network (MNN) is an open-source, lightweight deep learning inference engine developed by Alibaba Group to bring powerful AI…