Alibaba's Mach Creates Lifelike AI Avatars from Text
Alibaba has developed a new text-to-3D model called Make-A-Character (Mach) that can generate detailed and lifelike 3D avatars from simple text descriptions.
Summary
- Alibaba's Mach converts text descriptions into 3D avatars, offering a simple way to create custom characters.
- It uses large language models to interpret the text and vision foundation models such as Stable Diffusion and ControlNet to generate reference portrait images.
- The generated 2D images are then converted into 3D face meshes and textures and assembled with matched accessories.
- The avatars currently focus on Asian ethnicities; support for other ethnicities and styles is planned.
- Expression, motion and cloth generation capabilities driven by text will also be developed.
- Alibaba also introduced RichDreamer for text-to-3D generation and Animate Anyone for turning still images into character videos.
- Additionally, new language models Qwen-72B and Qwen-1.8B were launched to advance language capabilities.
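
The Mach pipeline described above (text, then a reference portrait, then a 3D face mesh and textures, then matched accessories) can be pictured as a chain of stages. The sketch below is only an illustration of that data flow, not Alibaba's implementation: every class, function, and asset name is hypothetical, each stage returns a placeholder string instead of calling a real model, and the keyword lookup for accessories is an assumption made here to keep the example runnable.

```python
from dataclasses import dataclass, field

# All names here are hypothetical placeholders, not Mach's actual API.

@dataclass
class Portrait:
    prompt: str  # prompt a real system would pass to the image model

@dataclass
class FaceAsset:
    mesh: str     # stand-in for a 3D face mesh
    texture: str  # stand-in for texture maps

@dataclass
class Avatar:
    face: FaceAsset
    accessories: list[str] = field(default_factory=list)

# Hypothetical accessory library keyed by a word in the description.
ACCESSORY_LIBRARY = {"glasses": "glasses_01", "hat": "hat_01"}

def generate_portrait(description: str) -> Portrait:
    # Stage 1: text -> reference portrait. The article names Stable
    # Diffusion and ControlNet for this step; here we only build a prompt.
    return Portrait(prompt=f"frontal portrait, {description}")

def reconstruct_face(portrait: Portrait) -> FaceAsset:
    # Stage 2: 2D portrait -> 3D face mesh and textures (placeholders).
    return FaceAsset(mesh=f"mesh<{portrait.prompt}>",
                     texture=f"texture<{portrait.prompt}>")

def match_accessories(description: str) -> list[str]:
    # Stage 3: pick accessories. Keyword matching is an assumption of
    # this sketch; the article does not say how matching is done.
    text = description.lower()
    return [asset for word, asset in ACCESSORY_LIBRARY.items() if word in text]

def make_character(description: str) -> Avatar:
    # Chain the stages in the order the summary describes.
    portrait = generate_portrait(description)
    face = reconstruct_face(portrait)
    return Avatar(face=face, accessories=match_accessories(description))

if __name__ == "__main__":
    print(make_character("A young woman with short hair and glasses"))
```

Keeping each stage behind its own function mirrors the summary's separation of image generation, 3D reconstruction, and asset assembly, so any one placeholder could be swapped for a real model without touching the others.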