Alibaba's Mach Creates Lifelike AI Avatars from Text

Alibaba has developed a new text-to-3D model called Make-A-Character (Mach) that can generate detailed and lifelike 3D avatars from simple text descriptions.

Summary

Alibaba's Mach seamlessly converts textual descriptions into visual avatars, providing an easy way to create custom 3D avatars.
It leverages large language and vision foundation models like Stable Diffusion and ControlNet to generate reference portrait images from text.
The generated 2D images are then converted into 3D face meshes and textures and assembled with matched accessories.
The avatars focus on Asian ethnicities currently, but support for different ethnicities and styles will be added.
Expression, motion and cloth generation capabilities driven by text will also be developed.
Alibaba also introduced Richdreamer for 2D to 3D generation and Animate Anyone for transforming images into character videos.
Additionally, new language models Qwen-72B and Qwen-1.8B were launched to advance language capabilities.

READ MORE