ByteDance Develops OmniHuman, an AI Framework That Can Generate Realistic Videos of Humans
The company describes its use of motion signals as a novel technique, which it calls omni-conditions training. Under this approach, the AI model is trained on several modalities at once, including text, image, audio, and video. The researchers said this allowed the model to learn from mixed conditioning, which helped it overcome the scarcity of high-quality data.
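To make the idea of mixed conditioning more concrete, here is a minimal Python sketch of how a training loop might choose which signals to condition on for each clip. It is an illustration only and not ByteDance's implementation: the function name `select_conditions`, the `DEFAULT_KEEP_PROB` table, and every probability value are hypothetical assumptions made for this example.

```python
import random

# Hypothetical keep probabilities per modality. These values are
# illustrative assumptions, not figures published by ByteDance.
DEFAULT_KEEP_PROB = {"text": 0.9, "image": 0.9, "audio": 0.5, "video": 0.25}


def select_conditions(sample, rng, keep_prob=None):
    """Pick the conditioning signals to use for one training step.

    `sample` maps a modality name to its data, or to None when the clip
    lacks that modality. Missing modalities are skipped instead of
    causing the whole clip to be discarded.
    """
    if keep_prob is None:
        keep_prob = DEFAULT_KEEP_PROB
    active = {}
    for name, prob in keep_prob.items():
        data = sample.get(name)
        if data is None:
            continue  # this clip has no such signal; keep the clip anyway
        if rng.random() < prob:
            active[name] = data  # condition on this modality for this step
    return active


if __name__ == "__main__":
    rng = random.Random(0)
    # A clip with no audio track still yields a usable training example.
    clip = {
        "text": "a person waving",
        "image": "frame0.png",
        "audio": None,
        "video": "pose.mp4",
    }
    print(sorted(select_conditions(clip, rng)))
```

In this sketch, a clip that lacks one modality still contributes through the signals it does have, which is one plausible way that training on mixed conditions could widen the pool of usable data.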