
Visit Website
(0 user reviews)
A model for generating videos from text.
Phenaki is an AI model designed to produce videos of varying lengths from written text. Additionally, it has the capability to generate videos based on a single image and a prompt. This innovative video encoder-decoder model surpasses the existing per-frame baselines commonly utilized in research, excelling in both spatio-temporal quality and the number of tokens per video. Consequently, text tokens are transformed into video tokens through a bidirectional masked transformer that is conditioned on pre-computed text tokens. These generated video tokens are then de-tokenized to form the finalized video.
People Also Viewed





Get Featured! 🚀
Feature your AI brand at the top of our homepage for 7 days! Exclusive sponsorship for AI tools, platforms, and applications.
Get Featured NowPromote Phenaki
