OpenAI Launches Sora, a Text-to-Video AI Model

OpenAI is launching a new video-generation model called Sora. The AI company says Sora “can create realistic and imaginative scenes from text instructions.” The text-to-video model allows users to create photorealistic videos up to a minute long, all based on prompts they’ve written.

Prompt: A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage. She wears a black leather jacket, a long red dress, and black boots, and carries a black purse. She wears sunglasses and red lipstick. She walks confidently and casually. The street is damp and reflective, creating a mirror effect of the colourful lights. Many pedestrians walk about.

Sora can create “complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background,” according to OpenAI’s introductory blog post. The company also notes that the model can understand how objects “exist in the physical world,” and “accurately interpret props and generate compelling characters that express vibrant emotions.”

OpenAI Unveils Sora
OpenAI has introduced Sora, its latest video-generation model. Sam Altman, the CEO of OpenAI, shared the news in a tweet on February 15, 2024.

In the tweet, Altman gave a glimpse of Sora's capabilities, said that red-teaming was beginning, and said that limited access was being offered to creators. He also credited @_tim_brooks, @billpeeb, and @model_mechanic for their work on the model.

Sora is currently only available to “red teamers” who are assessing the model for potential harms and risks. OpenAI is also offering access to some visual artists, designers, and filmmakers to get feedback. It notes that the existing model might not accurately simulate the physics of a complex scene and may not properly interpret certain instances of cause and effect.

Earlier this month, OpenAI announced it’s adding watermarks to its text-to-image tool DALL-E 3, but noted that they can “easily be removed.” As with its other AI products, OpenAI will have to contend with the consequences of fake, AI-generated photorealistic videos being mistaken for the real thing.

What is OpenAI Sora?

Sora is an AI model developed by OpenAI, built on past research in DALL·E and GPT models. It can generate videos from text instructions and can also animate a static image, turning it into a video. Sora can create an entire video in one go or extend an existing video to make it longer. It can produce videos up to one minute long while maintaining visual quality and adherence to the user's prompt.


