OpenAI, the innovative tech firm now under Microsoft Corp. (NYSE:MSFT), has unveiled Sora, a groundbreaking artificial intelligence model designed to transform the content creation landscape with its ability to generate high-quality videos from textual prompts.
This revolutionary tool promises to bring a new level of creativity and realism to digital content, setting a new standard in the industry.
Sam Altman, the CEO of OpenAI, expressed his enthusiasm for this development, describing it as a “remarkable moment” in a Thursday post on X.
Altman highlighted the initiation of red-teaming efforts to identify potential harms or risks associated with Sora, emphasizing the company’s commitment to responsible AI development. He also acknowledged the significant contributions of Tim Brooks and Bill Peebles, research scientists at OpenAI, and Aditya Ramesh, the mind behind DALL-E, for their pivotal roles in this breakthrough.
here is sora, our video generation model:https://t.co/CDr4DdCrh1today we are starting red-teaming and offering access to a limited number of creators.@_tim_brooks @billpeeb @model_mechanic are really incredible; amazing work by them and the team.remarkable moment.
— Sam Altman (@sama) February 15, 2024
How Does OpenAI Sora Work?
Sora stands as a testament to OpenAI’s continued innovation, building upon the foundations laid by DALL-E and GPT models. Utilizing advanced recaptioning techniques from DALL-E 3, Sora excels in translating detailed textual instructions into visually stunning and accurate video content.
For instance, a prompt like “A gorgeously rendered papercraft world of a coral reef, rife with colorful fish and sea creatures,” turns into a video format as the post below shows:
Excited to share what @billpeeb @_tim_brooks and my team has been working on for the past year! Our text-to-video model Sora can generate videos of complex scenes up to a minute long. We’re excited about making this step toward AI that can reason about the world like we do.… pic.twitter.com/yXfwvYgcE2
— Aditya Ramesh (@model_mechanic) February 15, 2024
“Sora is able to generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background. The model understands not only what the user has asked for in the prompt, but…
Click Here to Read the Full Original Article at Cryptocurrencies Feed…