This project, jointly initiated by the PKU-Tuzhan AIGC Lab, aims to reproduce Sora, OpenAI's text-to-video generation model. Contributions from the open-source community are welcome. The project is released under the Apache-2.0 license.

You can try the demo here: https://huggingface.co/spaces/LanguageBind/Open-Sora-Plan-v1.1.0
Image generation usually requires about 50 sampling steps, while video generation may require around 150 steps to produce good results, which can take 3-4 minutes. As a result, even generating a 2-second video is slow.
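As a rough back-of-envelope check (the 150 steps and 3-4 minute figures come from the text above; the per-step cost is just derived arithmetic, not a measured benchmark):

```python
# Estimate the wall-clock cost per sampling step, assuming the demo's
# reported figures: ~150 steps taking roughly 3-4 minutes in total.
VIDEO_STEPS = 150
TOTAL_SECONDS_LOW = 3 * 60   # 3 minutes
TOTAL_SECONDS_HIGH = 4 * 60  # 4 minutes

per_step_low = TOTAL_SECONDS_LOW / VIDEO_STEPS    # seconds per step, lower bound
per_step_high = TOTAL_SECONDS_HIGH / VIDEO_STEPS  # seconds per step, upper bound

print(f"~{per_step_low:.1f}-{per_step_high:.1f} s per sampling step")
```

So each sampling step costs on the order of 1-2 seconds on the hosted demo, which is why a 2-second clip still takes minutes to produce.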
prompt: Extreme close-up of chicken and green pepper kebabs grilling on a barbeque with flames. Shallow focus and light smoke. vivid colours
prompt: A robot dog trots down a deserted alley at night, its metallic paws clinking softly on the cobblestones, the glow of its LED eyes piercing the darkness. Occasionally, it pauses to scan its surroundings with a soft, whirring sound.
