Light

lum1104 / minisora Goto Github PK

View Code? Open in Web Editor NEW

This project forked from mini-sora/minisora

0.0 0.0 0.0 587 KB

minisora's Introduction

Mini Sora 社区

Mini Sora 开源社区定位为由社区同学自发组织的开源社区（免费不收取任何费用、不割韭菜），Mini Sora 计划探索实现 Sora 的实现路径可发展可能：

将定期举办 Sora 的圆桌和社区一起探讨可能性
视频生成的现有技术路径探讨

论文共读计划

Sora 技术报告: Video generation models as world simulators
DiT: Scalable Diffusion Models with Transformers
Latte: Latte: Latent Diffusion Transformer for Video Generation
更新中...

相关工作

Diffusion Model

论文	链接
1) Guided-Diffusion: Diffusion Models Beat GANs on Image Synthesis	Paper, Github
2) Latent Diffusion: High-Resolution Image Synthesis with Latent Diffusion Models	Paper, Github
3) EDM: Elucidating the Design Space of Diffusion-Based Generative Models	Paper, Github
4) DDPM: Denoising Diffusion Probabilistic Models	Paper, Github
5) DDIM: Denoising Diffusion Implicit Models	Paper, Github
6) Score-Based Diffusion: Score-Based Generative Modeling through Stochastic Differential Equations	Paper, Github

Diffusion Transformer

论文	链接
1) UViT: All are Worth Words: A ViT Backbone for Diffusion Models	Paper, Github, ModelScope
2) DiT: Scalable Diffusion Models with Transformers	Paper, Github, ModelScope
3) SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers	Paper, Github, ModelScope
4) FiT: Flexible Vision Transformer for Diffusion Model	Paper, Github

Video Generation

论文	链接
1) Animatediff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning	Paper, Github, ModelScope
2) I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models	Paper, Github, ModelScope
4) Imagen Video: High Definition Video Generation with Diffusion Models	Paper
5) MoCoGAN: Decomposing Motion and Content for Video Generation	Paper
6) Adversarial Video Generation on Complex Datasets	Paper
7) Photorealistic Video Generation with Diffusion Models	Paper
8) VideoGPT: Video Generation using VQ-VAE and Transformers	Paper, Github
9) Video Diffusion Models	Paper, Github, Project
10) MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation	Paper, Github, Project, Blog

Long-context

论文	链接
1) World Model on Million-Length Video And Language With RingAttention	Paper, Github
2) Ring Attention with Blockwise Transformers for Near-Infinite Context	Paper, Github
3) Extending LLMs' Context Window with 100 Samples	Paper, Github
4) Efficient Streaming Language Models with Attention Sinks	Paper, Github
5) The What, Why, and How of Context Length Extension Techniques in Large Language Models – A Detailed Survey	Paper

Base Video Models

Paper	Links
1) ViViT: A Video Vision Transformer	Paper, Github

Mini Sora 微信社区社区交流群

现有高质量资料

社区贡献者

minisora's People

Contributors

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.