
minisora's Introduction

Mini Sora Community

 

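The Mini Sora open-source community is organized spontaneously by community members (it is free of charge, with no fees and no profiteering). Mini Sora plans to explore possible implementation paths for Sora and its potential directions of development:

  • Hold regular Sora roundtables to explore possibilities together with the community
  • Discuss existing technical approaches to video generation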

Paper Reading Program

Related Work

Diffusion Model

Paper Links
1) Guided-Diffusion: Diffusion Models Beat GANs on Image Synthesis Paper, Github
2) Latent Diffusion: High-Resolution Image Synthesis with Latent Diffusion Models Paper, Github
3) EDM: Elucidating the Design Space of Diffusion-Based Generative Models Paper, Github
4) DDPM: Denoising Diffusion Probabilistic Models Paper, Github
5) DDIM: Denoising Diffusion Implicit Models Paper, Github
6) Score-Based Diffusion: Score-Based Generative Modeling through Stochastic Differential Equations Paper, Github

Diffusion Transformer

Paper Links
1) UViT: All are Worth Words: A ViT Backbone for Diffusion Models Paper, Github, ModelScope
2) DiT: Scalable Diffusion Models with Transformers Paper, Github, ModelScope
3) SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers Paper, Github, ModelScope
4) FiT: Flexible Vision Transformer for Diffusion Model Paper, Github

Video Generation

Paper Links
1) Animatediff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning Paper, Github, ModelScope
2) I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models Paper, Github, ModelScope
4) Imagen Video: High Definition Video Generation with Diffusion Models Paper
5) MoCoGAN: Decomposing Motion and Content for Video Generation Paper
6) Adversarial Video Generation on Complex Datasets Paper
7) Photorealistic Video Generation with Diffusion Models Paper
8) VideoGPT: Video Generation using VQ-VAE and Transformers Paper, Github
9) Video Diffusion Models Paper, Github, Project
10) MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation Paper, Github, Project, Blog

Long-context

Paper Links
1) World Model on Million-Length Video And Language With RingAttention Paper, Github
2) Ring Attention with Blockwise Transformers for Near-Infinite Context Paper, Github
3) Extending LLMs' Context Window with 100 Samples Paper, Github
4) Efficient Streaming Language Models with Attention Sinks Paper, Github
5) The What, Why, and How of Context Length Extension Techniques in Large Language Models – A Detailed Survey Paper

Base Video Models

Paper Links
1) ViViT: A Video Vision Transformer Paper, Github

Mini Sora WeChat Community Group

 

Existing High-Quality Resources

Community Contributors

minisora's People

Contributors

vansin, ming-zch, fanqino1, chg0901, jimmyma99, lum1104, matrixgame2018, nobody-ml, pommespeter, drryanhuang, tackhwa, wenmengzhou, junyaohu
