Giter VIP home page Giter VIP logo

awesome-mm-chat's Introduction

awesome-mm-chat

多模态 MM +Chat 合集

BERT 解读

具体见 BERT

GPT 解读

具体见 GPT

CLIP 解读

具体见 CLIP

BLIP 解读

具体见 BLIP 和 BLIP2 解读

LLaMA 解读

具体见 llama

DetGPT 解读

具体见 DetGPT

Visual Segmentation 解读

Visual Segmentation 指的是通用图像分割,包括开放集。

具体见 Visual Segmentation

Multi Dataset 解读

指的是在 CV 任务中特别是检测任务中常用的多数据集联合训练论文

具体见 Multi Dataset

LLM 解读

大语言模型相关论文

具体见 LLM

MLLM 解读

视觉多模态大语言模型相关论文

具体见 MLLM

mmpretrain 多模态部分

具体见 mmpretrain

Tools 解读

存放和 LLM tool 相关的内容,例如 visual chatgpt 等

具体见 Tools

HuggingFace Transformers 基础教程

本部分用于 CVer 们快速上手 HuggingFace Transformers

具体见 HuggingFace Transformers

LangChain

官方地址: https://github.com/hwchase17/langchain
文档: https://python.langchain.com/en/latest/

具体见 langchain

PEFT

Parameter-Efficient Fine-Tuning

官方地址: https://github.com/huggingface/peft

具体见 PEFT

Diffusers

具体见 Diffusers

CVPR2023 检测方向分析

内容已经发布到知乎,具体见: https://zhuanlan.zhihu.com/p/632210111

SAM 及其后续工作

具体见 SAM

DETR 系列代码理解

具体见 DETR

训练和推理技术

详情见 technology

awesome-mm-chat's People

Contributors

hhaandroid avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.