
OpenGVLab's Projects

all-seeing

[ICLR 2024] This is the official implementation of the paper "The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World"

ask-anything

[CVPR 2024 Highlight][VideoChatGPT] ChatGPT with video understanding, plus support for many more LMs such as miniGPT4, StableLM, and MOSS.

awesome-draggan

Awesome-DragGAN: A curated list of papers, tutorials, and repositories related to DragGAN

awesome-llm4tool

A curated list of papers, repositories, tutorials, and anything else related to large language models for tools

cafo

[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners

chartast

ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.

controlllm

ControlLLM: Augment Language Models with Tools by Searching on Graphs

dcnv4

[CVPR 2024] Deformable Convolution v4

ddps

Official Implementation of "Denoising Diffusion Semantic Segmentation with Mask Prior Modeling"

diffagent

[CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model

diffrate

[ICCV 2023] An approach that improves the efficiency of Vision Transformers (ViT) by applying token pruning and token merging concurrently, with a differentiable compression rate.

draggan

Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with an online demo and local deployment; code and models are fully open source; supports Windows, macOS, and Linux)

egoexolearn

Data and benchmark code for the EgoExoLearn dataset

gitm

Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory

gv-benchmark

General Vision Benchmark, GV-B, a project from OpenGVLab

hulk

An official implementation of "Hulk: A Universal Knowledge Translator for Human-Centric Tasks"

humanbench

This repo is the official implementation of HumanBench (CVPR 2023)

instruct2act

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

interngpt

InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, GPT-4-style multimodal chat, SAM, interactive image editing, and more. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

internimage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

internvideo

Video Foundation Models & Data for Multimodal Understanding

internvl

InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. [CVPR 2024 Oral]
