Giter VIP home page Giter VIP logo

Xinyu Huang's Projects

actionclip icon actionclip

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

albef icon albef

Code for ALBEF: a new vision-language pre-training method

blip icon blip

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

clip icon clip

Contrastive Language-Image Pretraining

daily_fudan icon daily_fudan

äø€é”®å¹³å®‰å¤ę—¦å°č„šęœ¬ļ¼Œč‡ŖåŠØ化åæ«é€ŸäøŠęŠ„ē–«ęƒ…

grounded-segment-anything icon grounded-segment-anything

Marrying Grounding DINO with Segment Anything & Tag2Text & Stable Diffusion & BLIP & Whisper - Automatically Recognize, Detect, Segment and Generate Anything with Image, Text, and Speech Inputs

groundingdino icon groundingdino

The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

idea-pytorch icon idea-pytorch

Code for paper: IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training [ACM MM2022]

img2dataset icon img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

minigpt-4 icon minigpt-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

moco icon moco

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

query2labels icon query2labels

Official implementation of paper "Query2Label: A Simple Transformer Way to Multi-Label Classification".

robust-loss-mlml icon robust-loss-mlml

Code for paper: Simple and Robust Loss Design for Multi-Label Learning with Missing Labels

ssl-small icon ssl-small

Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".

transformers icon transformers

šŸ¤— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    šŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. šŸ“ŠšŸ“ˆšŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ā¤ļø Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.