Giter VIP home page Giter VIP logo

Samarth Mishra's Projects

action-recognition-pytorch icon action-recognition-pytorch

This is the pytorch implementation of some representative action recognition approaches including I3D, S3D, TSN and TAM.

btp icon btp

Dictionary Learning on Images

elite icon elite

ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)

flownet2-pytorch icon flownet2-pytorch

Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

kubric icon kubric

A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.

minigpt-4 icon minigpt-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

open_flamingo icon open_flamingo

An open-source framework for training large multimodal models.

pan icon pan

For the code of ICCV-21 paper "Effectively Leveraging Attributes for Visual Similarity"

pot icon pot

POT : Python Optimal Transport

pwc-net icon pwc-net

PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume, CVPR 2018 (Oral)

pytorch-adain icon pytorch-adain

Unofficial pytorch implementation of 'Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization' [Huang+, ICCV2017]

pytorch-image-models icon pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

submitit icon submitit

Python 3.6+ toolbox for submitting jobs to Slurm

sugar-crepe icon sugar-crepe

[NeurIPS 2023] A faithful benchmark for vision-language compositionality

swinir icon swinir

SwinIR: Image Restoration Using Swin Transformer (official repository)

tdw icon tdw

ThreeDWorld simulation environment

timesformer icon timesformer

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.