Topic: video-understanding Goto Github
Some thing interesting about video-understanding
Some thing interesting about video-understanding
video-understanding,[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.
Organization: alibaba-mmai-research
Home Page: https://tadaconv-iclr2022.github.io
video-understanding,Video Contrastive Learning with Global Context, ICCVW 2021
Organization: amazon-science
video-understanding,[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
User: antoyang
Home Page: https://arxiv.org/abs/2206.08155
video-understanding,[ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
User: antoyang
Home Page: https://arxiv.org/abs/2012.00451
video-understanding,[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
User: antoyang
video-understanding,[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
User: antoyang
Home Page: http://arxiv.org/abs/2309.13952
video-understanding,(2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
User: boheumd
Home Page: https://boheumd.github.io/MA-LMM/
video-understanding,Temporal Segments LSTM and Temporal-Inception for Activity Recognition
User: chihyaoma
video-understanding,Paper list of activity prediction and related area
User: chinancheng
video-understanding,[CVPR 2020] Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation (PyTorch)
User: cmhungsteve
Home Page: https://arxiv.org/abs/2003.02824
video-understanding,[ICCV 2021 Oral] Deep Evidential Action Recognition
User: cogito2012
video-understanding,Pytorch Implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori , ECCV 2018
User: fabienbaradel
video-understanding,Easily convert RGB video data (e.g. .avi) to the TensorFlow tfrecords file format for training e.g. a NN in TensorFlow. This implementation allows to limit the number of frames per video to be stored in the tfrecords.
User: ferreirafabio
video-understanding,[ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
User: henghuiding
Home Page: https://henghuiding.github.io/MeViS/
video-understanding,Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral
Organization: hustvl
Home Page: https://arxiv.org/abs/2204.08412
video-understanding,A curated list of action recognition and related area resources
User: jinwchoi
video-understanding,Dataset, code and model for the CVPR'20 paper "The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction". And for the ECCV'20 SimAug paper.
User: junweiliang
Home Page: https://next.cs.cmu.edu/multiverse/
video-understanding,[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
Organization: mcg-nju
Home Page: https://arxiv.org/abs/2012.10071
video-understanding,[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Organization: mcg-nju
Home Page: https://arxiv.org/abs/2203.12602
video-understanding,[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
Organization: mit-han-lab
Home Page: https://arxiv.org/abs/1811.08383
video-understanding,Tools for movie and video research
Organization: movienet
Home Page: http://movienet.github.io
video-understanding,STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)
Organization: nvlabs
video-understanding,An open-source toolbox for action understanding based on PyTorch
Organization: open-mmlab
Home Page: https://open-mmlab.github.io/
video-understanding,OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Organization: open-mmlab
Home Page: https://mmaction2.readthedocs.io
video-understanding,[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Organization: opengvlab
Home Page: https://vchat.opengvlab.com/
video-understanding,Video Foundation Models & Data for Multimodal Understanding
Organization: opengvlab
video-understanding,[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Organization: opengvlab
Home Page: https://arxiv.org/abs/2303.16727
video-understanding,Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.
Organization: paddlepaddle
video-understanding,[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Organization: pku-yuangroup
Home Page: https://arxiv.org/abs/2311.08046
video-understanding,deep learning sex position classifier
User: rlleshi
video-understanding,ActionVLAD for video action classification (CVPR 2017)
User: rohitgirdhar
Home Page: https://rohitgirdhar.github.io/ActionVLAD/
video-understanding,CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
User: rohitgirdhar
Home Page: https://rohitgirdhar.github.io/CATER/
video-understanding,A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Organization: showlab
video-understanding,awesome grounding: A curated list of research papers in visual grounding
User: theshadow29
video-understanding,TensorFlow code for finetuning I3D model on UCF101.
Organization: ustc-video-understanding
video-understanding,Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
User: v-iashin
Home Page: https://v-iashin.github.io/SpecVQGAN
video-understanding,The 2nd place Solution to the Youtube-8M Video Understanding Challenge by Team Monkeytyping (based on tensorflow)
User: wangheda
Home Page: https://arxiv.org/abs/1706.05150
video-understanding,【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
User: whwu95
Home Page: https://arxiv.org/abs/2301.00182
video-understanding,【CVPR'2023 Highlight】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
User: whwu95
Home Page: https://arxiv.org/abs/2301.00184
video-understanding,【AAAI'2021】MVFNet: Multi-View Fusion Network for Efficient Video Recognition
User: whwu95
Home Page: https://arxiv.org/abs/2012.06977
video-understanding,【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective
User: whwu95
video-understanding,A curated list of “Temporally Language Grounding” and related area
User: wujie1010
video-understanding,PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
User: xyzforever
video-understanding,temporal action detection with SSN
User: yjxiong
video-understanding,Code & Models for Temporal Segment Networks (TSN) in ECCV 2016
User: yjxiong
video-understanding,Temporal Segment Networks (TSN) in PyTorch
User: yjxiong
video-understanding,A collection of recent video understanding datasets, under construction!
User: yoosan
video-understanding,Efficient 3D Backbone Network for Temporal Modeling
User: youngwanlee
Home Page: https://arxiv.org/abs/2012.00317
video-understanding,Useful Toolbox for Anomaly Detection
User: yuhaocheng
video-understanding,[Codes of paper]: PAN: Towards Fast Action Recognition via Learning Persistence of Appearance
User: zhang-can
Home Page: https://arxiv.org/abs/2008.03462
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.