mcg-nju Goto Github PK
Name: Multimedia Computing Group, Nanjing University
Type: Organization
Location: Nanjing
Blog: mcg.nju.edu.cn
Name: Multimedia Computing Group, Nanjing University
Type: Organization
Location: Nanjing
Blog: mcg.nju.edu.cn
[CVPR 2022 Oral] AdaMixer: A Fast-Converging Query-Based Object Detector
[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models
[TIP] APP-Net: Auxiliary-point-based Push and Pull Operations for Efficient Point Cloud Recognition
BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection
[ECCV 2020] Boundary-Aware Cascade Networks for Temporal Action Segmentation
[CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
[CVPR 2022 Oral & TPAMI 2023] Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR Fusion
[CVPR 2021] CGA-Net: Category Guided Aggregation for Point Cloud Semantic Segmentation
[IJCV 2021] Cross-Modal Pyramid Translation for RGB-D Scene Recognition
[AAAI 2023] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets
Learning Spatiotemporal Features via Video and Text Pair Discrimination
Context-aware RCNN: a Baseline for Action Detection in Videos
[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
[ICCV 2023] Deep Equilibrium Object Detection
[IJCV 2023] Dual Graph Networks for Pose Estimation in Crowded Scenes
[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolatio
[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement
[CVIU] Fully Convolutional Online Tracking
[BMVC 2021] A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark
[ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing
[CVPR 2023] LinK: Linear Kernel for LiDAR-based 3D Perception
Code and pretrained models for LIP: Local Importance-based Pooling (ICCV 19)
[IJCV 2024] Logit Normalization for Long-Tail Object Detection
[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking
[ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding
[ICCV 2021] MGSampler: An Explainable Sampling Strategy for Video Action Recognition
[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention
[NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking
[ICCV2023] MixSort: The Customized Tracker in SportsMOT
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.