Piyush Bagad's Projects
Tests for codec artefacts in stored audio samples.
Audio Visual Instance Discrimination with Cross-Modal Agreement
Scalable Bayesian Optimization: Comparison of various methods
Test repository for Blender Python test examples
Social Sign-In Buttons for Bootstrap
My personal introductory repository
A portfolio page
Evaluation measures for the EPIC-KITCHENS-100 Action Detection challenge
Evaluating CLIP's cross-modal grounding using explainability methods.
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Simple image captioning model
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
Exploration of the effect of correlations among features on GAN generation ability
WA Bot for COVID-19 Resources help
Code for lab assignments for Computer Vision 1 (UvA)
Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)
Official PyTorch implementation of Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks (ICLR 2022).
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Fast Segment Anything
Fully-Convolutional Network for Pitch Estimation of Speech Signals
Basic tutorial (template) for using Hydra configs for ML projects
Video Foundation Models & Data for Multimodal Understanding
A modern, highly customizable, responsive Jekyll theme for documentation with built-in search.
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
LAVIS - A One-stop Library for Language-Vision Intelligence
Course files for CS771A - Introduction to Machine Learning