Name: Ali Vosoughi
Type: User
Company: University of Rochester
Bio: Ph.D. of the Elec. & Comp. Eng.
University of Rochester
Rochester, New York, USA
Twitter: ali_vosough
Location: Rochester, New York
Blog: https://alivosoughi.com/
Ali Vosoughi's Projects
A list of papers about audio captioning
A self-supervised learning framework for audio-visual speech
ICCV 2023 AV4D paper - Audio-visual Sound Separation
Codes for ICASSP 2022 paper: Relation discovery in nonlinearly related large-scale settings
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation
[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383
Co-Separating Sounds of Visual Objects (ICCV 2019)
🔥🔥🔥 ICASSP 2024: Learning Audio Concepts from Counterfactual Natural Language
A faster pytorch implementation of faster r-cnn
Graph Attention Networks (https://arxiv.org/abs/1710.10903)
GIT: A Generative Image-to-text Transformer for Vision and Language
GLoRIA: A Multimodal Global-Local Representation Learning Framework forLabel-efficient Medical Image Recognition
Inference of relations between nodal time-series observations.
ICCV 2023 paper - task-based dialog system
🔥🔥🔥 Repository for our IEEE Transactions on Multimedia paper
NeurIPS 2019 Paper: RUBi : Reducing Unimodal Biases for Visual Question Answering
Codebase for ECCV18 "The Sound of Pixels"
Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation" in NeurIPS 2021