DoSung Lee's Projects
Official code for the WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"
Accessible large language models via k-bit quantization for PyTorch.
Official code for "A Closer Look at Audio-Visual Segmentation"
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation
Official implementation of the paper "Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation"
COSE474: Deep Learning @ Korea University
This project reproduces the book Dive Into Deep Learning (www.d2l.ai), adapting the code from MXNet into PyTorch.
Denoising Diffusion Probabilistic Models
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Create your own rotating proxy server with Docker. A self-hosted rotating proxy service.
GPT-based chat app
Code and Checkpoints for "Generate rather than Retrieve: Large Language Models are Strong Context Generators" in ICLR 2023.
[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation
Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks" (NeurIPS 2023).
Config files for my GitHub profile.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Jekyll theme for building a personal site, blog, project documentation, or portfolio.
OpenMMLab Detection Toolbox and Benchmark
MUSIC Dataset from The Sound of Pixels (ECCV '18)
Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022
NeMo: a framework for generative AI