Haoran Duan's Projects
A 2D Gaussian Splatting paper for no obvious reasons. Enjoy!
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
Unsupervised Representation Learning for Point Clouds: A Survey
3DILG: Irregular Latent Grids for 3D Generative Modeling
[ICCV 2023] Understanding 3D Object Interaction from a Single Image
Text-to-3D Generation within 5 Minutes
Soon, you will realize that you already know things that you thought you didn’t
Adala: Autonomous DAta (Labeling) Agent framework
Auto detecting, masking and inpainting with detection model.
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
🤖 AgentVerse 🪐 provides a flexible framework that simplifies the process of building custom multi-agent environments for large language models (LLMs).
Aim 💫 — An easy-to-use & supercharged open-source AI metadata tracker (experiment tracking, AI agents tracing)
ALiPy: Active Learning in Python is an active learning python toolbox, which allows users to conveniently evaluate, compare and analyze the performance of active learning methods.
[ICLR 2024] This is the official implementation of the paper "The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World"
🧑🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
Segment-Anything + 3D. Let's lift the anything to 3D.
Generate image from anything with ImageBind and Stable Diffusion
Implementation for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
[CVPR 2024] HIG: Hierarchical Interlacement Graph Approach to Scene Graph Generation in Video Understanding
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)