Hung Q. Vo's Projects
A hybrid information extraction system for unstructured mammography reports. • Generates information frames using dependency parsing and distributed semantics. • Information frames comprise entities and relations that capture imaging observations. • This system obtains an F1-score of 0.94 in extracting complete lesion information. • Outperforms
Computer-aided detection using Faster R-CNN
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
Starter from "How to Train a GAN?" at NIPS 2016
Google Research
ICCV 2019, Hand Detection
A curated list of ML|NLP resources for healthcare.
HI-ML toolbox for deep learning for medical imaging and Azure integration
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management.
PyTorch code to run synthetic experiments.
[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
scikit-learn cross validators for iterative stratification of multilabel data
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
LibFewShot: A Comprehensive Library for Few-shot Learning.
Implementation of Linformer for PyTorch
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
(NeurIPS 2022 CellSeg Challenge - 1st Winner) Open source code for "MEDIAR: Harmony of Data-Centric and Model-Centric for Multi-Modality Microscopy"
A list of Medical imaging datasets.
Library for clinical NLP with spaCy.
Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
OpenMMLab Detection Toolbox and Benchmark