mbzuai-oryx / groundinglmm Goto Github PK
View Code? Open in Web Editor NEW[CVPR 2024 ๐ฅ] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Home Page: https://grounding-anything.com