yousefkotp / visual-question-answering Goto Github PK
View Code? Open in Web Editor NEWA Light weight deep learning model with with a web application to answer image-based questions with a non-generative approach for the VizWiz grand challenge 2023 by carefully curating the answer vocabulary and adding linear layer on top of Open AI's CLIP model as image and text encoder