surabhigovil / lipscribe
A 3D CNN-based video classification Android application that transcribes the lip movements of a speaker in a silent video to text. The neural network captures the spatio-temporal information needed to generate words from the video. The model was deployed to the Android app in a CI/CD fashion using MLOps on Vertex AI.
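
To illustrate what "spatio-temporal" means here, the following is a minimal sketch of a single-channel 3D convolution written in plain Python. It is not the model from this repository: the function name `conv3d_valid`, the toy clips, and the temporal-difference kernel are all assumptions chosen for illustration. The point is only that a 3D kernel slides over time as well as height and width, so it can respond to motion between frames, which a 2D kernel applied to one frame cannot.

```python
def conv3d_valid(clip, kernel):
    """Naive single-channel 3D cross-correlation, stride 1, no padding.

    clip   -- nested list indexed [frame][row][col]
    kernel -- nested list indexed [dt][dy][dx]
    (Hypothetical helper for illustration; real models use a framework layer.)
    """
    T, H, W = len(clip), len(clip[0]), len(clip[0][0])
    kt, kh, kw = len(kernel), len(kernel[0]), len(kernel[0][0])
    out = []
    for t in range(T - kt + 1):          # slide along time
        plane = []
        for y in range(H - kh + 1):      # slide along height
            row = []
            for x in range(W - kw + 1):  # slide along width
                acc = 0.0
                for dt in range(kt):
                    for dy in range(kh):
                        for dx in range(kw):
                            acc += clip[t + dt][y + dy][x + dx] * kernel[dt][dy][dx]
                row.append(acc)
            plane.append(row)
        out.append(plane)
    return out


# A 2x1x1 kernel that subtracts the previous frame from the current one.
temporal_diff = [[[-1]], [[1]]]

# Three identical 2x2 frames: nothing moves.
static_clip = [[[1, 1], [1, 1]] for _ in range(3)]

# A single bright pixel appears, then shifts one column to the right.
moving_clip = [
    [[0, 0], [0, 0]],
    [[1, 0], [0, 0]],
    [[0, 1], [0, 0]],
]

static_out = conv3d_valid(static_clip, temporal_diff)
moving_out = conv3d_valid(moving_clip, temporal_diff)
```

With this kernel the static clip produces an all-zero response, while the moving clip produces non-zero values wherever a pixel changes between consecutive frames. A trained 3D CNN learns many such kernels, spanning several frames and pixels at once, rather than using a hand-written one.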