Prithivida's Projects
Airport Taxi Demand Forecast ML problem
Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models to do ZSC. Hence, can be lightweight + supports more languages without trading-off accuracy. (Super simple, a 10th-grader could totally write this but since no 10th-grader did, I did) - Prithivi Da
Amazon Textract Code Samples
Companion Repo for the book The Applied ML Field Manual, Prithiviraj Damodaran
Asynchronous Http and WebSocket Client library for Java
CEP for Banking systems powered by Apache Flink
Keras Layer implementation of Attention
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Extension of bias-bench for select Sentence Transformers* models
C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welcome.
This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences from C4 using a tagged corruption model. The approach and the dataset are described in more detail by Stahlberg and Kumar (2021) (https://www.aclweb.org/anthology/2021.bea-1.4/)
Creative Commons copyright license files
EMNLP 2021 - Pre-training architectures for dense retrieval
Controlled Caption Generation for Images
Prithivi's DataScience Portfolio
This is a demo of a dataframe with editable cells, powered by `streamlit-aggrid` from Pablo Fonseca. You can edit the cells by clicking on them and then export your selection to a csv file! 🎈
Face Transformer for Recognition
Project description in https://gombru.github.io/
Apache Storm ( Trident ), Redis, Node.js, Socket.IO based Real Time Dashboards
Lightweight Python library to add low-footprint (all-MiniLM-* equivalent) multilingual retrievers to your RAG and Search & Retrieval pipelines.
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.
A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
This is a github page for site howtostream.org
Image CEntric CAption Generation Evaluation
Indeed Salary Prediction
Pytorch0.4.1 codes for InsightFace
The PyTorch implementation the Smooth Grad [https://arxiv.org/pdf/1706.03825.pdf] and Integrated Gradients [https://arxiv.org/pdf/1703.01365.pdf] for NLP Models. Fixed for latest HF changes by Prithivi Da