Narasimman's Projects
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
Lab files for AI-102 - AI Engineer
Create own chatbot dialog flow project
🧑🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
✨Argilla: the open-source data curation platform for LLMs
A curated list of awesome System Design (A.K.A. Distributed Systems) resources.
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to achieve a query-aware context representation without early summarization.
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Design Pattern Examples in Python
RVL-CDIP could be looked at as the equivalent of ImageNet for the document image community. It’s certainly the largest we’ve seen in the literature. There are 400,000 total document images in the dataset. The dataset contains much noise and variance in composition of each document class. Uncompressed, the dataset size is ~100GB, and comprises 16 classes of document types, with 25,000 samples per classes. Example classes include email, resume, and invoice. Achieved an Accuracy of over 93% which beat the benchmark score of 92% based on https://paperswithcode.com/sota/document-image-classification-on-rvl-cdip
User cane define the flow.
Pure implementation of ELM (Extreme Learning Machine) in python (just with numpy)
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Fast Segment Anything
Topic Modelling for Humans
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
Build animated charts in Jupyter notebook with a simple Python synthax.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
LLM Finetuning with peft
Repository that contains LLM fine-tuning and deployment scripts along with our research findings.