Hi there 👋

My name is

Zoumana Keita

  • ⚡ Previously, I worked as a Machine Learning Engineer at Lincoln for a couple of weeks before moving to the US for my Master's in Business & Data Science at Texas Tech University, Rawls College of Business. Before that, I was a Data Scientist for 2 years at Axionable, the first Sustainable AI startup in France and Canada. I also spent 2 years and 6 months at IBM as a Machine Learning Consultant.
  • ❤️ I love Data Science, Natural Language Processing, Cloud Computing & MLOps
  • 🩺 What keeps me in shape
    • When I was in France, I had Taekwondo classes 🥋 on Tuesday, Thursday, Friday & Saturday at Mudo Club Argenteuil
    • Daily morning runner 🏃🏾
    • Occasional football player ⚽️ with friends
    • Attiéké, Yassa, Mafé, Thieb, etc. 😋
  • 🌱 I'm addicted to continuous learning, which helps me grow on a regular basis
  • 🌏 I share my knowledge through my blog to make a positive impact on others' lives
  • 📫 How to find me

πŸ† My Github Stats:

Zoumana's GitHub stats GitHub Views

πŸ… My Most Used Languages:

Zoumana's Top Languages


Data Science, Machine Learning & MLOps Resources.

This is a collection of all the resources I have created, organized by topic.

Subscribe to:

Content

  1. Data Science
  2. Machine Learning
  3. MLOps
  4. Natural Language Processing
  5. Large Language Models
  6. Python
  7. Pandas & Python Tricks
  8. Computer Vision

Data Science

| Title | Article Link | Video |
| --- | --- | --- |
| A simple way to understand Association Rule from the Customer Basket Analysis Use Case | 🔗 | |
| Different Metrics to Evaluate Binary Classification Models and Some Strategies to Choose the Right One | 🔗 | |
| Introduction to Mito: Spreadsheet for Data Scientists That Also Generates Python Codes | 🔗 | |
| When R Meets SQL to Query Dataframes | 🔗 | |
| 5 Essential Tools to Start a Career in Data Science and Data Analytics | 🔗 | |
| 4 Types of SQL JOIN Every Data Scientist Should Know: Visual Representation | 🔗 | |
| Data Preprocessing Using Pipeline in Pandas | 🔗 | 🔗 |
| The guide to choosing the right database for my project: MongoDB vs. MySQL | 🔗 | |
| How to Run SQL Queries On Your Pandas DataFrames With Python | 🔗 | 🔗 |
| Algorithmic Bias in Healthcare and Some Strategies for Mitigating It | 🔗 | |
| Which One of These 2 Open-Source Libraries Is Better for Processing Gigabytes of Data? | 🔗 | 🔗 |
| ChatGPT for Data Scientists, Data Analysts, and Programmers | 🔗 | 🔗 |
| Tableau Data Blending Tutorial — A Step-By-Step Guide For Beginners | 🔗 | |
| Fundamentals of Statistics All Data Scientists & Analysts Should Know — With Code — Part 1 | 🔗 | 🔗 |
| Everything You Need to Know About Heatmap — Tutorial With PowerBI | 🔗 | |
| Top Techniques to Handle Missing Values Every Data Scientist Should Know | 🔗 | |
| An Introduction to Hierarchical Clustering in Python | 🔗 | |
| Multiple Linear Regression in R: Tutorial With Examples | 🔗 | |
| NoSQL Databases: What Every Data Scientist Needs to Know | 🔗 | |

Machine Learning

| Title | Article Link | Video |
| --- | --- | --- |
| Transfer Learning: Understand the Big Picture & Make the Right Choices for Your Use Case | 🔗 | |
| Overview Of 4 Model Validation Approaches to Mitigate Overfitting Problem | 🔗 | |
| eXplainable AI (XAI): LIME & SHAP, Two Great Candidates to Help You Explain Your Machine Learning Models | 🔗 | |
| Using Gradio To Create Apps For Your Machine Learning Models | 🔗 | 🔗 |
| How to Perform KMeans Clustering Using Python | 🔗 | 🔗 |
| Classification in Machine Learning: An Introduction | 🔗 | |

MLOps

| Title | Article Link | Video |
| --- | --- | --- |
| Create An Awesome Streamlit App & Deploy it With Docker | 🔗 | |
| Machine Learning models monitoring made easy with Mlfow, a concrete use case with Python API | 🔗 | |
| When Your Machine Learning model teams up with Django REST API, A successful deployment into production | 🔗 | |
| NLP MLops Project With DagsHub — Multi-Language Sentiment Classification Using Transformers — Part 1 | 🔗 | |
| NLP MLops Project With DagsHub — Deploy Your Streamlit App On AWS EC2 Instance — Part 2 | 🔗 | |
| Step-by-step Approach to Build Your Machine Learning API Using Fast API | 🔗 | |
| Data And Model Versioning With DVC And Azure Blob Storage | 🔗 | |
| GitHub Actions for Machine Learning: Train, Test and Deploy Your ML Model on AWS EC2. | 🔗 | |
| CI/CD for Machine Learning Model Training with GitHub Actions | 🔗 | |
| Speed Up Your Model Training with DagsHub Direct Data Access on AWS | 🔗 | |
| Git Reset and Revert Tutorial for Beginners | 🔗 | |

Natural Language Processing

| Title | Article Link | Video |
| --- | --- | --- |
| Do You Want To Cluster Unlabeled Text Data? Try Out Topic Modeling | 🔗 | |
| Financial Text Classification With Deep Learning Using FinBERT | 🔗 | |
| Named Entity Recognition with Spacy and the Mighty roBERTa | 🔗 | 🔗 |
| Scientific Documents Similarity Search With Deep Learning Using Transformers (SciBERT) | 🔗 | |
| Meet BERTopic— BERT’s Cousin For Advanced Topic Modeling | 🔗 | 🔗 |
| Unsupervised Multilingual Text Classification With Zero-Shot Approach | 🔗 | |
| Semantic Keywords And Keyphrases Extraction With KeyBERT | 🔗 | |
| 4 NLP Libraries for Automatic Language Identification of Text Data In Python | 🔗 | |
| Data Augmentation in NLP Using Back Translation With MarianMT | 🔗 | 🔗 |
| Social Media Sentiment Analysis In Python With VADER — No Training Required! | 🔗 | 🔗 |
| Stemming, Lemmatization— Which One is Worth Going For? | 🔗 | |
| VADER Vs. TextBlob — Which One Is Better For Social Media Sentiment Analysis? | 🔗 | |
| Most Common Text Processing Tasks In Natural Language Processing | 🔗 | 🔗 |
| How to Perform Speech-to-Text and Translate Any Speech to English With OpenAI’s Whisper | 🔗 | 🔗 |
| Plagiarism Detection Using Transformers | 🔗 | 🔗 |
| Text-to-Image and Image-to-image search Using CLIP | 🔗 | |
| A Step-by-step Guide to Solving 4 Real-life Problems With Transformers and Hugging Face | 🔗 | 🔗 |
| Text data representation with one-hot encoding, Tf-Idf, Count Vectors, Co-occurrence Vectors and Word2Vec | 🔗 | |
| Fine-Tuning GPT-3 Using the OpenAI API and Python | 🔗 | |

Large Language Models

| Title | Article Link | Video |
| --- | --- | --- |
| How I Built A Video Recommendation System Using Large Language Models and Vector Database | 🔗 | |
| A Framework For Efficiently Serving Your Large Language Models | 🔗 | 🔗 |
| How To Scrape a Web Page With ChatGPT — No Coding Required! | 🔗 | 🔗 |
| How to Chat With Any PDFs and Image Files Using Large Language Models — With Code | 🔗 | 🔗 |

Python

| Title | Article Link | Video |
| --- | --- | --- |
| 5 Python open-source tools to extract text and tabular data from PDF Files | 🔗 | |
| When Should You Consider Using Datatable Instead of Pandas to Process Large Data? | 🔗 | |
| Convert Any Type of Document to Text With Apache Tika Using Python API | 🔗 | |
| Collect Data From Reddit and Twitter— 600+ Million Monthly Active Users Platforms | 🔗 | |
| Knockknock — Probably The Best Python Library For Notifications | 🔗 | |
| Extract Text Written in Different Languages from Images with Python | 🔗 | |
| Introduction to Twint: Say Goodbye to Twitter Rate Limitations — Also No Need for A Twitter API! | 🔗 | |
| Avoid Using “pip freeze” — Use “pipreqs” instead | 🔗 | |
| Extract Tweets Without Limitations in a Few Lines of Code Using Python | 🔗 | 🔗 |
| Collect Data from Twitter: A Step-by-Step Implementation Using Tweepy | 🔗 | |
| How to Create a Virtual Environment and Use it on Jupyter Notebook | 🔗 | 🔗 |

Pandas & Python Tricks

| Title | Article Link | Video |
| --- | --- | --- |
| Pandas and Python Tips and Tricks for Data Science and Data Analysis | 🔗 | 🔗 |
| Pandas & Python Tricks for Data Science & Data Analysis — Part 2 | 🔗 | 🔗 |

Computer Vision

| Title | Article Link | Video |
| --- | --- | --- |
| Five Simple Image Data Augmentation Techniques to Mitigate Overfitting In Computer Vision | 🔗 | |
| YOLO Object Detection Explained | 🔗 | |
| How to Measure Model Performance in Computer Vision: A Comprehensive Guide | 🔗 | |

Zoumana Keita's Projects

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

basics-for-pandas

This notebook teaches the basics of the pandas library for data science

bert-as-service

Mapping a variable-length sentence to a fixed-length vector using BERT model

best-of-ml-python

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

chitchat

🤖💬📢🤖 chitchat is a question answering in context (QuAC) tool powered by GPT3.5
