Giter VIP home page Giter VIP logo

kennethleungty / keyword-analysis-with-keybert-and-taipy Goto Github PK

View Code? Open in Web Editor NEW
12.0 2.0 3.0 338 KB

Keyword Extraction and Analysis Pipeline & Application with KeyBERT and Taipy

Home Page: https://towardsdatascience.com/arxiv-keyword-extraction-and-analysis-pipeline-with-keybert-and-taipy-2972e81d9fa4

Jupyter Notebook 41.22% Python 57.89% Dockerfile 0.89%
data-science keybert keyword keyword-analysis keyword-extraction machine-learning natural-language-processing nlp taipy taipy-core

keyword-analysis-with-keybert-and-taipy's Introduction

๐Ÿ‘‹ Hello, I'm Kenneth Leung

  • Thanks for popping by! As an avid learner, bold builder, curious explorer, and driven doer with a bias towards action, I enjoy seeking and solving meaningful problems with data and technology while having fun at the same time.
  • I welcome you to join me on a journey of data science discovery! Follow me on GitHub, Medium, and LinkedIn to stay updated with more engaging and practical content.
  • You can find my data science portfolio here, where every project and article was born out of inspiration, curiosity, and motivation. Feel free to connect for a chat (coffee or virtual) to discuss shared interests and topics!

Project Count

How to reach me

ย  ย  Buy Me A Coffeeย 

Portfolio Contents

  1. Computer Vision
  2. Database Management
  3. Data Extraction and Web Scraping
  4. Data Science Certification Guides
  5. Data Science Toolkit
  6. Data Science in the Real World
  7. Generative AI
  8. Insights from Data Science Talks
  9. Machine Learning
  10. MLOps
  11. Natural Language Processing
  12. Networks and Graphs
  13. Responsible AI
  14. Sports Analytics
  15. Visualization
  16. Web Development
  17. Web3 and Metaverse
  18. Writing for DataCamp
  19. Writing Tips

Projects with โญ are my personal favourites, so do check them out!


Computer Vision ๐Ÿ‘๏ธ

Title Article Repo
Classifying Images of Alcoholic Beverages with fast.ai v2 ๐Ÿ”— ๐Ÿ”—
Russian Car Plate Detection with OpenCV and TesseractOCR ๐Ÿ”— ๐Ÿ”—
Evaluate OCR Output Quality with Character Error Rate (CER) and Word Error Rate (WER) ๐Ÿ”— ๐Ÿ”—
Top Python libraries for Image Augmentation in Computerย Vision ๐Ÿ”— ๐Ÿ”—
โญ PyTorch Ignite Tutorial - Classifying Tiny ImageNet with EfficientNet ๐Ÿ”— ๐Ÿ”—
Practical Guide to Transfer Learning in TensorFlow for Multiclass Image Classification ๐Ÿ”— ๐Ÿ”—

Database Management ๐Ÿ—„๏ธ

Title Article Repo
โญ Definitive Guide to Creating a SQL Database on Cloud with AWS and Python ๐Ÿ”— ๐Ÿ”—
PyMySQLโ€Š-โ€ŠConnecting Python andย SQL for Data Science ๐Ÿ”— ๐Ÿ”—

Data Extraction and Web Scraping ๐Ÿงฐ

Title Article Repo
Using OneMap API to extract Singapore postal codes, coordinates and travel distance - ๐Ÿ”—
A Detailed Web Scraping Walkthrough Using Python and Selenium ๐Ÿ”— ๐Ÿ”—
โญ How to Web Scrape Wikipedia using LangChain Agents and Tools with OpenAI's LLMs and Functionย Calling ๐Ÿ”— ๐Ÿ”—

Data Science Certification Guides ๐Ÿ‘จโ€๐ŸŽ“

Title Article Repo
3 Steps to Get AWS Cloud Practitioner Certified in 2 Weeks ๐Ÿ”— ๐Ÿ”—
3 Steps to Get Tableau Desktop Certified in 2 Weeks ๐Ÿ”— -
โญ No-Frills Guide to Passing the AWS Certified Machine Learning Specialty Exam ๐Ÿ”— -

Data Science Toolkit ๐Ÿ› ๏ธ

Title Article Repo
Common Python codes for Data Wrangling - ๐Ÿ”—
Enhance your Python codeโ€™s readability with pycodestyle ๐Ÿ”— -
Free Resources for Generating Realistic Fake Data ๐Ÿ”— -
Most Starred and Forked GitHub Repos for Data Science and Python ๐Ÿ”— -
Most Starred and Forked GitHub Repos for Data Science and R ๐Ÿ”— -
Automatically Generate Machine Learning Code with Just a Few Clicks ๐Ÿ”— -
Read and Modify Image Metadata withย Python ๐Ÿ”— ๐Ÿ”—
Top Tips to Google Search Like a Seasoned Data Scientist ๐Ÿ”— -
How to Swap Day and Month of Incorrectly Formatted Excel Dates ๐Ÿ”— -

Data Science in the Real World ๐ŸŒ

Title Article Repo
Exploring Illegal Drugs in Singapore โ€” A Data Perspective ๐Ÿ”— ๐Ÿ”—
Pharmacokinetic Modeling of Drug Concentration Trajectories using Ordinary Differential Equations (ODE) and Global Optimization with Differential Evolution - ๐Ÿ”—
Healthcareโ€™s AI Future โ€” In Conversation with Andrew Ng and Fei-Fei Li ๐Ÿ”— -
Real-World Data Science Use Cases in the Insurance Industry ๐Ÿ”— -
โญ Failed-ML: Compilation of high-profile real-world examples of failed machine learning projects ๐Ÿ”— ๐Ÿ”—

Generative AI ๐Ÿค–

Title Article Repo
Generative AI Pharmacist - Macy ๐Ÿ”— ๐Ÿ”—
โญ ChatPod - Q&A over your Podcasts with Whisper, FAISS, and LangChain ๐Ÿ”— ๐Ÿ”—
โญ Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Documentย Q&A ๐Ÿ”— ๐Ÿ”—
Domain LLMs - Compilation of Customized LLMs for Specific Domains and Industries - ๐Ÿ”—
โญ Text-to-Audio Generation with Bark, Clearly Explained ๐Ÿ”— ๐Ÿ”—
Guide to ChatGPT's Advanced Settings โ€” Top P, Frequency Penalties, Temperature, and More ๐Ÿ”— -
Inside the Leaked System Prompts of GPT-4, Gemini 1.5, Claude 3, andย More ๐Ÿ”— -

Insights from Data Science Talks ๐Ÿ‘จโ€๐Ÿซ

Title Article Repo
Bridging AIโ€™s Proof-of-Concept to Production Gap โ€” Insights from Andrew Ng ๐Ÿ”— -

Machine Learning ๐ŸŽฐ

Title Article Repo
Exploring Condominium Rental Prices with Web Scraping and Exploratory Data Analysis ๐Ÿ”— ๐Ÿ”—
Using Ensemble Regressors to Predict Condominium Rental Prices ๐Ÿ”— ๐Ÿ”—
The Dying ReLU Problem, Clearly Explained ๐Ÿ”— -
Why Bootstrapping Actually Works ๐Ÿ”— -
โญ Assumptions of Logistic Regression, Clearly Explained ๐Ÿ”— ๐Ÿ”—
Data-Centric AI Competition - Tips and Tricks of a Top 5% Finish ๐Ÿ”— ๐Ÿ”—
Credit Card Fraud Detection with AutoXGB ๐Ÿ”— ๐Ÿ”—
โญ Micro, Macro & Weighted Averages of F1 Score, Clearly Explained ๐Ÿ”— -
Principal Component Regression - Clearly Explained and Implemented ๐Ÿ”— ๐Ÿ”—
โญ Feature Selection with Simulated Annealing in Python, Clearly Explained ๐Ÿ”— ๐Ÿ”—
Quick Primer on Types of Missing Data and Imputation Techniques ๐Ÿ”— -
Imputation of Missing Data in Tables withย DataWig ๐Ÿ”— ๐Ÿ”—

MLOps - Machine Learning Operations ๐Ÿ‘จโ€๐Ÿ”ง

Title Article Repo
Key Learning Points from MLOps Specializationโ€Šโ€”โ€ŠCourse 1/4 ๐Ÿ”— ๐Ÿ”—
Key Learning Points from MLOps Specializationโ€Šโ€”โ€ŠCourse 2/4 ๐Ÿ”— ๐Ÿ”—
Key Learning Points from MLOps Specializationโ€Šโ€”โ€ŠCourse 3/4 ๐Ÿ”— ๐Ÿ”—
Key Learning Points from MLOps Specializationโ€Šโ€”โ€ŠCourse 4/4 ๐Ÿ”— ๐Ÿ”—
โญ End-to-End AutoML Pipeline with H2O AutoML, MLflow, FastAPI, and Streamlit for Insurance Cross-Sell ๐Ÿ”— ๐Ÿ”—
โญ How to Dockerize Machine Learning Applications Built with H2O, MLflow, FastAPI, and Streamlit ๐Ÿ”— ๐Ÿ”—
โญ Building and Managing an Isolation Forest Anomaly Detection Pipeline with Kedro ๐Ÿ”— ๐Ÿ”—

Natural Language Processing ๐Ÿ“‘

Title Article Repo
COVID-19 Vaccine โ€” Whatโ€™s the Public Sentiment? ๐Ÿ”— ๐Ÿ”—
Keyword Extraction and Analysis Pipeline with KeyBERT and Taipy ๐Ÿ”— ๐Ÿ”—

Networks and Graphs ๐ŸŒ

Title Article Repo
โญ Network Analysis and Visualization of Drug-Drug Interactions ๐Ÿ”— ๐Ÿ”—
How to Deploy Interactive Pyvis Network Graphs on Streamlit ๐Ÿ”— ๐Ÿ”—
A No-Code Approach to Building Knowledge Graphs ๐Ÿ”— ๐Ÿ”—

Responsible AI ๐Ÿ‘ฎ

Title Article Repo
Responsible AI Masterclass (for Institute of Banking and Finance Singapore) ๐Ÿ”— ๐Ÿ”—

Sports Analytics โšฝ

Title Article Repo
โญ Analyzing English Premier League VAR Football Decisions ๐Ÿ”— ๐Ÿ”—
Combining Python and R for FIFA Football World Ranking Analysis ๐Ÿ”— ๐Ÿ”—

Visualization ๐Ÿ“ˆ

Title Article Repo
Uniform Singapore Energy Price and Demand Forecast Dashboard (with Plotly Dash) - ๐Ÿ”—
Visualizing Fortune 500 Companies in a Bar Chart Race ๐Ÿ”— ๐Ÿ”—
How to Easily Draw Neural Network Architecture Diagrams ๐Ÿ”— ๐Ÿ”—

Web Development ๐Ÿ–ฅ๏ธ

Title Article Repo
โญ Post COVID-19 Vaccination Wait-Time Tracker (with Python Flask) ๐Ÿ”— ๐Ÿ”—
From HTTP to HTTPS โ€” Easily Secure Flask Web Apps With Talisman ๐Ÿ”— -
โญ Food King Directory (in collaboration with Night Owl Cinematics) ๐Ÿ”— ๐Ÿ”—

Web3 and Metaverse ๐Ÿ‘จโ€๐Ÿ’ป

Title Article Repo
The Web3 / Metaverse Glossary โ€” A Keyword Guide to the Tech Future ๐Ÿ”— -

Writing for DataCamp โœ๏ธ

Title Article Repo
โญ What Mature Data Infrastructure Looks Like ๐Ÿ”— -
Democratizing Data in Government Agencies ๐Ÿ”— -
A Survey Into Data Governance Tools ๐Ÿ”— -
Scaling Data Science With Data Governance ๐Ÿ”— -
3 Reasons Why All Teams Should Learn SQL ๐Ÿ”— -
3 Reasons Why All Teams Should Learn R ๐Ÿ”— -
How Tableau Helps Your Organization Achieve Greater Data Insights ๐Ÿ”— -
How PowerBI Helps Your Organization Achieve Greater Data Insights ๐Ÿ”— -

Writing Tips ๐Ÿ“œ

Title Article Repo
Create a Clickable Table of Contents for Your Medium Posts ๐Ÿ”— -

keyword-analysis-with-keybert-and-taipy's People

Contributors

florianjacta avatar kennethleungty avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.