Name: KG
Type: User
Company: Passionate about delivering data-driven business outcomes and making the world run with evidences that deliver results.
Bio: Data Scientist | MSc|MBA Health System Specialist| Machine Learning | Data Analytics | Statistical Analysis| Python, SQL, R, Tableau POWERBI, STATA, SAS
KG's Projects
Analysis of the Spotify Top 200 Daily Including Clustering Listening Behavior by Country
Official Microsoft GitHub Repository containing code samples for SQL Server
Source code for 'SQL Server T-SQL Recipes' by David Dye, Jason Brimhall, Timothy Roberts, Wayne Sheffield, Joseph Sack, and Jonathan Gennick
Official Stanford NLP Python Library for Many Human Languages
Main repository for STAT 545 @ University of British Columbia, a course in data wrangling, exploration, and analysis with R.
Lecture Slides and R Sessions for Trevor Hastie and Rob Tibshinari's "Statistical Learning" Stanford course
R & stats illustrations by @allison_horst
Python modules and IPython Notebooks, for the book "Introduction to Statistics With Python"
Streamlit — The fastest way to build custom ML tools
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Execute Python code on the fly and display results in Tableau visualizations
Where to find the schedule, slides and videos for 18F's monthly tech talks
telluric is a Python library to manage vector and raster geospatial data in an interactive and easy way
Code for Tensorflow Machine Learning Cookbook
Text summarization algorithm for the Capstone Project at Springboard code bootcamp
Essentials of Geographic Information Systems
The textbook Computational and Inferential Thinking: The Foundations of Data Science
A website displaying hundreds of charts made with Python
The Python code to reproduce the illustrations from The Hundred-Page Machine Learning Book.
Text and supporting code for Think Stats, 2nd Edition
Parallel Processing in R using a Thread Pool
Official repo for the #tidytuesday project
Easily install and load packages from the tidyverse
A Shiny app that let's us understand different methods for removing outliers in time series data
Web scraping, data cleaning, dimensionality reduction, clustering project using geospatial and Foursquare API data to segment different neighborhoods of Toronto based on venue type
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
Exploring the patterns created by reaction-diffusion equations
UBI analyses using Tax-Calculator, TaxData, and C-TAM
Projects done for Master in Data Science - UDD