
wyattowalsh / data-science-notes


Open-source project hosted at https://makeuseofdata.com to crowdsource a robust collection of notes related to data science (math, visualization, modeling, etc.).

Home Page: https://makeuseofdata.com

License: GNU General Public License v3.0

Languages: JavaScript 99.46%, CSS 0.54%, HTML 0.01%, Shell 0.01%
Topics: data-science, probability, calculus, statistics, linear-algebra, simulation, modeling, visualization, regression, classification

data-science-notes's Introduction


I grew up in the Eastern Sierras in California, spent a few years in Connecticut at boarding school (Go Bearcats!), then came back to the West Coast for my undergraduate work in Industrial Engineering and Operations Research at the University of California, Berkeley (Go Bears! 🐻).

I usually like to work with data science, machine learning, optimization/mathematical programming, or similar fields; however, I am boundless in my curiosity when it comes to computers, mathematics, and computer science, and I am happy to collaborate on projects across any of those disciplines.

When not behind the keyboard, you can catch me out mountain biking, playing a disc sport (Ultimate Frisbee or Disc Golf 🥏), messing around with music (production 🎼, DJing 🎧, or playing piano 🎹), or spending some sort of time with friends and loved ones.

Reach out! It would be great to hear from you, and maybe we could even develop some good personal or professional ties. 😊

My résumé, Twitter, LinkedIn, Medium, Kaggle, CodePen


Projects 📊

↓ Please see the pinned repositories at the bottom of the page for my past projects ↓


Technologies 👨‍💻 💾

Programming Languages

Python, R, Java, Matlab, SQL, JavaScript, TypeScript, HTML5, CSS3, Ruby

Data

Collection

Web Scraping, Scrapy, APIs, ETL, ELT

Storage

Neo4j, SQLite, MySQL, PostgreSQL

Visualization

Matplotlib, Plotly, D3.js, Tableau

Processing

Apache Spark, Apache Airflow, Multiprocessing, Pandas, NumPy, Apache Arrow, Apache Kafka

Modeling

scikit-learn, PyCaret, Keras, TensorFlow, PyTorch, SciPy, OpenCV, OpenAI Gym

Deployment

MLOps & Data Orchestration

Kedro, dagster

APIs

Flask, FastAPI

Cloud Technologies

Amazon AWS, Google Cloud, Microsoft Azure

CI/CD Platforms

GitHub Actions, Travis CI

Version Control Platforms

GitHub, GitLab, DagsHub

Workflow Tools

Git, Linux, Shell Script, Bash, Zsh, Anaconda, Poetry, Jupyter, Docker, Kubernetes, Pytest, Python Hypothesis, PyEnv, Visual Studio Code, Sublime Text, GNU Make

Optimization Tools

AMPL, IBM CPLEX, Gurobi, PuLP, Metaheuristics, Reinforcement Learning

Web Development

React, Next.js, Gatsby, Tailwind CSS, Material User Interface (MUI), Chakra UI, GraphQL, ESLint, npm, Yarn, Jest, Cypress, Storybook, Vercel, Google Search Console, Google Analytics

Miscellaneous

Jira, Confluence, Sphinx Docs, Figma, Canva, Notion, Asana, Obsidian, Microsoft Office, Microsoft Excel, Mermaid.js, Google Colab, Replit, Markdown, LaTeX, Overleaf, Raspberry Pi, Arduino, Autodesk, Boxy SVG, isort, black, Pylint, autoflake, Pylama, Homebrew


My Wakatime Stats 👨‍💻 🗓️

🐱 My GitHub Data

📦 4.6 MB Used in GitHub's Storage

💼 Opted to Hire

📜 35 Public Repositories

🔑 34 Private Repositories

I'm a Night 🦉

🌞 Morning      402 commits     9.27%
🌆 Daytime      1463 commits    33.74%
🌃 Evening      1532 commits    35.33%
🌙 Night        939 commits     21.66%

📅 I'm Most Productive on Tuesday

Monday          447 commits     10.31%
Tuesday         874 commits     20.16%
Wednesday       686 commits     15.82%
Thursday        743 commits     17.14%
Friday          445 commits     10.26%
Saturday        505 commits     11.65%
Sunday          636 commits     14.67%

📊 This Week I Spent My Time On

💬 Programming Languages:
Python              6 hrs 40 mins    51.20%
Git Config          2 hrs 38 mins    20.27%
Markdown            2 hrs 22 mins    18.24%
sh                  34 mins          4.47%
TypeScript          16 mins          2.10%

💻 Operating System:
Mac                 13 hrs 2 mins    100.00%

I Mostly Code in Jupyter Notebook

Jupyter Notebook    13 repos         43.33%
Python              6 repos          20.00%
Makefile            3 repos          10.00%
TypeScript          2 repos          6.67%
Swift               1 repo           3.33%


Last Updated on 25/04/2024 18:36:36 UTC



My GitHub Stats :octocat: 📈

My Starred Repos ✨ and Associated Word Clouds ☁️

➔ Check out projects I've starred by language or by topic ✨

Starred Projects Word Clouds

By Language

By Topic


Stargazers Over Time ✨ 🔭


Past Projects

data-science-notes's People

Contributors

dpailet7, pratik-99, wyattowalsh


data-science-notes's Issues

Complete Overviews

Overviews are similar to landing pages, but for larger sections.

Overviews

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Calculus Landing Page

In general, Data Science Notes sections are broken into landing pages and basics pages (with more specific pages to come as folks add them).

Landing pages are a good place to host general information about the technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/mathematical_topics/calculus/index.html

Add Content to Exploratory Data Analysis Basics

In general, Data Science Notes sections are broken into landing pages, basics pages, and specific pages (with more specific pages to come as folks add them).

Basics pages are a good place to give a general summary of a certain topic like how to use a given technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/eda/basics.html

Add Content to Data Ingestion Basics

In general, Data Science Notes sections are broken into landing pages, basics pages, and specific pages (with more specific pages to come as folks add them).

Basics pages are a good place to give a general summary of a certain topic like how to use a given technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/data_ingestion/basics.html

Add Content to Modeling Basics

In general, Data Science Notes sections are broken into landing pages, basics pages, and specific pages (with more specific pages to come as folks add them).

Basics pages are a good place to give a general summary of a certain topic like how to use a given technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/modeling/basics.html

Programming Overview

In general, Data Science Notes sections are broken into landing pages and basics pages (with more specific pages to come as folks add them).

Landing pages are a good place to host general information about the technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/programming/index.html

Add Content to Introduction Section

The introduction section is a good place to talk about Data Science in general and its usefulness.

This could take the form of describing its societal impact, describing the path one takes to become a Data Scientist, or perhaps even some sort of metaphorical short story.

This is definitely a section where creativity could be cool!

Add Content to Simulation Landing Page

In general, Data Science Notes sections are broken into landing pages and basics pages (with more specific pages to come as folks add them).

Landing pages are a good place to host general information about the technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/simulation/index.html

Help Write Notes on Data Science

Project Context

There are certainly many topics that are good for a data scientist to know and practice! So far I've tried my best to research and develop an outline of these topics, broken up into the major areas of interest and practice. The list is by no means comprehensive, but it is continually developing and already covers most topics. The goal is to give the reader a solid index of topics, with easily accessible content on each topic's context, overview, and more specific details.

In the Jupyter Book framework (and as specified in the _toc.yml project file), this outline breaks down into parts, each containing chapters. A chapter can be a single page or a collection of pages, and each page within it can in turn be a single page or a collection of pages. The Jupyter Book documentation describes this structure in more detail.
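To make the nesting concrete, here is a minimal sketch in Python of how such a table of contents could be represented and walked. It is not the project's actual _toc.yml: the `toc` dictionary, the `walk` and `list_pages` helpers, and every file path are hypothetical, with paths guessed from the page URLs mentioned in the issues. The key names (`format`, `root`, `parts`, `caption`, `chapters`, `file`, `sections`) follow the jb-book layout as I understand it from the Jupyter Book documentation.

```python
# Hypothetical table of contents mirroring the jb-book layout of _toc.yml.
# File paths are guesses based on page URLs, not the project's real file.
toc = {
    "format": "jb-book",
    "root": "index",
    "parts": [
        {
            "caption": "Programming",
            "chapters": [
                {
                    "file": "programming/index",
                    # A chapter that is a collection of pages.
                    "sections": [
                        {"file": "programming/sql/index"},
                        {"file": "programming/sql/basics"},
                    ],
                },
            ],
        },
        {
            "caption": "Modeling",
            "chapters": [
                # Chapters that are single pages.
                {"file": "modeling/index"},
                {"file": "modeling/basics"},
            ],
        },
    ],
}


def walk(entries):
    """Yield every page path under a list of chapter or section entries."""
    for entry in entries:
        yield entry["file"]
        # A page may itself hold a collection of pages.
        yield from walk(entry.get("sections", []))


def list_pages(toc):
    """Return all page paths in reading order, starting with the root page."""
    pages = [toc["root"]]
    for part in toc.get("parts", []):
        pages.extend(walk(part.get("chapters", [])))
    return pages


if __name__ == "__main__":
    for page in list_pages(toc):
        print(page)
```

Running the script prints one page path per line, which can be a quick way to check that a new page has been placed in the intended part and chapter.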

Completing the Basics

It would be wonderful to add helpful content to each topic's section (we can always refine the content over time).

Sources can be cited by adding a reference to the references.bib file associated with the topic and using the {cite} command as described in the Jupyter Book documentation. I recommend using the BibMe service to build the correct citation to add to references.bib.

Furthermore, additional features can be utilized like multichoice panels, special content blocks, and syntax highlighting! Check out the Jupyter Book documentation for more info!

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Roadmap

Topics With Content

Topics In Progress

Full Topic Index


Add Content to Clustering Basics

In general, Data Science Notes sections are broken into landing pages, basics pages, and specific pages (with more specific pages to come as folks add them).

Basics pages are a good place to give a general summary of a certain topic like how to use a given technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/eda/clustering/basics.html

Add Content to Modeling Landing Page

In general, Data Science Notes sections are broken into landing pages and basics pages (with more specific pages to come as folks add them).

Landing pages are a good place to host general information about the technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/modeling/index.html

SQL

In general, Data Science Notes sections are broken into landing pages, basics pages, and specific pages (with more specific pages to come as folks add them).

Basics pages are a good place to give a general summary of a certain topic like how to use a given technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/programming/sql/basics.html

Add CI for Twitter Feed in README

It would be a cool feature to have the @MakeUseofData Twitter feed included within the README.

There are a few possible ways to include a feed. Right now, the best solution seems to me to be a GitHub Action that queries the feed and renders a few of the most recent tweets as an image that can be embedded in the README.

Complete Landing Pages

In general, Data Science Notes sections are broken into landing pages and basics pages (with more specific pages to come as folks add them).

Landing pages are a good place to host general information about the technology or technique. The main site landing page is a good example as it has a header image, introductory text, and helps to orient the reader.

Landing Pages

Here are the site's landing pages:

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Add Content to Visualization Basics

In general, Data Science Notes sections are broken into landing pages, basics pages, and specific pages (with more specific pages to come as folks add them).

Basics pages are a good place to give a general summary of a certain topic like how to use a given technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/eda/visualization/basics.html

Add Content to R Basics

In general, Data Science Notes sections are broken into landing pages, basics pages, and specific pages (with more specific pages to come as folks add them).

Basics pages are a good place to give a general summary of a certain topic like how to use a given technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/programming/r/basics.html

Add Content to Database Basics

In general, Data Science Notes sections are broken into landing pages, basics pages, and specific pages (with more specific pages to come as folks add them).

Basics pages are a good place to give a general summary of a certain topic like how to use a given technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/data_ingestion/databases/basics.html

Python Landing Page

In general, Data Science Notes sections are broken into landing pages and basics pages (with more specific pages to come as folks add them).

Landing pages are a good place to host general information about the technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/programming/languages/python/index.html

Add Content to Data Cleaning Basics

In general, Data Science Notes sections are broken into landing pages, basics pages, and specific pages (with more specific pages to come as folks add them).

Basics pages are a good place to give a general summary of a certain topic like how to use a given technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/data_cleaning/basics.html

Add Content to SQL Landing Page

In general, Data Science Notes sections are broken into landing pages and basics pages (with more specific pages to come as folks add them).

Landing pages are a good place to host general information about the technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/programming/sql/index.html

Add Content to R Landing Page

In general, Data Science Notes sections are broken into landing pages and basics pages (with more specific pages to come as folks add them).

Landing pages are a good place to host general information about the technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/programming/r/index.html

Ensure Google Analytics

The built-in functionality for Google Analytics seems to be broken and doesn't register on the GA side.

Add Content to Probability Basics Section

In general, Data Science Notes sections are broken into landing pages, basics pages, and specific pages (with more specific pages to come as folks add them).

Basics pages are a good place to give a general summary of a certain topic like how to use a given technology or technique.

I'm totally happy to help in whatever form I can, just let me know how I may be of assistance!

Link to page: https://makeuseofdata.com/mathematical_preliminaries/probability/basics.html
