Giter VIP home page Giter VIP logo

tomasbeuzen / python-programming-for-data-science Goto Github PK

View Code? Open in Web Editor NEW
72.0 4.0 75.0 11.71 MB

Content from the University of British Columbia's Master of Data Science course DSCI 511.

Home Page: https://www.tomasbeuzen.com/python-programming-for-data-science/

License: Creative Commons Zero v1.0 Universal

Jupyter Notebook 99.88% Python 0.12%
data-science teaching data-manipulation pandas numpy python programming

python-programming-for-data-science's Introduction

Python Programming for Data Science

By Tomas Beuzen 🚀

Welcome to Python Programming for Data Science! With this website I aim to provide an introduction to everything you need to know to start using Python for data science. We'll cover topics such as data structures, basic programming, code testing and documentation, and using libraries like NumPy and Pandas for data exploration and analysis.

If you're interested in learning more about Python packages, check out my and Tiffany Timber's book Python Packages. Or, if you'd like to learn more about using Python and PyTorch for deep learning, you can check out my other online material Deep Learning with PyToch.

The content of this site is adapted from material I used to teach the 2020/2021 offering of the course "DSCI 511 Python Programming for Data Science" for the University of British Columbia's Master of Data Science Program. That material has built upon previous course material developed by Patrick Walls and Mike Gelbart.

Key Learning Outcomes

These are the key learning outcomes for this material:

  1. Translate fundamental programming concepts such as loops, conditionals, etc into Python code.
  2. Understand the key data structures in Python.
  3. Understand how to write functions in Python and assess if they are correct via unit testing.
  4. Know when and how to abstract code (e.g., into functions, or classes) to make it more modular and robust.
  5. Produce human-readable code that incorporates best practices of programming, documentation, and coding style.
  6. Use NumPy perform common data wrangling and computational tasks in Python.
  7. Use Pandas to create and manipulate data structures like Series and DataFrames.
  8. Wrangle different types of data in Pandas including numeric data, strings, and datetimes.

Getting Started

The material on this site is written in Jupyter notebooks and rendered using Jupyter Book to make it easily accessible. However, if you wish to run these notebooks on your local machine, you can do the following:

  1. Clone the GitHub repository:
    git clone https://github.com/TomasBeuzen/python-programming-for-data-science.git
  2. Install the conda environment by typing the following in your terminal:
    conda env create -f py4ds.yaml
  3. Open the course in JupyterLab by typing the following in your terminal:
    cd python-programming-for-data-science
    jupyterlab

If you're not comfortable with git, GitHub or conda, feel free to just read through the material on this website - you're not missing out on anything!

python-programming-for-data-science's People

Contributors

tomasbeuzen avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

python-programming-for-data-science's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.