learndatascience's Introduction

Who

What

  • A collection of Data Science Learning materials in the form of IPython Notebooks.
  • Associated data sets.

The initial beta release covers four major topics:

  • Linear Regression
  • Logistic Regression
  • Random Forests
  • K-Means Clustering

Each of the above has at least three IPython Notebooks covering:

  • Overview (an exposition of the technique for the math-wary)
  • Data Exploration (the nuts and bolts of real world data wrangling)
  • Analysis (using the technique to get results)

One or more of these topics may have supplementary material. Each topic also has worksheets that contain mostly the code sections, so you can explore the code iteratively.

Three openly available data sets are used.

Why

There is a need for open content that raises the level of awareness of, and training in, the basics of the Data Science field (circa early 2013).

IPython Notebook provides an appropriate platform for rapid iterative exploration and learning.

When

Starting in 2013 and intended to extend for a long while.

Where

Today GitHub, tomorrow the world. There is also a Google Group, "learnds".

How

Learn Data Science is based on content developed by me (Nitin Borwankar) for the Open Data Science Training project, http://opendst.org. Most of the content (circa July 2013) is copyright (c) Alpine Data Labs, as per the license at opendst.org, and is freely available. Extensions to that content made in this project are released under the same license; see the LICENSE.txt file.

The IPython Notebooks are currently at beta.

Background

If you are unfamiliar with IPython Notebook, you can start with http://ipython.org/notebook

Installation

  • Prerequisites
    One of the following distributions is needed. Even if you already have Python installed, it is important to install one of these distributions and to have its binary on your path. These distributions come packaged with all the supplementary libraries needed, which have historically been difficult to install separately.

    • EPD Free (Enthought Python Distribution) from http://enthought.com
    • Anaconda Python from http://continuum.io

    Development has been done on v1.5 of the Anaconda distribution, but EPD Free should work just as well.
  • The following steps assume you have installed one of the distributions mentioned in the prerequisites.

  • From a zip or tar file

    • download the zip or tar file
    • unpack the file to a directory called learnds
    • cd to the 'notebooks' subdirectory
    • start IPython Notebook with 'ipython notebook --pylab=inline'
  • From the git repo

    • clone the repo
    • cd to 'notebooks'
    • start IPython Notebook with 'ipython notebook --pylab=inline'
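
Before starting the notebook server, it can help to confirm that the Python on your path is the one from the distribution and that the supplementary libraries are visible to it. The following is a minimal sketch, not part of this project. It assumes Python 3.4 or later, and the package list is a guess, since this README does not name the libraries the notebooks depend on. Adjust `ASSUMED_PACKAGES` to match what the notebooks actually import.

```python
import importlib.util
import sys

# Assumption: this README does not list the required libraries, so these
# names are illustrative guesses at what a data science notebook might use.
ASSUMED_PACKAGES = ["IPython", "numpy", "pandas", "matplotlib", "sklearn"]


def check_packages(names):
    """Map each top-level package name to True if the interpreter can locate it."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # Shows which interpreter is first on the path; it should live inside
    # the EPD Free or Anaconda installation directory.
    print("Python interpreter in use:", sys.executable)
    for name, found in check_packages(ASSUMED_PACKAGES).items():
        print("{:12s} {}".format(name, "ok" if found else "MISSING"))
```

If a package is reported as MISSING, the most likely cause is that a different Python installation comes earlier on your path than the distribution's binary.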

Contributors

nborwankar, mrshu, jirkalhotka, tebeka, dmirylenka
