Giter VIP home page Giter VIP logo

pystatsml's Introduction

Statistics and Machine Learning in Python

Structure

Courses are available in three formats:

  1. Jupyter notebooks.

  2. Python files using sphinx-gallery.

  3. ReStructuredText files.

All notebooks and python files are converted into rst format and then assembled together using sphinx.

Directories and main files:

introduction/
├── machine_learning.rst
└── python_ecosystem.rst

python_lang/                        # (Python language)
├── python_lang.py # (main file)
└── python_lang_solutions.py

scientific_python/
├── matplotlib.ipynb
├── scipy_numpy.py
├── scipy_numpy_solutions.py
├── scipy_pandas.py
└── scipy_pandas_solutions.py

statistics/                         # (Statistics)
├── stat_multiv.ipynb               # (multivariate statistics)
├── stat_univ.ipynb                 # (univariate statistics)
├── stat_univ_solutions.ipynb
├── stat_univ_lab01_brain-volume.py # (lab)
├── stat_univ_solutions.ipynb
└── time_series.ipynb

machine_learning/                   # (Machine learning)
├── clustering.ipynb
├── decomposition.ipynb
├── decomposition_solutions.ipynb
├── linear_classification.ipynb
├── linear_regression.ipynb
├── non_linear_prediction.ipynb
├── resampling.ipynb
├── resampling_solution.py
└── sklearn.ipynb

optimization/
├── optim_gradient_descent.ipynb
└── optim_gradient_descent_lab.ipynb

deep_learning/
├── dl_backprop_numpy-pytorch-sklearn.ipynb
├── dl_cnn_cifar10_pytorch.ipynb
├── dl_mlp_mnist_pytorch.ipynb
└── dl_transfer-learning_cifar10-ants-

Build

After pulling the repository execute Jupyter notebooks (outputs are expected to be removed before git submission).

make exe

Build the pdf file (requires LaTeX):

make pdf

Build the html files:

make html

Clean everything and strip output from Jupyter notebook (useless if you installed the nbstripout hook, ):

make clean

Dependencies

The easier is to install Anaconda at https://www.continuum.io with python >= 3. Anaconda provides

  • python 3
  • ipython
  • Jupyter
  • pandoc
  • LaTeX to generate pdf

Then install:

  1. sphinx-gallery
pip install sphinx-gallery
  1. nbstripout
conda install -c conda-forge nbstripout

Configure your git repository with nbstripout pre-commit hook for users who don't want to track output in VCS.

cd pystatsml
nbstripout --install
  1. Git LFS for datasets

a. Install Git LFS

git lfs install

b. select the file types you'd like Git LFS to manage

git lfs track "*.npz"
git lfs track "*.npy"
git lfs track "*.nii"
git lfs track "*.nii.gz"
git lfs track "*.csv"

b. Now make sure .gitattributes is tracked:

git add .gitattributes
  1. LaTeX (optional for pdf)

For Linux debian like:

sudo apt-get install latexmk texlive-latex-extra
  1. MS docx (optional)

docxbuilder

a. Install

pip install docxbuilder
pip install docxbuilder[math]

b. Build

make docx

pystatsml's People

Contributors

duchesnay avatar even4void avatar rth avatar tomlof avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

pystatsml's Issues

Clarifying the existence of the PDF book in the readme and repo structure

Hello,

I have found your book «Statistics and Machine Learning in Python» by chance during my online searches, and I find it very valuable, concise and complete.

I find however that it is difficult to find it in this repo, as it is not described in the readme.

Would it be possible to add a section in the readme describing the book, and the relation to the code in this repo (ie, what folders contain code snippets that are pertinent for the book?).

Also, would it be possible to clarify which version should be used? I see there are two versions, the latest one with one of the authors removed, but the version is the same (v0.2), are there any other differences?

Thank you very much for this great work!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.