Giter VIP home page Giter VIP logo

la-cookiecutter's Introduction

Cookiecutter for Streamlit ML Conda projects

This is a template cookiecutter project for bootstrapping your work on ML data science projects. It contains :

  • a directory structure for sorting your notebooks, data, models, figures, tasks and source code to reuse in notebooks
  • a conda environment file with the basic python libraries and some extras :
    • numpy / pandas / scikit-learn / seaborn / statsmodels / plotly / jupyterlab classic Data Science stack
    • streamlit for building and run top to bottom data apps
    • pyspark and h2o for distributed processing
    • pandas-profiling for generating HTML reports on pandas dataframes
    • missingno for missing data analysis
    • invoke as a replacement to Makefile for managing project tasks
    • nbdime for diffing and merging notebooks
    • kaggle-api a CLI for interacting with Kaggle API
    • keras and lightgbm for prediction
    • path.py for browsing files in Python

Prerequisites

  • Anaconda >=5.x
  • Cookiecutter >= 1.4.0: This can be installed with pip by or conda depending on how you manage your Python packages:
$ pip install cookiecutter

or

$ conda config --add channels conda-forge
$ conda install cookiecutter

Generate a new project

In a folder where you want your project generated : cookiecutter https://github.com/flight505/la-cookiecutter

You can also clone the project in <path/to/template>, and from the folder where you want to generate your project, launch cookiecutter <path/to/template>

It will ask for the following values :

full_name
email
project_name
project_slug
project_short_description
version

(project_slug is the name you will use to install your package elsewere using pip install your_package_name)

Complete the values for your project and voilà ! Then follow the README inside your new project for further installation.

Contributing guide

All contributions, bug reports, bug fixes, documentation improvements, enhancements and ideas are welcome.

Credits

This is a Fork from andfanilo, which was modified for Streamlit applications

This project is heavily influenced by drivendata's Data Science cookiecutter.

Other links that helped shape this cookiecutter :

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.