Giter VIP home page Giter VIP logo

pybgmm's Introduction

PyBGMM: Bayesian inference for Gaussian mixture model

Overview

Bayesian inference for Gaussian mixture model to reduce over-clustering via the powered Chinese restaurant process (pCRP). We use collapsed Gibbs sampling for posterior inference.

Code Structure

|-- GMM # base class for Gaussian mixture model
    |---- IGMM  # base class for infinite Gaussian mixture model
        |------ CRPMM     ## traditional Chinese restaurant process (CRP) mixture model
        |------ PCRPMM    ## powered Chinese restaurant process (pCRP) mixture model

Documentation

What do we include:

  • Chinese restaurant process mixture model (CRPMM)

  • Powered Chinese restaurant process (pCRP) mixture model

Examples

Code Description
CRPMM 1d Chinese restaurant process mixture model for 1d data
CRPMM 2d Chinese restaurant process mixture model for 2d data
pCRPMM 1d powered Chinese restaurant process mixture model for 1d data
pCRPMM 2d powered Chinese restaurant process mixture model for 2d data

Dependencies

  1. See requirements.txt

Lincense

MIT

Citation

The repo is based on the following research articles:

  • Lu, Jun, Meng Li, and David Dunson. "Reducing over-clustering via the powered Chinese restaurant process." arXiv preprint arXiv:1802.05392 (2018).

References

  1. H. Kamper, A. Jansen, S. King, and S. Goldwater, "Unsupervised lexical clustering of speech segments using fixed-dimensional acoustic embeddings", in Proceedings of the IEEE Spoken Language Technology Workshop (SLT), 2014.
  2. Murphy, Kevin P. "Conjugate Bayesian analysis of the Gaussian distribution." def 1.2ฯƒ2 (2007): 16.
  3. Murphy, Kevin P. Machine learning: a probabilistic perspective. MIT press, 2012.
  4. Pedregosa, Fabian, et al. "Scikit-learn: Machine learning in Python." Journal of Machine Learning Research 12.Oct (2011): 2825-2830.
  5. Rasmussen, Carl Edward. "The infinite Gaussian mixture model." Advances in neural information processing systems. 2000.
  6. Tadesse, Mahlet G., Naijun Sha, and Marina Vannucci. "Bayesian variable selection in clustering high-dimensional data." Journal of the American Statistical Association 100.470 (2005): 602-617.

pybgmm's People

Contributors

junlulocky avatar xylimeng avatar

Watchers

James Cloos avatar paper2code - bot avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.