Giter VIP home page Giter VIP logo

gacluster's Introduction

GACluster -- Graph Agglomerative Clustering Library

Version 0.1.0

Table of Contents

Introduction

The GACluster open source library implements popular Graph Agglomerative Clustering algorithms.

GACluster is distributed under the BSD license (see the COPYING file).

Two major limits of previous GAC toolbox are 1) memory cost and 2) C++ MEX implementation. This new version only includes pure MATLAB code and is optimized for memory. E.g., [5] did not report the result of GDL [1] on 70000 samples of MNIST full dataset due to huge memory cost, and it is possible to run GDL with this library if you have 24G memory. Further optimization is possbile with less readability.

Citations

Please cite the following papers, if you find the code is helpful.

  @inproceedings{zhang2012graph,
    title={Graph degree linkage: Agglomerative clustering on a directed graph},
    author={Zhang, Wei and Wang, Xiaogang and Zhao, Deli and Tang, Xiaoou},
    booktitle={European Conference on Computer Vision},
    pages={428--441},
    year={2012}
  }

  @article{zhang2013agglomerative,
    title={Agglomerative clustering via maximum incremental path integral},
    author={Zhang, Wei and Zhao, Deli and Wang, Xiaogang},
    journal={Pattern Recognition},
    volume={46},
    number={11},
    pages={3056--3065},
    year={2013}
  }

Benchmark Results

GDL without deep representation achieves close performance to state-of-the-art deep clustering algorithms.

Clustering performances of different algorithms in terms of NMI/ACC.

DataSet MNIST MNIST-test USPS Fashion-MNIST
GDL [1] 0.910/0.964 0.864/0.933 0.860/0.922 0.660/0.627
[Deep Clustering]
DEC [9] -/0.843 -/- -/- -/-
JULE [3] 0.913/0.964 0.915/0.961 0.913/- -/-
DEPICT [4] 0.917/0.965 0.915/0.963 0.927/0.964 -/-
VaDE [10] -/0.945 -/- -/- -/-
DAC [11] 0.935/0.978 -/- -/- -/-
DBC [12] 0.917/0.964 -/- 0.724/0.743 -/-
ConvDEC-DA [7] 0.960/0.985 0.958/0.983 0.962/0.987 0.636/0.586
DDC-DA [13] 0.941/0.969 0.927/ 0.970 0.939/0.977 0.661/0.609
ClusterGAN [14] 0.921/0.964 -/- 0.931/0.970 -/-

Quick start with MATLAB

GACluster provides a number of demos for reproducing results on popular datasets.

Changes

  • Initial public release.

TODO

References

[1] W. Zhang, X. Wang, D. Zhao and X. Tang. Graph degree linkage: Agglomerative clustering on a directed graph. ECCV, 2012.

[2] W. Zhang, D. Zhao and X. Wang. Agglomerative clustering via maximum incremental path integral. Pattern Recognition, 46(11), pp.3056-3065, 2013.

[3] J. Yang, D. Parikh and D. Batra. Joint unsupervised learning of deep representations and image clusters. CVPR, 2016. Paper Code

[4] K.G. Dizaji, A. Herandi, C. Deng, W. Cai, H. Huang. Deep clustering via joint convolutional autoencoder embedding and relative entropy minimization. ICCV, 2017. Paper Code

[5] S.A. Shah and V. Koltun. Robust continuous clustering. PNAS, 114(37), pp.9814-9819, 2017. Paper Code

[6] Shah, S.A. and Koltun, V., Deep Continuous Clustering. Paper Code

[7] X. Guo, E. Zhu, X. Liu and J. Yin. Deep embedded clustering with data augmentation. ACML, 2018. Paper Code

[8] X. Guo, L. Gao, X. Liu and J. Yin. Improved deep embedded clustering with local structure preservation. IJCAI, 2017. Paper Code

[9] J. Xie, R. Girshick and A. Farhadi. Unsupervised deep embedding for clustering analysis. ICML, 2016.

[10] Z. Jiang, Y. Zheng, H. Tan, B. Tang and H. Zhou. Variational deep embedding: An unsupervised and generative approach to clustering. IJCAI, 2017. Paper Code

[11] J. Chang, L. Wang, G. Meng, S. Xiang and C. Pan. Deep adaptive image clustering. ICCV, 2017. Paper Code

[12] F. Li, H. Qiao and B. Zhang. Discriminatively boosted image clustering with fully convolutional auto-encoders. Pattern Recognition, 83, 2017. Paper

[13] Y. Ren, N. Wang, M. Li and Z. Xu. Deep Density-based Image Clustering. arXiv preprint arXiv:1812.04287, 2018. Paper

[14] K.G. Dizaji, X. Wang, C. Deng and H. Huang. Balanced Self-Paced Learning for Generative Adversarial Clustering Network. CVPR, 2019. Paper

[15] W. Hu, T. Miyato, S. Tokui, E. Matsumoto and M. Sugiyama. Learning Discrete Representations via Information Maximizing Self-Augmented Training. ICML, 2017. Paper Code

[16] U. Shaham, K. Stanton, H. Li, B. Nadler, R. Basri and Y. Kluger. SpectralNet: Spectral Clustering Using Deep Neural Networks. ICLR, 2018. Paper Code

[17] X. Guo, X. Liu, E. Zhu, X. Zhu, M. Li,X. Xu and J. Yin. Adaptive Self-paced Deep Clustering with Data Augmentation. IEEE TKDE, 2019. Paper Code

gacluster's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.