
Tensor4ML


Made by Xinyu Chen • 🌐 https://xinychen.github.io

Tensor Decomposition for Machine Learning (Tensor4ML). This article summarizes the development of tensor decomposition models and algorithms in the literature, offering comprehensive reviews and tutorials on topics ranging from matrix and tensor computations to tensor decomposition techniques across a wide range of scientific areas and applications. Since tensor decomposition is often formulated as an optimization problem, this article also provides a preliminary introduction to some classical methods for solving convex and nonconvex optimization problems. This work aims to offer valuable insights to both the machine learning and data science communities by drawing strong connections with the key concepts of tensor decomposition. To ensure reproducibility and sustainability, we provide resources such as datasets and Python implementations, primarily built on Python's NumPy library.


In a hurry? Please check out our contents as follows.

  • Introduction
    • Tensor decomposition in the past 10-100 years
    • Tensor decomposition in the past decade
  • What Are Tensors?
    • Tensors in algebra & machine learning
    • Tensors in data science
  • Foundation of Tensor Computations
    • Norms
    • Matrix trace
    • Kronecker product
    • Khatri-Rao product
    • Modal product
    • Outer product
    • Derivatives
  • Foundation of Optimization
    • Gradient descent methods
    • Power iteration
    • Alternating minimization
    • Alternating direction method of multipliers
    • Greedy methods for ℓ0-norm minimization
    • Bayesian optimization

Our Research


We conduct extensive experiments on some real-world data sets:

  • Middle-scale data sets:

    • PeMS (P) registers traffic speed time series from 228 sensors over 44 days with 288 time points per day (i.e., 5-min frequency). The tensor size is 228 x 288 x 44.
    • Guangzhou (G) contains traffic speed time series from 214 road segments in Guangzhou, China, over 61 days with 144 time points per day (i.e., 10-min frequency). The tensor size is 214 x 144 x 61.
    • Electricity (E) records hourly electricity consumption transactions of 370 clients from 2011 to 2014. We use a subset of the last five weeks of 321 clients in our experiments. The tensor size is 321 x 24 x 35.
  • Large-scale PeMS traffic speed data set registers traffic speed time series from 11160 sensors over 4/8/12 weeks (for PeMS-4W/PeMS-8W/PeMS-12W) with 288 time points per day (i.e., 5-min frequency) in California, USA. You can download this data set and place it in the folder ../datasets.

    • Data size:
      • PeMS-4W: 11160 x 288 x 28 (contains about 90 million observations).
      • PeMS-8W: 11160 x 288 x 56 (contains about 180 million observations).
    • Data path example: ../datasets/California-data-set/pems-4w.csv.
    • Open data in Python with Pandas:
import pandas as pd

data = pd.read_csv('../datasets/California-data-set/pems-4w.csv', header = None)
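
The loaded matrix can then be folded into the third-order tensor described above. The sketch below uses small synthetic sizes (the real PeMS-4W sizes would be 11160 x 288 x 28); the day-major ordering of columns within each row is an assumption and should be checked against the actual CSV layout:

```python
import numpy as np

# Fold a (sensor, day * time-of-day) matrix into a
# (sensor, time-of-day, day) tensor. Assumes each row stores one
# sensor's series day by day; verify this against the real CSV.
n_sensor, n_time, n_day = 4, 6, 2
mat = np.arange(n_sensor * n_time * n_day).reshape(n_sensor, n_day * n_time)
tensor = mat.reshape(n_sensor, n_day, n_time).transpose(0, 2, 1)
print(tensor.shape)  # (4, 6, 2)
```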

mats

mats is a project in the tensor learning repository, and it aims to develop machine learning models for multivariate time series forecasting. In this project, we propose the following low-rank tensor learning models:

We write Python code in Jupyter notebooks and place them in the folder ../mats. To test our code, run the notebooks in that folder. Each notebook is independent of the others, so you can run any notebook directly.

The baseline models include:

We write Python code in Jupyter notebooks and place them in the folder ../baselines. To test our code, run the notebooks in that folder. Notebooks that reproduce algorithms on large-scale data sets are marked as Large-Scale-xx.

📖 Reproducing Literature in Python


We reproduce some tensor learning experiments from the previous literature.

| Year | Title | PDF | Authors' Code | Our Code | Status |
|------|-------|-----|---------------|----------|--------|
| 2015 | Accelerated Online Low-Rank Tensor Learning for Multivariate Spatio-Temporal Streams | ICML 2015 | Matlab code | Python code | Under development |
| 2016 | Scalable and Sound Low-Rank Tensor Learning | AISTATS 2016 | - | xx | Under development |

📖 Tutorial


We summarize some preliminaries for a better understanding of tensor learning. They are given as tutorials below.

  • Foundations of Python Numpy Programming

  • Foundations of Tensor Computations

    • Kronecker product
  • Singular Value Decomposition (SVD)
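
As a companion to the SVD tutorial, here is a minimal NumPy sketch of the economy-size SVD and the best rank-r approximation in the Frobenius norm (the Eckart-Young theorem). It is an illustration with random data, not code from the tutorial itself:

```python
import numpy as np

np.random.seed(0)
X = np.random.randn(8, 5)

# Economy-size SVD: u is (8, 5), s holds 5 singular values, vt is (5, 5).
u, s, vt = np.linalg.svd(X, full_matrices=False)

# Reconstruction from all singular values is exact (up to rounding).
assert np.allclose(X, u @ np.diag(s) @ vt)

# Best rank-2 approximation: keep the two largest singular values.
r = 2
X_r = u[:, :r] @ np.diag(s[:r]) @ vt[:r, :]
print(np.linalg.norm(X - X_r))  # approximation error
```

The approximation error equals the root sum of squares of the discarded singular values, which is why truncated SVD underlies many low-rank tensor methods.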

If you find these codes useful, please star (★) this repository.

Helpful Material


We believe these materials will be a valuable and useful resource for readers in further study and advanced research.

  • Vladimir Britanak, Patrick C. Yip, K.R. Rao (2006). Discrete Cosine and Sine Transforms: General Properties, Fast Algorithms and Integer Approximations. Academic Press. [About the book]

  • Ruye Wang (2010). Introduction to Orthogonal Transforms with Applications in Data Processing and Analysis. Cambridge University Press. [PDF]

  • J. Nathan Kutz, Steven L. Brunton, Bingni Brunton, Joshua L. Proctor (2016). Dynamic Mode Decomposition: Data-Driven Modeling of Complex Systems. SIAM. [About the book]

  • Yimin Wei, Weiyang Ding (2016). Theory and Computation of Tensors: Multi-Dimensional Arrays. Academic Press.

  • Steven L. Brunton, J. Nathan Kutz (2019). Data-Driven Science and Engineering: Machine Learning, Dynamical Systems, and Control. Cambridge University Press. [PDF] [data & code]

Quick Run


  • If you want to run the code, please
    • download (or clone) this repository,
    • open the .ipynb file using Jupyter notebook,
    • and run the code.

Citing


This repository accompanies the following paper; please cite our paper if it helps your research.

Acknowledgements


This research is supported by the Institute for Data Valorization (IVADO).

License


This work is released under the MIT license.


Issues

also 404

Remote HTTP 404: part-03/chapter-01.ipynb not found among 138 files

a bug in LRTC-TNN.ipynb

In the svt_tnn code:

def svt_tnn(mat, alpha, rho, theta):
    tau = alpha / rho
    [m, n] = mat.shape
    if 2 * m < n:
        u, s, v = np.linalg.svd(mat @ mat.T, full_matrices = 0)
        s = np.sqrt(s)
        idx = np.sum(s > tau)
        mid = np.zeros(idx)
        mid[:theta] = 1
        mid[theta:idx] = (s[theta:idx] - tau) / s[theta:idx]
        return (u[:, :idx] @ np.diag(mid)) @ (u[:, :idx].T @ mat)
    elif m > 2 * n:
        return svt_tnn(mat.T, tau, theta).T # bug: this recursive call passes only three arguments, but svt_tnn requires four
    u, s, v = np.linalg.svd(mat, full_matrices = 0)
    idx = np.sum(s > tau)
    vec = s[:idx].copy()
    vec[theta:idx] = s[theta:idx] - tau
    return u[:, :idx] @ np.diag(vec) @ v[:idx, :]

The error shows:

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
Cell In[18], line 7
      5 epsilon = 1e-4
      6 maxiter = 200
----> 7 x = LRTC(dense_tensor, sparse_tensor, alpha, rho, theta, epsilon, maxiter)
      8 # end = time.time()
      9 # print('Running time: %d seconds'%(end - start))

Cell In[8], line 17, in LRTC(dense_tensor, sparse_tensor, alpha, rho, theta, epsilon, maxiter)
     15 rho = min(rho * 1.05, 1e5)
     16 for k in range(len(dim)):
---> 17     X[k] = mat2ten(svt_tnn(ten2mat(Z - T[k] / rho, k), alpha[k], rho, int(np.ceil(theta * dim[k]))), dim, k)
     18 Z[pos_missing] = np.mean(X + T / rho, axis = 0)[pos_missing]
     19 T = T + rho * (X - np.broadcast_to(Z, np.insert(dim, 0, len(dim))))

Cell In[6], line 13, in svt_tnn(mat, alpha, rho, theta)
     11     return (u[:, :idx] @ np.diag(mid)) @ (u[:, :idx].T @ mat)
     12 elif m > 2 * n:
---> 13     return svt_tnn(mat.T, tau, theta).T
     14 u, s, v = np.linalg.svd(mat, full_matrices = 0)
     15 idx = np.sum(s > tau)

TypeError: svt_tnn() missing 1 required positional argument: 'theta'
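
A minimal fix, sketched below and not yet verified against the full notebook, is to forward all four parameters in the recursive branch so the call matches the signature svt_tnn(mat, alpha, rho, theta):

```python
import numpy as np

def svt_tnn(mat, alpha, rho, theta):
    """Singular value thresholding for the truncated nuclear norm.

    Same logic as the notebook version; only the recursive branch is
    changed to pass alpha and rho instead of the precomputed tau.
    """
    tau = alpha / rho
    m, n = mat.shape
    if 2 * m < n:
        # Work with the smaller Gram matrix when mat is very wide.
        u, s, _ = np.linalg.svd(mat @ mat.T, full_matrices=False)
        s = np.sqrt(s)
        idx = np.sum(s > tau)
        mid = np.zeros(idx)
        mid[:theta] = 1
        mid[theta:idx] = (s[theta:idx] - tau) / s[theta:idx]
        return (u[:, :idx] @ np.diag(mid)) @ (u[:, :idx].T @ mat)
    elif m > 2 * n:
        # Fix: forward all four arguments to match the signature.
        return svt_tnn(mat.T, alpha, rho, theta).T
    u, s, v = np.linalg.svd(mat, full_matrices=False)
    idx = np.sum(s > tau)
    vec = s[:idx].copy()
    vec[theta:idx] = s[theta:idx] - tau
    return u[:, :idx] @ np.diag(vec) @ v[:idx, :]
```

With this change, the tall-matrix branch (m > 2n) that previously raised the TypeError runs through the wide-matrix branch on the transpose and returns a matrix of the original shape.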

404

part1 is nowhere to be found
