YipengSong's Projects
Solutions for the exercises of the book "Advanced R" published in CRC Press (2015) by Hadley Wickham
the homework of the algorithm specialization of stanford university on coursera
Scripts to reproduce the results in "Principal component analysis of binary genomics data" paper.
The solutions for the exercises in C++ Primer, fifth edition
final project of Python Project: pillow, tesseract, and opencv
Software to accompany "Generalized simultaneous component analysis of binary and quantitative data" paper.
Software to accompany "Separating common (global and local) and distinct variation in multiple mixed types data sets" paper.
This repository is used for the version control of my PhD thesis "Fusing heterogeneous data sets". Figures are not included.
Tutorial on setting up the Python environment on Rstudio server
A standard framework for modelling Deep Learning Models for tabular data
Clustering objects on subsets of attributes
A real-time interactive web app based on data pipelines using streaming Twitter data, automated sentiment analysis, and MySQL&PostgreSQL database (Deployed on Heroku)
R package to accompany "Separating common (global and local) and distinct variation in multiple mixed types data sets" paper.
rewrite RpESCA package using C++ and Rcpp
Unsupervised Language Modeling at scale for robust sentiment classification