Hello!
I'm a current PhD student in Organizational Psychology at Columbia University with interests in data science and applied research.
Name: Gian Zlupko
Type: User
Company: Columbia University
Bio: PhD student in Org Psych at Columbia University | Data Science intern at Autodesk
Lab assignments and code repository for Group 1 in HUDM 5123 Linear Models & Experimental Design, Fall 2021
Code and assignments for HUDM 6122 Multivariate Analysis I. Methods covered include common data mining and dimensionality reduction techniques such as PCA, cluster analysis, factor analysis and multidimensional scaling.
This repository contains exercises and solutions from the book "An Introduction to Statistical Learning", written in Python.
This repo contains an R programming project that implements and compares decision tree models, including C5.0, C4.5, and CART.
Deep Learning for Time Series Classification
People analytics project in R that implements predictive modeling to identify employees most likely to leave a company. Discussion of implications for the sample firm and proposed interventions draws on best practices in organizational development.
Factor analytic techniques, including EFA and CFA, applied in R to analyze Martin & Doris's (2003) research on the development of a psychometric instrument measuring individual styles of humor.
Personal repo for projects associated with GR 5073 Machine Learning for the Social Sciences
Programming for Data Science
This project explores methods in Natural Language Processing (NLP) including text mining and pre-processing, sentiment analysis, and Latent Dirichlet Allocation (LDA) topic modeling.
This R notebook applies machine learning classification methods in the context of organizational network analysis. The goal was to test the predictive accuracy of various supervised learning models on company employee network data.
This repo contains Python and R code for Natural Language Processing (NLP) methods applied to open-entry text data collected in psychological and organizational research.
Work team repo for code and coursework associated with ORLA 6541 Applied Data Science in Organizations at Teachers College, Columbia University.
Personal website
Tensors and Dynamic neural networks in Python with strong GPU acceleration
This repository houses code for an R Shiny web application that allows users to share interactive data visualizations. I deployed the application via shinyapps.io, enabling users to access the application through a web browser.
A README template for anyone to copy and use.
This project analyzes fictional recruiting data through two main approaches: prediction and explanation. A multilayer perceptron is used for prediction and logistic regression for explanation.
Running Python in R with the reticulate package
This repository utilizes social network analysis (SNA) to analyze multiple social networks. Clustering algorithms were used to explore sub-groups and QAP was implemented for non-parametric multiple regression analysis.
Apache Spark - A unified analytics engine for large-scale data processing
Performed data cleaning, visualization, and statistical testing in R on Spotify's Global Top 50 songs. Implemented multiple regression to identify multivariate predictors of song popularity.
Spun up an instance in Amazon Web Services (AWS) and built a database using SQL. Connected the AWS instance to RStudio in a local environment and ran SQL commands in RStudio using the DBI package.
Repo for the course Applied Multivariate Statistics
A data science project to predict whether a transaction is fraudulent.
Used the Twitter API for data mining. Built an HTML-formatted data visualization of tweet activity for trending houseplants using RMarkdown.