SAI SURAJ ARGULA's Projects
This repository contains an analysis of customer churn data.
This project analyzes tweets posted by Donald Trump, U.S. president (2017-2021), between January 2015 and 17 September 2020. The analysis covers how tweet counts, frequently tweeted words, Twitter handles, and tweet sentiment changed from year to year.
This project analyzes US presidential speeches. The analysis covers vocabulary patterns, most frequent words, and sentiment across parties, i.e., Democrats, Republicans, and others.
This repository beautifies the GitHub intro page.
This repository contains a project on clustering news articles and headlines shared on Facebook.
A compilation of R and Python code
In this project, we analyze credit card fraud detection data from Kaggle.
In this repository, I analyzed cricket match data. The SQL queries in the analysis are written for Hive and Impala to load data from Hadoop HDFS, keeping big data applications in mind.
The Leek group guide to data sharing
In this project, I used a decision tree classification algorithm to build a model from historical data on patients and their responses to different medications, then used it to predict the class of an unknown patient or to find a suitable drug for a new patient.
Built statistical models to estimate enrollment and graduation rates across universities in the USA using USNEWS data. Analyzed the quantitative impact of cost of attendance, faculty, and student quality and quantity on enrollment and graduation.
example repo
This repository contains a data warehouse dimensional modeling exercise for the analytics engineer position at Fetch Rewards.
Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn and TensorFlow.
In this project, we analyzed and built models on the Home Credit Default Risk dataset from the Kaggle competition.
This repository contains an example database design for a hospital, implemented for an ADBMS course project.
In this project, a support vector machine (SVM) is used to build and train a model on human cell records and classify whether the samples are benign or malignant.
Examined transportation as a social determinant of health. Built a model to identify Medicare members most at risk of a transportation challenge.
This repository contains documents and code for the IBM Data Science Professional Certificate capstone project.
This repository contains an analysis of incident management at an IT firm. The analysis was done using PySpark on Databricks Community Edition.
This is an Iris flower prediction web application built with the Streamlit library.
Google Chrome, Firefox, and Thunderbird extension that lets you write email in Markdown and render it before sending.