Gus Sanchez's Projects
Analysis on Amazon's Vine Review program. Created tables in AWS RDS database. In addition, we also created Google Colab notebooks, then extracted the datasets and placed them into each notebook. We then create a schema that fits the datasets and then loaded the datasets into our RDS. Finally, PySpark, the Python API for Apache Spark was used to transform and load the data into an SQL table in order to perform the data analysis.
Created a data visualization story and dashboards for analysis of citi bike data using ETL methods in python to clean the data and Tableau for visualization.
Incorporation of Python to build and evaluate several machine learning models to predict credit risk. Use of the Scikit-learn machine learning library to create machine learning models .
Analyze a dataset of cryptocurrencies to uncover hidden trends that can lead to great investment opportunities. Used unsupervised algorithms. Worked primarily with K-means algorithm.
Voting Audit on a local Congressional Election using Python 3 and VS Code
GitHub stats & most used languages
Performing analysis on Kickstarter data to uncover trends
Created a geographical visualization to map earthquake data by latitude and longitude from the USGS with Leaflet.js Application Programming Interface(API).
Implementation of statistics, and hypothesis testing to analyze a series of datasets from the automotive industry. Merged csv files into one data frame to create a series of scatter plots between treatment groups with `R`. Performed statistical analysis to determine differences between treatment groups.
HTML and CSS web scraping to automate a web browser to visit different websites to extract data about the Mission to Mars. Data is stored in a NoSQL database MongoDB, and then the data is rendered in a web application created with Flask to display the data from the web scrape. HTML/CSS portfolio was created to showcase the project. Bootstrap components used to polish and customize the portfolio.
ETL(Extract, Transform, Load) pipeline on kaggle movie data using Python and pgAdmin.
SQL analysis on employee database to provide forecasting recommendations and strategies for eligible retiring employees using SQL and pgAdmin.
Dynamic and interactive visualization dashboard and charts using Plotly.js with a human belly button diversity dataset. Interactive dashboard and charts display bacteria that live inside the human body. Upon selection of an id number in a pull down list, the id metadata will be displayed in a div element and the top ten bacterial samples will be displayed in a pie chart and bubble chart.
Data Analysis and Visualization on ride-sharing data based on selected city-types using Pandas, NumPy, and Matplotlib Libraries through Jupyter Notebook.
Data Analysis on a client's School District's funding and SAT scores in Math and Reading using Pandas and NumPy libraries through Jupyter Notebook.
Analysis of selected weather data from an SQLite Database using tools such as Python, Jupyter Notebook, SQLAlchemy, and design a flask application using data.
Created a website that displays UFO-sightings data and built a table using that data stored in a JavaScript array. Created filters to make this table fully dynamic, reacting to user input using Javascript, HTML/CSS. Website was customized using Bootstrap, and equipped the table with several fully functional filters that will allow users to interact with the websiteβs visualizations.
Analysis on weather data and travel maps and directions using Google Maps Platform on Google Cloud of selected number of cities globally using APIs.