Giter VIP home page Giter VIP logo

hidivescrape's Introduction

Hi, I'm Pat Mendoza, a recent grad from Cornell. I enjoy figuring out ways to integrate and present data in an aesthetically pleasing and informative format. This often involves transforming files in python then integrating them and creating visualizations in R.

I've worked in SQL, R, and Python for several years and am currently working on uploading some examples of my work.

Here are a few that I have so far (more upcoming):

  1. Google Data Analytics Professional Certificate Capstone Project - Tableau Link
    • This is the final project for my certificate. I extract viewership data from an API and convert the JSON format into tabular data which gets loaded onto kaggle. Then I created a viz on Tableau that allows users to explore the data.
    1. Extracting the data - here I go through the code that allows an extraction of data from MyAnimeList via their API and convert them into tables.
    2. Cleaning the data - here I go through my cleaning process for the data so that I ensure that its ready for loading into Tableau and Kaggle.
  2. R
    1. Mirrorplot - this is creating a simple mirrorplot that can be good visualization for showing up/down regulated genes in an RNA-seq.
    2. Clustering Samples - clustering is a common exercise to determine how closely samples are related to eachother. This shows how samples can be clustered using a PCoA and PCA and visualizing using ggplot.
    3. Data Wrangling with tidyr and dplyr - converting and integrating data from multiple sources is often tricky business. Luckily there are some great tools available that make this a breeze.
  3. Python
    1. Converting files in non-tabular to tabular format - Oftentimes, we come across data that isn't in the form that we need to make joins, when that happens, we can convert those using simple python scripts
    2. Data Wrangling with pandas and numpy - This is a replica of the above R data wrangling but using python with pandas and numpy in place of tidyr and dplyr
  4. Misc Python (in development) - some misc scripts that I used to add in RNA-seq generation.

hidivescrape's People

Contributors

patmendoza330 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.