Giter VIP home page Giter VIP logo

Yashvardhan Rathi's Projects

bestfootballgifs-w3 icon bestfootballgifs-w3

A website where you can upload your favorite football GIF's and upvote them. You can also tip the uploader if you like the GIF.

clustering-in-r-and-visualizing-results-in-tableau-on-titanic-dataset. icon clustering-in-r-and-visualizing-results-in-tableau-on-titanic-dataset.

The objective of this exercise is to get a hands on experience on Decision Tree and K-means clustering analysis using R and visualizing results through Tableau. Below are the objectives of the exercise 1)Data retrieval 2)Data pre-processing 3) Decision Tree using R 4) K-mean clustering using Tableau- R integration by invoking Rserve (). The Titanic dataset will be used for this purpose.

coorelation-matrix-and-heatmap-in-r icon coorelation-matrix-and-heatmap-in-r

Using the Toyota Corolla dataset I observe which variables are related to each other by creating dummy variables and plotting them in a correlation matrix.

geo-heatmap icon geo-heatmap

:world_map: Generate an interactive geo heatmap from your Google location data

linear-discrimant-analysis-in-r- icon linear-discrimant-analysis-in-r-

A team collected data on email messages to create a classifier that can separate spam from non-spam email messages. We use LDA to classify emails as spam and non-spam email and then evaluate the effectiveness of the model. The data-set used is from the UCI Machine Learning Library. Here is the link: https://archive.ics.uci.edu/ml/datasets/spambase

logistic-regression-and-confusion-matrix-in-r icon logistic-regression-and-confusion-matrix-in-r

Ledoitte, a management consulting firm, is studying the roles played by experience and training in a system administrator’s ability to complete a set of tasks in a specified amount of time. Ledoitte is interested in figuring out which administrators can complete given tasks within a specified time and those who are not. Data are collected on the performance of 75 randomly selected administrators. They are stored in the file SystemAdministrators.csv . The variable Experience measures months of full-time system administrator experience, while Training measures the number of relevant training credits. The outcome variable Completed is either Yes or No, according to whether or not the administrator completed the tasks. 1. Using ggplot2 package, create a scatter plot of Experience vs. Training using color or symbol to distinguish programmers who completed the task from those who did not complete it. Which predictor(s) appear(s) potentially useful for classifying task completion? 2. Run a logistic regression model with both predictors using the entire dataset as training data. Generate a confusion matrix and answer the following: among those who completed the task, what is the percentage of programmers incorrectly classified as failing to complete the task? 3. How much experience must be accumulated by a programmer with 6 years of training before his or her estimated probability of completing the task exceeds 0.6?

svm-on-iris-dataset icon svm-on-iris-dataset

Support vector machine algorithm implemented on IRIS data-set to classify the features achieving a accuracy of 97%

visualizing-data-using-gephi icon visualizing-data-using-gephi

The objective of this exercise is to develop skills on how to visualize and analyze large networks using Gephi. This exercise focuses on the relationship between websites. Here we would be emphasizing on the association of Apple with other websites and with itself.

visualizing-data-using-qlikview icon visualizing-data-using-qlikview

The objective of this exercise is to develop skills on how to visualize the data using QlikView tool. This exercise focuses on visualizing data using multi maps aka Trellis maps. We will create these maps to track youth employment, income and expenditure trends over the years globally. Youth employment, income, and expenditure are key factors in identifying new markets for business.

visualizing-risk-of-disasters-at-oil-refineries-using-tableau icon visualizing-risk-of-disasters-at-oil-refineries-using-tableau

The objective of this project is to use data visualization to find out top 4 natural disasters impacting a corporation in the energy industry. As a part of this exercise we will also create multi maps (also known as Trellis charts or Panel charts) using Tableau. The dataset that will be used for this project contains list of natural disasters (Flood, Hurricane etc.) that occurred in the USA between the years of 2004 to 2015(data source FEMA).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.