Giter VIP home page Giter VIP logo

apoorvar5 / deaths-in-united-states-analysis Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 995 KB

Used Jupyter along with python libraries like numpy, panda, matplotlib, etc to understand and visualize cause of deaths in the united states and formed clusters based on each diseases and their respective cause of death.

Jupyter Notebook 100.00%
clustering-algorithm data-cleaning-and-preprocessing data-visualization jupyter-notebook kmeans-clustering pandas python silhouette-score

deaths-in-united-states-analysis's Introduction

Deaths-in-United-States-Analysis

Analysis on Causes of Death in United States

Overview

This repository contains the code and analysis for a project focused on the causes of death in the United States. The analysis includes clustering of states based on the most prevalent causes of death, a comparison between the years 1999 and 2017, and a detailed case study for the state of California. Additionally, a heatmap visualizes the number of deaths across all states from 1999 to 2017.

Project Structure

  • data/: Contains the dataset used for the analysis (causes-of-death.csv).

  • src/: The source code files.

    • Cause of Death in USA.ipynb: Jupyter Notebook file containing the code for k-means clustering and silhouette score verification, comparing causes of death between 1999 and 2017, focusing on the leading causes of death in California for 1999 and 2017, a heatmap of the United States showing the number of deaths from 1999 to 2017.

Clustering

The clustering analysis involves applying k-means clustering to group states based on the most prevalent causes of death. The results are validated using silhouette scores, and states are classified into high, medium, and low categories based on the number of deaths due to specific diseases.

Yearly Comparison

The comparison analysis explores the changes in causes of death from 1999 to 2017, providing insights into trends and shifts over the years.

California Case Study

A detailed case study focuses on the state of California, comparing leading causes of death in 1999 and 2017. The analysis provides a deep dive into the specific changes in this state over the years.

Visualization

A graphical representation of the leading causes of death over the years is presented, offering a comprehensive view of the trends in mortality.

Heatmap

The repository includes a heatmap visualizing the number of deaths across all states in the United States from 1999 to 2017.

Usage

To run the analysis, follow the steps outlined in each Jupyter Notebook. Ensure that the required dependencies are installed.

pip install -r requirements.txt

Dataset

The analysis uses the causes_of_death_usa.csv dataset, which is included in the data/ directory.

Conclusion

This project provides valuable insights into the causes of death in the United States, offering both a broad overview and detailed analyses for specific states and years. Feel free to explore the Jupyter Notebooks for a deeper understanding of the findings.

If you have any questions or suggestions, please open an issue or reach out!

deaths-in-united-states-analysis's People

Contributors

apoorvar5 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.