This repository contains the code and analysis for a project focused on the causes of death in the United States. The analysis includes clustering of states based on the most prevalent causes of death, a comparison between the years 1999 and 2017, and a detailed case study for the state of California. Additionally, a heatmap visualizes the number of deaths across all states from 1999 to 2017.
-
data/: Contains the dataset used for the analysis (
causes-of-death.csv
). -
src/: The source code files.
- Cause of Death in USA.ipynb: Jupyter Notebook file containing the code for k-means clustering and silhouette score verification, comparing causes of death between 1999 and 2017, focusing on the leading causes of death in California for 1999 and 2017, a heatmap of the United States showing the number of deaths from 1999 to 2017.
The clustering analysis involves applying k-means clustering to group states based on the most prevalent causes of death. The results are validated using silhouette scores, and states are classified into high, medium, and low categories based on the number of deaths due to specific diseases.
The comparison analysis explores the changes in causes of death from 1999 to 2017, providing insights into trends and shifts over the years.
A detailed case study focuses on the state of California, comparing leading causes of death in 1999 and 2017. The analysis provides a deep dive into the specific changes in this state over the years.
A graphical representation of the leading causes of death over the years is presented, offering a comprehensive view of the trends in mortality.
The repository includes a heatmap visualizing the number of deaths across all states in the United States from 1999 to 2017.
To run the analysis, follow the steps outlined in each Jupyter Notebook. Ensure that the required dependencies are installed.
pip install -r requirements.txt
The analysis uses the causes_of_death_usa.csv
dataset, which is included in the data/
directory.
This project provides valuable insights into the causes of death in the United States, offering both a broad overview and detailed analyses for specific states and years. Feel free to explore the Jupyter Notebooks for a deeper understanding of the findings.
If you have any questions or suggestions, please open an issue or reach out!