parsakhavarinejad / heart-disease-classification-clustering

A machine learning project for the classification and clustering of heart disease using the heart.csv dataset. Explore various algorithms to predict heart disease presence and group individuals based on similar characteristics.

Jupyter Notebook 100.00%

Heart Disease Classification and Clustering

This project applies several machine learning algorithms to the heart.csv dataset for classification and clustering. The dataset contains the following columns:

age: Age of the individual
sex: Sex (0 for female, 1 for male)
cp: Chest pain type
trestbps: Resting blood pressure
chol: Serum cholesterol level
fbs: Fasting blood sugar > 120 mg/dl (1 for true, 0 for false)
restecg: Resting electrocardiographic results
thalach: Maximum heart rate achieved
exang: Exercise-induced angina (1 for yes, 0 for no)
oldpeak: ST depression induced by exercise relative to rest
slope: Slope of the peak exercise ST segment
ca: Number of major vessels colored by fluoroscopy
thal: Thallium stress test result (often labeled "thalassemia" in copies of this dataset)
target: Target variable (1 for presence of heart disease, 0 for absence)
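
As a quick sanity check on the schema above, the sketch below reads a CSV with Python's standard library and verifies that all 14 listed columns are present. The helper name `load_heart_rows` and the two sample rows are made up for illustration; they are not taken from the repository or from heart.csv.

```python
import csv
import io

# Column names as listed in this README.
EXPECTED_COLUMNS = [
    "age", "sex", "cp", "trestbps", "chol", "fbs", "restecg",
    "thalach", "exang", "oldpeak", "slope", "ca", "thal", "target",
]


def load_heart_rows(source):
    """Read rows from an open CSV file and convert every value to float.

    Hypothetical helper: raises ValueError if a listed column is missing.
    """
    reader = csv.DictReader(source)
    missing = [c for c in EXPECTED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return [{c: float(row[c]) for c in EXPECTED_COLUMNS} for row in reader]


# Synthetic two-row sample (made-up values, NOT real patient data).
SAMPLE_CSV = (
    ",".join(EXPECTED_COLUMNS) + "\n"
    "60,1,0,130,250,0,1,150,0,1.5,1,0,2,1\n"
    "45,0,2,120,200,0,0,170,0,0.0,2,0,2,0\n"
)

rows = load_heart_rows(io.StringIO(SAMPLE_CSV))
positives = sum(1 for r in rows if r["target"] == 1)
print(f"{len(rows)} rows, {positives} with target == 1")
```

To run the same check on the real data, open heart.csv with `open("heart.csv", newline="")` and pass the file object to `load_heart_rows` in place of the in-memory sample.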

Machine Learning Tasks

Classification

The classification task predicts the presence or absence of heart disease from the features listed above. Algorithms to explore include:

Logistic Regression
Decision Trees (DecisionTreeClassifier)
Random Forest (RandomForestClassifier)
Support Vector Machines
K-Nearest Neighbors (KNN)
Gaussian Naive Bayes (GaussianNB)
AdaBoost (AdaBoostClassifier)
Bagging (BaggingClassifier)
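
One way to compare several of these classifiers is cross-validated accuracy. The sketch below assumes scikit-learn and uses a synthetic dataset from `make_classification` as a stand-in for heart.csv (13 features, binary target), so the printed scores say nothing about the real data. The choice of models, the 5-fold split, and the hyperparameters are illustrative assumptions, not the notebook's actual setup.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for heart.csv: 300 samples, 13 features, binary target.
# Replace X and y with the real feature matrix and the `target` column.
X, y = make_classification(
    n_samples=300, n_features=13, n_informative=6, random_state=0
)

# Scale-sensitive models are wrapped in a pipeline so that scaling is
# fitted on the training folds only.
models = {
    "Logistic Regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "KNN": make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5)),
    "GaussianNB": GaussianNB(),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=0),
}

scores = {}
for name, model in models.items():
    # Mean accuracy over 5 cross-validation folds.
    scores[name] = cross_val_score(model, X, y, cv=5, scoring="accuracy").mean()
    print(f"{name}: {scores[name]:.3f}")
```

Accuracy is used here for simplicity; on a medical dataset, recall or ROC AUC may be a more informative `scoring` choice.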

Clustering

In the clustering task, we aim to group individuals based on similar characteristics. Some clustering algorithms to consider are:

K-Means Clustering
Hierarchical Clustering
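
Both clustering approaches can be sketched with scikit-learn as follows. The data is a synthetic stand-in generated with `make_blobs`, and the choice of three clusters and Ward linkage is an arbitrary assumption for illustration; on the real data the number of clusters would need to be chosen, for example by comparing silhouette scores.

```python
from sklearn.cluster import AgglomerativeClustering, KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the heart.csv features (no target column is used).
X, _ = make_blobs(n_samples=300, n_features=13, centers=3, random_state=0)

# Both algorithms are distance based, so features are standardized first.
X_scaled = StandardScaler().fit_transform(X)

N_CLUSTERS = 3  # assumption for this sketch, not a value from the project

kmeans_labels = KMeans(
    n_clusters=N_CLUSTERS, n_init=10, random_state=0
).fit_predict(X_scaled)

hier_labels = AgglomerativeClustering(
    n_clusters=N_CLUSTERS, linkage="ward"
).fit_predict(X_scaled)

# Silhouette score lies in [-1, 1]; higher means better separated clusters.
kmeans_silhouette = silhouette_score(X_scaled, kmeans_labels)
hier_silhouette = silhouette_score(X_scaled, hier_labels)
print(f"K-Means silhouette:      {kmeans_silhouette:.3f}")
print(f"Hierarchical silhouette: {hier_silhouette:.3f}")
```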

Getting Started

Clone this repository.
Install the required libraries and dependencies.
Run the Jupyter notebooks or Python scripts to perform classification and clustering.
Analyze the results and make improvements as needed.

Data Source

The data comes from heart.csv, which is included in the project folder.

Author

Parsa Khavarinejad
This is a project for the data mining course at Tarbiat Modares University.

