Random Forest Regression to Predict Cancer Rates Based on Data from the OECD

Introduction

My name is Levi. This is the capstone project for my Master's in Data Analytics.

In this project I explore data gathered from the Organization for Economic Cooperation and Development (OECD) to build a machine learning model that predicts cancer rates based on country, year, and type of cancer.

The project report can be viewed here: https://www.overleaf.com/read/bryndtbswyhk#c2f711

Steps in Process

Collect Data

Explore and Clean Data

Develop and Test Model

Visualize Model

Analyze Data
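The "Develop and Test Model" step could look roughly like the sketch below. It is not the code from the notebooks: the column names (Country, Year, Cancer site, Rate), the one-hot encoding, the 75/25 split, and the hyperparameters are all assumptions made for illustration, and the data is synthetic so the example runs without the OECD files.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_absolute_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder

# Synthetic stand-in data: NOT real OECD figures, only here so the sketch runs.
# Column names are hypothetical and may differ from the project's CSV files.
rng = np.random.default_rng(0)
rows = [
    {"Country": c, "Year": y, "Cancer site": s}
    for c in ["A", "B", "C"]
    for y in range(2000, 2020)
    for s in ["Site 1", "Site 2"]
]
df = pd.DataFrame(rows)
df["Rate"] = (
    50
    + 0.5 * (df["Year"] - 2000)
    + df["Country"].map({"A": 0, "B": 10, "C": 20})
    + df["Cancer site"].map({"Site 1": 0, "Site 2": 5})
    + rng.normal(0, 1, len(df))
)

X = df[["Country", "Year", "Cancer site"]]
y = df["Rate"]

# One-hot encode the two categorical features; pass Year through unchanged.
preprocess = ColumnTransformer(
    [("cat", OneHotEncoder(handle_unknown="ignore"), ["Country", "Cancer site"])],
    remainder="passthrough",
)
model = Pipeline(
    [
        ("preprocess", preprocess),
        ("forest", RandomForestRegressor(n_estimators=100, random_state=0)),
    ]
)

# Hold out a quarter of the rows to test the fitted model.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)
model.fit(X_train, y_train)
pred = model.predict(X_test)

mae = mean_absolute_error(y_test, pred)
r2 = r2_score(y_test, pred)
print(f"MAE: {mae:.2f}  R^2: {r2:.3f}")
```

Wrapping the encoder and the forest in one Pipeline keeps the encoding fitted on the training rows only, which avoids leaking information from the test split.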

Requirements for this Project

GitHub account (to fork the repo)

Jupyter Lab

Python (3.12.1)

VS Code (optional; it is what I used to organize and write the code)

Project Libraries

Pandas

NumPy

Matplotlib

Seaborn

scikit-learn

Project Files

Capstone Cleaning and EDA.ipynb: Jupyter notebook for cleaning and exploring the dataset

Capstone Models.ipynb: Jupyter notebook with the models and visualizations

OECDHEALTHDATA.csv: Original data file downloaded from the OECD website

OECD_Final.csv: CSV generated after first round of cleaning

OECD_MN.csv: CSV generated after second round of cleaning. Contains only cancer rates for the "Malignant neoplasms" cancer site.

OECD_NOMN.csv: CSV generated after second round of cleaning. Contains cancer rates for every cancer site except "Malignant neoplasms".
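The split into OECD_MN.csv and OECD_NOMN.csv can be pictured with a short pandas sketch. The column name "Cancer site", the helper split_by_site, and the placeholder rows are hypothetical; the cleaning notebook may do this differently.

```python
import pandas as pd

SITE_COLUMN = "Cancer site"  # hypothetical column name
MN_LABEL = "Malignant neoplasms"  # label taken from the file descriptions above


def split_by_site(df, site_column=SITE_COLUMN, label=MN_LABEL):
    """Return (rows whose site equals label, all remaining rows)."""
    is_label = df[site_column] == label
    return df[is_label].copy(), df[~is_label].copy()


# Placeholder rows, not real OECD values.
cleaned = pd.DataFrame(
    {
        "Country": ["A", "A", "B", "B"],
        "Year": [2010, 2010, 2010, 2010],
        "Cancer site": [
            "Malignant neoplasms",
            "Site 1",
            "Malignant neoplasms",
            "Site 1",
        ],
        "Rate": [1.0, 2.0, 3.0, 4.0],
    }
)

mn, nomn = split_by_site(cleaned)

# Writing the two subsets out would then be:
# mn.to_csv("OECD_MN.csv", index=False)
# nomn.to_csv("OECD_NOMN.csv", index=False)
print(len(mn), len(nomn))
```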

Resources

These are the resources used for research and coding
