Giter VIP home page Giter VIP logo

cement-strength-prediction-with-pyspark's Introduction

Cement Strength Prediction with PySpark's MLlib

Language Badge PySpark Badge Library Badge Library Badge License Badge

This project involves building a regression model using PySpark's MLlib to predict the compressive strength of cement based on its ingredients. The dataset consists of 1030 instances of concrete samples, containing 9 attributes (8 continuous and 1 discrete), and 1 continuous quantitative output variable.

Our goal is to identify the features that are the strongest predictors of cement strength and provide recommendations for optimal values of cement ingredients. We will also create an application where users can input their own values for age and get a predicted value for cement strength.

Prerequisites

Before running the code, you will need to have the following installed:

  • PySpark: the Python API for Apache Spark
  • Jupyter Notebook: an interactive development environment for Python

Dataset

The dataset used in this project is the Concrete Compressive Strength Data Set from the UCI Machine Learning Repository. It contains 1030 instances of concrete samples, with 9 attributes (8 continuous and 1 discrete) and 1 continuous quantitative output variable (compressive strength).

Column Name Data Type Measurement Unit Description
cement double kg/m3 Input variable
slag double kg/m3 Input variable
flyash double kg/m3 Input variable
water double kg/m3 Input variable
superplasticizer double kg/m3 Input variable
coarseaggregate double kg/m3 Input variable
fineaggregate double kg/m3 Input variable
age integer days Input variable
csMPa (compressive strength) double MPa Output variable

Usage

To run the code, open the Cement_Strength_Prediction.ipynb file in Jupyter Notebook and execute the cells in order. The notebook contains detailed explanations of each step in the code and the results obtained.

To use the application for predicting cement strength based on user input, run the predict_cement_strength function in the notebook after it has been defined. This function takes a single argument age (in days) and returns the predicted cement strength (in MPa).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.