Giter VIP home page Giter VIP logo

randomforest-and-dnn-cpt's Introduction

RandomForest-and-DNN-CPT

This repository is part of my Thesis and it is an open source project for the future.

A brief presentation perspective is necessary for the development of the problem's reasoning. In Geotechnical Engineering there is a need for predictive models to understand and anticipate the geomechanical behavior of the soil. Thus, one of the most easy to apply field trials and the most complete in this regard is the CPT trial. This repository is linked with an academic thesis development article.

The Machine and Deep Learning tools used in the process were: Deep Neural Networks (DNN) and Random Forest Regressor (RFR).

CPT (Cone Penetration Test)

The database used in this article comes from Oberhollenzer et al., 2021.

With approximately 2 million observations and 30 usable features.

Reference: Oberhollenzer, S., Premstaller, M., Marte, R., Tschuchnigg, F., Erharter, G.H., Marcher, T., 2021. Cone penetration test dataset Premstaller Geotechnik. Data in Brief 34, 106618. https://doi.org/10.1016/j.dib.2020.106618

Inputs and Outputs

Thus, the Deep Neural Network (DLNN) and Random Forest Regressor (RFR) models were estimated for the following inputs nad outputs:

Model Input Output
Model 01 Depth (m), σ',v (kPa), SBT (-), SBTn (-), Mod. SBTn (-), ISO Classes qc (MPa), fs (kPa)
Model 02 σ',v (kPa), SBT (-), SBTn (-), Mod. SBTn (-), ISO Classes qc (MPa), fs (kPa)
Model 03 SBT (-), SBTn (-), Mod. SBTn (-), ISO Classes qc (MPa), fs (kPa)
Model 04 SBT (-), SBTn (-), Mod. SBTn (-) qc (MPa), fs (kPa)
Model 05 SBT (-), SBTn (-), Mod. SBTn (-) qc (MPa), fs (kPa)
Model 06 Depth (m), σ',v (kPa), SBT (-), SBTn (-), Mod. SBTn (-) qc (MPa), fs (kPa)

Methodology

The work methodology followed the flowchart below. Thus, the database was subjected to several data processing steps. Pre-processing included the removal and regularization of variables, along with exploratory analysis of the variables of interest.

Alt Text

Still, the models were properly selected initially via Grid-Search. The optimization and validation of the model occurred through cross-validation in both methods: DLNN and RFR.

Metrics

For this purpose, primary metrics were used in all steps of optimization, validation and comparison of the models.

Metrics such as: Coeficient of determination (R2), mean ssquared error (MSE), root mean squared error (RMSE) and mean absolute error (MAE).

randomforest-and-dnn-cpt's People

Contributors

vlpacheco avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.