Giter VIP home page Giter VIP logo

ml-waferfault-detection's Introduction

ML-WaferFault-Detection

Problem Statement:

To build a classification methodology to predict the quality of wafer sensors based on the given training data.

Description:

Manufacturer has deployed various wafers that are used in a semiconductor industry. Currently, if a wafer is faulty the entire manufacturing plant has to be stopped and each wafer has to be checked. In order to avoid shutting down all the lanes, sensors are deployed for each wafer. Sensors send signals that will help to identify the faulty wafer. A faulty wafer is labelled as -1 and intact one as +1. A machine learning algorithm needs to deployed to find the faulty wafer based on sensor signals.

Architecture

courtesy: iNeuron Intelligence img

Steps:

  • Data Ingestion:
    • data for batch training

    • data validation

      • create a schema json file for training and prediction
        • sample file name "wafer_ddmmyyyy_hhmmss.csv"
        • length of date stamp in file name = 8
        • length of time stamp in file name = 6
        • number of columns = 592
        • column names and data types according to schema
      • regex creation with file name syntax and compare against schema
      • compare the length of date stamp and time stamp against schema
      • compare the columns of file against the schema
      • check if file contains all the columnns, else move to the good/bad folders
      • check if any column contains only null values and move to bad files folder
    • data transformation

      • check for missing values in all columns, replace NaN with 'NULL'
    • data insertion in db

  • data processing
    • export data from db to csv for training
    • data preprocessing
    • data clustering
  • Model Training
    • best model for each cluster
    • hyperparameter tuning
    • model saving
  • Deployment
    • Cloud setup
    • Pushing app to cloud
    • Application start
  • Prediction
    • data ingestion
    • data processing
    • model call for specific cluster
    • prediction
    • export prediction to csv

Workflow

  1. Update config.yaml
  2. Update secrets.yaml [Optional]
  3. Update params.yaml
  4. Update the entity
  5. Update the configuration Manager in src config
  6. Update the components
  7. Update the pipeline
  8. Test run pipeline stage
  9. run tox for testing your package
  10. run "dvc repro" for running all the stages in pipeline

ml-waferfault-detection's People

Contributors

nds-najam avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.