Giter VIP home page Giter VIP logo

machine_learning's Introduction

Machine_Learning

Tasks for Machine Learning Course in ITMO University.

Index

  1. Classifiers
  2. Optimization algorithms
  3. Logistic regression method
  4. Natural Language Processing
  5. CNN and Transfer Learning

  • The first task was to train, test and evaluate four different models. The four different trained classifiers are:
  1. Small decision tree
  2. Deep decision tree
  3. Random forest on small trees
  4. Random forest on deep trees
  • Evaluating each model individualy was done by calculating some metrics (precision;recall; accuracy; F1-score; log-loss;) and by ploting precision-recall and ROC curves for each model.
  • The second task was to train a classifier that avoids Type II (False Negative) errors and calculate different metrics to evaluate it. By avoiding Type II errors, it means that the recall for that algorithm should be greater than 0.95

  • In this file two optimization algorithms were implemented:
  1. Gradient Descent (GD) algorithm
  2. Adam's optimization algorithm for a function of two variables f(x,y).
  • The two algorithms were applied on an arbitrary function and implement the search for its minimum.
  • Finally the process of finding an extremum was illustrated in the form of a graph.

Logistic regression method was implemented through three different optimization algorithms: gradient descent, stochastic gradient descent, and Adam algorithm. The three algorithms were built as modeled and trained for 5 different learning rates. Finally the performance of the three algorithms was compared


  • The data analyzed is the story of Alice in Wonderland by Lewis Carroll from Project Gutenberg's website.
  • First of all necessary preprocessing tasks are performed on the text: converting to lower case, removing stop words, numbers / non-alphabetic characters, lemmatization.
  • The first task is to find Top 10 most important words (excluding the word 'Alice') from each chapter in the, in terms of TF-IDF metric. Then trying to name each chapter according to the identified tokens?
  • The second task is to find the Top 10 most used verbs in sentences with Alice. And concluding what does alice do the most.

The aim of this file is to distinguish dogs from cats first by Convolutional Neural Network and then by transfer Learning model using pre-trained VGG16-model weights from keras application.

machine_learning's People

Contributors

nemat-al avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.