Giter VIP home page Giter VIP logo

intromldlt2023spring's Introduction

Introduction to Machine and Deep Learning Theory

Content

  • Projects added to the projects folder
  • Second round of exam will take place on May 17th at 17:00.
  • Exam will take place online right after the 11th lecture on May 3rd. Topics to learn: first 10 lectures. The participants should send an e-mail to [email protected] in advance with e-mail topic [MM MLDL Exam] and mentioning their names, course, and group inside the e-mail body.
    • The projects should be sent via the same e-mail with e-mail topic [MM MLDL Project] and mentioning their names, course, and group inside the e-mail body + link to the code + link to the presentation with results.
  • The first lecture will take place on Wednesday, February 8th, at 16:45 (online)

In the spring semester of 2023 at the Faculty of Mechanics and Mathematics of Lomonosov Moscow State University a new special course of the student's choice, dedicated to the theory of machine learning and deep learning, is to be provided.

The course will be taught on the basis of the Department of Mathematical Theory of Intelligent Systems under the guidance of Ph.D., Senior Researcher Mazurenko I.L. The course will be taught by Ph.D. Petiushko A.A.

Classes are to be taught on Wednesdays at 16:45, online.

Number Date Lecture Video
01 08.02.2023 Empirical Risk and its Approximation. Loss Function. (Stochastic) Gradient Descent. MLE and MAP. Kullback-Leibler divergence and Cross Entropy lecture01
02 15.02.2023 FaceID: Evolution of Loss Function. Representation Learning. SoftMax-, contrastive- and angular-based losses lecture02
03 22.02.2023 Discriminate vs Generative models. Generative Adversarial Networks. Deep Convolutional GAN. Wasserstein GAN. Gradient-Penalty WGAN. Conditional GAN. lecture03
04 01.03.2023 Bayesian Inference, Bayesian Neural Network, Variational Inference, Autoencoder, Variational Autoencoder, Conditional Variational Autoencoder lecture04
05 15.03.2023 Recap of Markov Chains. Markov Chain Monte Carlo. Gibbs sampler. Metropolis-Hastings sampler. Langevin dynamics and Metropolis-Adjusted Langevin. Stochastic Gradient Langevin Dynamics lecture05
06 22.03.2023 Recap of Variational Autoencoder. Markovian Hierarchical VAE. Diffusion models: Variational Diffusion Models, Diffusion Denoising Probabilistic Models, Diffusion Denoising Implicit Models, Classifier and classifier-free guidance, 3 interpretations lecture06
07 29.03.2023 Adversarial Robustness I: Great Success of CNNs, Robustness Phenomenon, Taxonomy of Adversarial Attacks, l_p norms, Digital Domain, Fast Gradient Sign Method and its variants, Universal Attacks, Top-k Attacks, l_0 attacks lecture07
08 12.04.2023 Adversarial Robustness II: Adversarial examples in real world, Adversarial attack on Face detection and Face ID systems, Defense from adversarial examples in real world, Black-box Face restoration lecture08
09 19.04.2023 Certified Robustness I: definitions of Certified Robustness, connection to Lipschitzness, Randomized Smoothing and its variants lecture09
10 26.04.2023 Certified Robustness II: recap of Certified Robustness, Ablations on base classifier/norm of perturbation/smoothing distribution, Certification in High Dimensional case, Certification of Semantic Perturbations, Application to different Computer Vision tasks lecture10
11 03.05.2023 Neural Tangent Kernel: Lazy regime of training, GD as PDE, NTK and CNTK, NTK convergence rates lecture11
  1. Investigate Neural Collapse on different datasets (MNIST, Omniglot, LFW, ...)
  2. Make a comparison study of angular-based losses vs metric-based ones on different datasets (MNIST, Omniglot, LFW, ...)
  3. Think of evaluation metric for GAN solution (aside from Inception Score / Frechet Inception Distance) and make a coparison study of this metric for different GAN solution: vanilla GAN, WGAN, WGAN-GP
  4. Implement and analyze the BNN recognition results using different priors for weights (Uniform, Gaussian, Laplace) on different datasets (MNIST, Omniglot, LFW, ...)
    1. Do it with Variational Inference
    2. Do it with MCMC
  5. Explore the Diffusion generation quality vs number of steps on different datasets (MNIST, Omniglot, LFW, ...)
    1. Do it with unconditional generation
    2. Do it with classifier(-free) guidance
    3. Explore different strategies of $\alpha$ ($\beta$) decrease schedule
  6. Make a quantitave and qualitative analysis of different $l_0/l_1/l_2/l_{\infty}$-based Adversarial Attacks (success rate, number of iterations, etc) on different datasets (MNIST, Omniglot, LFW, ...)
    1. Do it for the Universal Adversarial Attack as well
    2. Compare the transferability for different NN architectures (LeNet, VGG, ResNet, etc)
  7. Create a real-world attack demo for any detection/recognition system
  1. Machine Learning Lecture Course on http://www.machinelearning.ru from Vorontsov K.V.
  2. Hastie, T. and Tibshirani, R. and Friedman, J. The Elements of Statistical Learning, 2nd edition, Springer, 2009.
  3. Bishop, C.M. Pattern Recognition and Machine Learning, Springer, 2006.
  4. Ian Goodfellow, Yoshua Bengio, Aaron Courville, and Yoshua Bengio. Deep learning. Vol. 1. Cambridge: MIT press, 2016.
  5. Matus Telgarsky, Deep learning theory lecture notes, 2021
  6. Sanjeev Arora et al., Theory of Deep learning book draft, 2020

Introduction to machine learning

Theoretic Courses

  • Foundations of Deep Learning: Course at UCLA
  • Deep learning theory: Course at UIUC
  • Theoretical Deep Learning: Course at Princeton

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.