Giter VIP home page Giter VIP logo

msc-datascience-thesis's Introduction

Genetic Programming and Neural Networks for Financial Predictive Modeling

This is currently a private repository, it contains the whole project, data and execution codes to produce the results that will be used for the Master in Science in Data Science graduate programm. This is only made public in order to apply to the Turing-LIDA DSG 2021

Abstract

Financial time series forecasting certainly is the case of a predictive modeling process with many challenges, mainly because the temporal structure of the data. Genetic programming, as a particular variation of genetic algorithms, can be used to as a feature engineering, importance and selection process all at once, it can provide highly interpretable symbolic features that have low colinearity among them and yet high correlation with a target variable. We present the use of such method for generating symbolic features from endogenous linear and autoregressive variables, along with a Multi-Layer Perceptron, to construct a binary predictor for the price of Ethereum cryptocurrency.

Main Functionalities

  • Autoregressive Feature Generation (autoregressive_features)
  • Hadamard Product for Feature Generation (hadamard_features)
  • Genetic Programming for Symbolic Operations for Feature Generation (symbolic_features)
  • Timeseries Block Folds without filtration (t_folds)
  • Classifier model: Logistic Regression with Elastic Net Regularization (logistic_net)
  • Classifier model: L1 Support Vector Machines (l1_svm)
  • Classifier model: Artificial Neural Net Multilayer Perceptron (ann_mlp)
  • Genetic Algorithms Optimization (genetic_algo_optimization)
  • Machine Learning Models Performance Metrics (model_evaluation)
  • Plotly visualizations of results (visualizations.py)

Running on cluster

Because of the extensive explorations, this project was executed in a computing cluster at the ITESO university

image

Other progress is for the paper format in order to submit some of the findings to a journal ...

logo1 logo2
logo3 logo4

License

Private. This content is intendedn only for the application to the Turing-LIDA DSG 20212

msc-datascience-thesis's People

Contributors

iffranciscome avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.