Keep static for the time being - shifting to a survey paper on RL

Quote
Problems
Key Ideas
Books
RL
DL, Online and Bandits
Classic Portfolio Selection Materials
Machine Learning based Portfolio Selection
Canonical Correlation Analysis
Sample Efficient

Deliverables for transfer on PhD

Quote

One should avoid solving more difficult intermediate problems when solving a target problem. Vladimir Vapnik, Statistical Learning Theory, 1998

Problems

Related to stochastic optimal control - Can do model free with simulations RL and Deep RL

Merton Problem (Portfolio and consumption)
Optimal Execution - Liquidation Problem
Optimal Execution - Limit Order Placement
Optimal Stopping and Control
Optimal Execution for statistical arbitrage
Optimal execution targetting volume
Market Making problems
Pairs trading - optimal entry/ exit
Multi-period parametric policies (Brandt in mult-period)
Optimal hedging of derivatives with path dependency (JPM - explore the model and more, when is it better than monte carlo and greeks).

Simulation

Stochastic optimal control assumes a model, simulations assume some knowledge of the world (say monte carlo), alternative and more robust simulation methods ? (for example GAN's for time series).

Key Ideas

We are looking to allocate to assets or strategies in a manner that is better than the current state of the art and to get RL working in real world finance. Reinforcement learning is a method for solving MDP's in a model free fashion. There are many MDP problems in finance and a whole mathematical methodology such as stochastic optimal control. Applications are myriad and range from investment/ consumption decisions, derivative hedging, algorithmic trading and inventory management. Solutions may have particular value when there is path dependency on an agent's decisions into the future.

In the derivative hedging method of finance, problems are usually solved in a step-wise fashion...often by calculating or adjusting the greeks, or in more awkward cases by monte carlo methods. Recent paper's hint at the ability to directly learn a hedging strategy in a greek free fashion from a simulation of the environment. In other words rather than a 2 step process - model the environment, solve the model, we can go straight from simulation to hedging, including where there are difficult real life problems such as transaction costs and path dependency and indeed complex risk adjusted functions of our final distribution of returns that we wish to maximise.

Allocation decisions within Finance lie within a most difficult environment. It is partially observed, noisy, and non-stationary, there may be outliers and regimes. Time also plays a key role and decisions again may have long term consequences. In contradistinction to standard methods we are not looking to apply single period prediction and then combine these predictions using an optimiser. This is akin to supervised learning, but in the real world our actions may have long term effects and indeed actions taken by our agents may be reacted against by the environment.

Most standard methods are single period and represent a two stage process, this involves two sets of parameters and forecast error is not utility, so we may even be optimising the wrong target. Other works give up upon some of our ability to predict and are thus more heuristic but more practical methods for allocation decisions, albeit pessimistic.

An information bottleneck is created between the supervised forecast error minimisation and the subsequent forecasts which are then used by an optimiser (some argue that this also serves as an error maximiser and indeed has its own parameters to be found). Given the noise inherent within finance and the fact that predictions are either very weak or indeed only exist for small windows of time then this makes the two stage process even more problematic.

Research has been created to address the two stage parameter estimation - Brandt and this enable a more aligned target. However most current academic work applying RL to allocation decisions is either on a very small scale or ignores basic practical realities of markets (such as transaction costs). Moody et al. appear to have been the earliest to understand these issues and attempt to have one set of parameters, a single utility, include transaction costs and directly map from inputs to actions (rather than predict then optimise).

The Moody work was nearly 20 years ago and indeed he left academic in 2003 to set up a successful hedge fund (which continues to be successful).

My goal is to advance this work using the latest in deep reinforcement learning (and potentially deep learning). The goal is to examine the state of the art, and advance it - particularly with a view to practicality, it is the author’s view that the current gap between the state of the art in RL in academia but applied within finance remains impractical. And in both parts of the research I am examining multi-period, path dependent decision making in difficult environments in a direct rather than 2 stage indirect fashion.

It should also be noted that explainability and sensitivity analysis is important in finance, black boxes are not widely trusted and indeed legally there may be cases where explainability is forced. I propose to also examine RL methods within this domain where sensitivity analysis and explainability is enabled.

A further question is if we are seeking to move directly from inputs to actions which in this case will be allocation weights with an objective of maximising say some long run utility, then there are practical questions as regards transaction costs, sparsity and indeed including practical constraints such as a draw down constraint. Also there are questions as regards throwing noisy time series into an RL agent and the best way to do this, for example if we go ‘deep’ do autoencoders have a part to play and should we be seeking to induce sparsity in our agent’s allocations?

Note that allocation problems, may in some cases be reduced to single state bandit problems and note that sometimes a poor model of the environment may be known and the agent may possibly be able to bootstrap from here. Allocations may be to experts, assets, or indeed strategies.

Books

(book) Portfolio Selection, Markowitz (1952).
(book) Continuous Time Finance, Merton (1992).
(book) Learning Bayesian Networks, Neopolitan (2003).
(book) Advances in Financial Machine Learning, De Marcos (2018).
(book) Active Portfolio Management, Grinold and Kahn (1999).
(book) Efficient Asset Management, Michaud (2008).
(book) Probabilistic Graphical Models: A New Way of Thinking in Financial Modelling, Denev (2015).
(book) Portfolio Management under Stress, Rebonnato (2014).
(book) Risk Assessment and Decision Analysis with Bayesian Networks, Fenton (2018).
(book) Portfolio Construction and Risk Budgeting, Scherer (2015).
(book) Machine Learning for Algorithmic Trading, Jansen (2018).
(book) Financial Signal Processing and Machine Learning, Akansu et al. (2016).
(book) Asset Management A Systematic Approach to Factor Investing, Ang (2014).
(book) Introduction to Risk Parity and Budgeting, Roncalli (2013).
(book) The Misbehaviour of Markets a Fractal View of Risk, Ruin and Reward, Mandelbrot (2008).
(book) Fooled by Randomness, Taleb (2007).
(book) Red-Blooded Risk: The Secret History of Wall Street, Brown (2011).

Deep Hedging

RL

Reinforcement learning in financial markets - a survey, Fischer (2018).
Optimal Asset Allocation Using Adaptive Dynamic Programming, Neuneier (1996).
Reinforcement Learning for Trading Systems and Portfolios, Moody et al (1998).
PERFORMANCE FUNCTIONS AND REINFORCEMENT LEARNING FOR TRADING SYSTEMS AND PORTFOLIOS, Moody et al (1998).
Reinforcement Learning for Trading, Moody et al. (1999).
Learning to trade via direct reinforcement, Moody (2001).
Moody presentation, Moody (2003).
Stochastic Direct Reinforcement: Application to Simple Games with Recurrence, Moody et al (2003).
Reinforcement Learning for Stochastic Control Problems in Finance, Rao (2018).
Deep Hedging, BUEHLER et al. (2018).
Deep Hedging: Hedging Derivatives Under Generic Market Frictions Using Reinforcement Learning, Buehler (2019).
Model-Free Option Pricing with Reinforcement Learning, Halperin, (2018).
QLBS: Q-Learner in the Black-Scholes(-Merton) Worlds, Halperin (2019).
Agent Inspired Trading Using Recurrent Reinforcement Learning and LSTM Neural Networks, Lu (2017).
A reinforcement learning approach for pricing derivatives, Grassl, (2008).
Regime-switching recurrent reinforcement learning for investment decision making, Maringer (2009).
Automating Transition Functions: A Way To Improve Trading Profits with Recurrent Reinforcement Learning, Zhang (2013).
Algorithm trading using q-learning and recurrent reinforcement learning, Du et al. (2009).
RL for Portfolio Management - Thesis, Filos (2018).
A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem, Jiang (2017).
An Automated fx trading system using adaptive reinforcement learning, Dempster (2004).
Model-based Deep Reinforcement Learning for Dynamic Portfolio Optimization, Yu et al (2019).
Cryptocurrency Portfolio Management with Deep Reinforcement Learning, Jiang et al (2017).
An adaptive portfolio trading system: A risk-return portfolio optimization using recurrent reinforcement learning with expected maximum drawdown, Almahdi et al (2017).
Portfolio Choices with Orthogonal Bandit Learning, Shen et al. (2015)
Risk-Aware Multi-Armed Bandit Problem with Application to Portfolio Selection, Huo (2017)
Application of stochastic recurrent reinforcement learning to index trading , Gorse (2011).
Testing different Reinforcement Learning configurations for financial trading: Introduction and applications, Bertolluzo et al, (2012).
An Investigation into the Use of Reinforcement Learning Techniques within the Algorithmic Trading Domain, Cumming (2015).
Financial Trading as a Game: A Deep Reinforcement Learning Approach, Huang (2018).
Deep Reinforcement Learning for Foreign Exchange Trading, Wang (2019).
Capturing Financial markets to apply Deep Reinforcement Learning, Chakraborty (2019).
Reinforcement Learning For Automated Trading, Necchi (2016).
Towards Inverse Reinforcement Learning for Limit Order Book Dynamics, Roa-Vicens et al (2019).
Multi-Period Trading via Convex Optimization. Boyd et al. (2017).
Optimal Execution of Portfolio Transactions, Chris (1999).
Deep Learning Volatility, Horvath (2019).
Pricing options and computing implied volatilities using neural networks, Liu (2019).
Machine Learning and Option Implied Information, Zheng (2018).
Hedged Monte-Carlo: low variance derivative pricing with objective probabilities, Potter et al. (2000).
Dynamic Replication and Hedging: A Reinforcement Learning Approach, Ritter (2019).
Deep learning approach to hedging Thesis, Kozyra (2018).
Deep Hedging - Presentation, Beuler (2018).
Pricing and Hedging Exotic Options with Monte Carlo Simulations, Perilla Msc (2003).
The Four Horsemen of Machine Learning in Finance, Halperin (2019).
Machine Learning for Trading, Ritter (2017).
Reinforcement Learning With Continuous States, Ritter (2018).
The Usefulness of Reinforcement Learning in Finance - Pres, Ritter (2018).
Generative Bayesian neural network model for risk-neutral pricing of American index options, Jang et al. (2018).
Reinforcement Learning Applied to Option Pricing - MSc, Martin (2014).
Quant GANs:Deep Generation of Financial Time Series, Wiese et al. (2019).
Enhancing Time Series Momentum Strategies Using Deep Neural Networks, Lim et al. (2019).
Deep Learning Approximation for Stochastic Control Problems Han et al. (2016).
Deep Reinforcement Learning for Trading, Zhang (2019).
Model-based Reinforcement Learning for Predictions and Control for Limit Order Books, Wei et al. (2019).
Topics in Dynamic Portfolio Choice Problems -PhD Thesis, Wimonkittiwat, (2013)
Reinforcement-Learning based Portfolio Management with Augmented Asset Movement Prediction States, Ye et al. (2020).

Machine-Learning-Asset-Allocation

DL Online and Bandits

Reinforcement Learning for Stochastic Control Problems in Finance, Rao (2018).
Deep Learning in Finance, Heaton et al. (2015).
Deep Portfolio Theory, Heaton et al. (2016).
Deep Learning in Asset Pricing, Feng et al. (2018).
Adaptive Portfolio Asset Allocation Optimization with Deep Learning Obeidat et al. (2018).
Portfolio Optimisation in an Uncertain world, de Jong (2017).
Empirical Asset Pricing via Machine Learning, Kelly et al (2018).
Real-valued (Medical) Time Series Generation with Recurrent Conditional GANs, Estaban (2017).
Portfolio Choices with Orthogonal Bandit Learning, Shen et al. (2015)
Risk-Aware Multi-Armed Bandit Problem with Application to Portfolio Selection, Huo (2017)
Structured Online Learning with Full and Bandit Information, Johnson PhD Thesis (2016).
Censored exploration and the dark pool problem, Ganchev et al. (2010).
Adaptive Execution:Exploration and Learning of Price Impact, Park et al (2012).
Online Portfolio Selection: A Survey, Li et al (2013).
Conditional time series forecasting with convolutional neural networks, Borovykh et al (2018).
PAGAN: Portfolio Analysis with Generative Adversarial Networks, Mariani et al (2019).
A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress, Arora (2019).
Algorithms for inverse reinforcement learning, Ng at al. (2000).
An Empirical Comparison of Machine Learning Models for Time Series Forecasting, Ahmed et al. (2010).
A Signal Processing Perspective on Financial Engineering , Feng et al. (2017).
Fourier Policy Gradients, Fellows et al. (2018).
Expected Policy Gradients, Kiosek et al. (2018).

Classic Portfolio Selection Materials

Portfolio Selection, Markowitz (1952).
Capital Asset Prices: A Theory of Market Equilibrium under Conditions of Risk, Sharpe (1964).
The Arbitrage Theory and Capital Asset Pricing, Ross (1976).
Continuous Time Finance, Merton (1992).
Hot Spots and Hedges, Litterman (1996).
Global Portfolio Optimisation Black, Litterman (1992).
On the Inverse of the Covariance Matrix in Portfolio Analysis, Stevens (1998).
Improved Estimation of the Covariance Matrix of Stock Returns With an Application to Portfolio Selection, Le Doit et al. (2001).
Portfolio Constraints and the Fundamental Law of Active Management, Clarke et al. (2002).
Optimal Versus Naive Diversification:How Inefficient is the 1/N Portfolio Strategy, DeMiguel et al. (2009).
Enhancing mean-variance portfolio selection by modeling distributional asymmetries Kwong Yee Low et al. (2015).
Risk management under Omega measure, Metel et al. (2015).
From Portfolio Optimisation to Risk Parity Roncalli (2012).
Risk Parity Portfolios with Risk Factors, Roncalli (2014).
Introducing Expected Returns into Risk Parity, Roncalli (2014).
Why Indexing works, Heaton (2015).
Noise Dressing of Financial Correlation Matrices, Laloux (1998).
Random Matrix Theory and Cross-correlations in Global Financial Indices and Local Stock Market Indices, Nobi (2013).
Analysis of cross-correlations between financial markets after the 2008 crisis, Sensoy (2013).
Financial Applications of Random Matrix Theory: a short review, Bouchard et al. (2009).
Do High-Frequency Data Improve High-DimensionalPortfolio Allocations, Hautsche (2013).
Shrinking the Cross Section, Kozak (2018)
Random Matrix Theory, Edelman (2005).
Characteristics are Covariances, Kelly et al (2018).
Arbitrage Portfolios, Kim (2019).
Estimating Latent Asset Pricing Factors, Lettau (2018).
Dissecting Characteristics Nonparametrically, Freyberger (2019).
Dynamics of competition between collectivity and noise in the stock market, Drozdz (2018).
Markowitz Revisited: Single-Period and Multi-Period Mean-Variance Models, Steinback (1999).
Sparse and Stable Markowitz Portfolios, De Moi et al.
Portfolio Choice Problems, Brandt
Variable Selection for Portfolio Choice, Ait-Sahalia and Brandt (2001).
Parametric Portfolio Policies:Exploiting Characteristics in the Cross-Section of Equity Returns, Brandt (2004).
Dynamic Portfolio Selection by Augmenting the Asset Space, Brandt and Santa Clara (2004).
Testing Portfolio Efficiency with Conditioning Information, Ferson and Siegel (2007).

Machine Learning based Portfolio Selection

Sparse and Stable Markowitz Portfolios, Brodie et al. (2009).
A generalised approach to Portfolio Optimisation, DeMiguel (2009).
Optimal Portfolio Selection using regularisation, Carrasco et al. (2012).
Lp Regularised Portfolio Optimisation, Caccioli et al. (2014).
Bayesian Regularisation from Tikhonov to Horsehoe, Polson (2019).
Construction, management, and performance of sparse Markowitz portfolios, Henriques et al. (2014).
Portfolio Selection using Tikhonov Filtering to estimate the Covariance Matrix, Park (2010).
Sparse recovery under Matrix Uncertainty, Rosenbaum (2008).
Financial Applications of Gaussian Processes and Bayesian Optimization, Gonzales et al. (2019).
Applications of Gaussian Processes in Finance, Anonymous under review (2019).
Portfolio Optimisation Models, Valle PhD Thesis (2013).
Optimal Financial Portfolio Selection, Derpanopoulos, (2018).
Markowitz Minimum Variance Portfolio Optimization using New Machine Learning Methods, Awoye PhD Thesis (2016).
Sparse Portfolio Selection via the sorted L1 norm, Kremer et al. (2017).
Machine Learning and Portfolio Optimization, Ban et al (2016).
Reducing Estimation Risk in Mean-Variance Portfolios with Machine Learning, Kinn (2018).
Building Diversified Portfolios that outperform out of sample, DeLopez (2016).
A Constrained Hierarchical Risk Parity Algorithm with Cluster-based Capital Allocation, Pfitzinger et al. (2017).
Hierarchical Clustering based Asset Allocation, Raffinot (2017).
Portfolio Selection via Subset Resampling, Shen (2017).
Kernel Principal Component Analysis and Random Matrix Theory in Portfolio Selection, Peng et al (2017).
Asset Allocation in Finance: A Bayesian Perspective, Jacquier et al (2011).
Estimation of Covariance Matrices for Portfolio Optimization using Gaussian Processes, Nirwan et al. (2018).
Bayesian mean-variance analysis: Optimal portfolio selection under parameter uncertainty, Bauder (2018).
Asset Allocation, Economic Cycles and Machine Learning,Raffinot PhD Thesis (2018).

Canonical Correlation Analysis

Comparison of Penalty Functions for Sparse Canonical Correlation Analysis, Chalise et al. (2012).
Comparison of Canonical Variate Analysis and Principal Component Analysis on 422 descriptive sensory studies, Peltier et al. (2014).
Enhancing canonical variate analysis by taking the scaling effect into account, Peltier et al. (2017).
Comparison of variants of canonical correlation analysis and partial least squares for combined analysis of MRI and genetic data, Grellmann et al. (2014).
Multiway canonical correlation analysis of brain data, Chevaigne et al. (2018).
Kernel Canonical Variate Analysis for Nonlinear Dynamic Process Monitoring, Samuel et al. (2015).
A Least Squares Formulation for Canonical Correlation Analysis, Ji et al.
Inference for Robust Canonical Variate Analysis, Aest et al.
Correlation Analysis 1, Tibshirani, (2013).
Correlation Analysis 2, Tibshirani, (2013).
Optimisation of Trading Strategies using Parameterised Decision Rules, Towers et al. (1998).
On the Equivalence Between Canonical Correlation Analysis and Orthonormalized Partial Least Squares, Sun et al.
A Least Squares Formulation for Canonical Correlation Analysis, Sun et al.
A Probabilistic Interpretation of Canonical Correlation Analysis, Bach et al. (2005).
Moments and Absolute Moments of the Normal Distribution, Winkelbauer, (2014).
Large Scale Canonical Correlation Analysis with Iterative Least Squares, Lu, (2014).
Sparse canonical correlation analysis from a predictive point of view, Wilms et al. (2015).
Variational Analysis of the Ky Fan k-norm, Ding, (2015).
Canonical correlation analysis of high-dimensional data with very small sample support, Song, (2016).
FDR-Corrected Sparse Canonical Correlation Analysis with Applications to Imaging Genomics, Gossman, (2018).
Acoustic Feature Learning via Deep Variational Canonical Correlation Analysis, Tang et al. (2018).
A Tutorial on Canonical Correlation Methods, Uurtio et al. (2018).
Change Point Analysis of Correlation in Non-stationary Time Series, Dette, (2018).
Pricing without martingale measure, Batiste, (2018).
Canonical Analysis, A review with applications in Ecology, Gittins ,(1986).
The Estimation of Prediction Error: Covariance Penalties and Cross-Validation, Efron.
On the Moreau-Yosida regularization of the vector k-norm related functions, Wu et al. (2011).
Gen-Oja: A Simple and Efficient Algorithm for Streaming Generalized Eigenvector Computation, Bhatia et al. (2018).
Robust sparse canonical correlation analysis, Wilms et al. (2016).
The Geometry of Canonical Variant Analysis, Campbell et al. (1981).
Estimation and Hypothesis Testing of Cointegration Vectors in Gaussian Vector Autoregressive Models, Johannsen, (1991).
Sufficient Canonical Correlation Analysis, Guo et al. (2016).
Some properties of canonical correlations and variates in infinite dimensions, Cupidon et al. (2007).
High Dimensional Asymptotic expansions for the distributions of canonical correlations, Fujikoshi el al. (2008).
A Canonical Correlation Analysis of Intelligence and Executive Functioning, Davis et al. (2011).
A Closed Testing Procedure for Canonical Correlations, Calinski et al. (2006).
A Comparison of Some Tests for Determining the Number of Nonzero Canonical Correlations, Calinski et al. (2006).
A New Perspective On Sequential Testing Procedures In Canonical Analysis: A Monte Carlo Evaluation, Mendoza et al. (1978).
A modification of canonical variates analysis to handle highly collinear multivariate data, Norgaard et al. (2006).
Consistency of log-likelihood-based information criteria for selecting variables in high-dimensional canonical correlation analysis under nonnormality, Fukui, (2015).
An analysis of the total least squares problem, Golub, (1980).
Bayesian Canonical Correlation Analysis, Klami et al. (2013).
Canonical Variate Analysis and Related Methods with Longitudinal Data, Beaghan PhD thesis (1997).
Dynamic Portfolio Selection by Augmenting the Asset Space, Brandt, (2006).
Canonical Correlation Clarified by Singular Value Decomposition, Press, (2011).
Chapter 8, Multivariate Data Analysis, Anderson et al. (1998).
Multivariate Data Analysis, Anderson et al. (1998).
NCSS Stat software explanation
Canonical Correlation Analysis, Weenink, (2003).
CCA for Optimal Signal Combination in Algorithmic Trading, Firoozye, (2019).
Canonical Correlation: A Tutorial, Borga, (2001).
Convex Estimation of Cointegrated VAR Models by a Nuclear Norm Penalty, Signoretto et al. (2012).
A modified information Criterion for Cointegration Tests based on a VAR approximation, Qu et al. (2007).
Convergence analysis of kernel Canonical Correlation Analysis: Theory and practice, Hardoon et al. (2009).
Canonical Variates Analysis - ch 5 book extract
Extreme Canonical Correlations and high dimensional cointegration analysis, Onatski et al. (2018).
DynOpt: Incorporating Dynamics into Mean-Variance Portfolio Optimization, Signoretto.
The efficient use of conditioning information in Portfolios, Ferson et al. (2001).
Correlations and canonical forms of bivariate distributions, Lancaster, (1962).
Convex optimization methods for dimension reduction and coefficient estimation in multivariate linear regression, Lu et al. (2008).
Statistical Consistency of Kernel Canonical Correlation Analysis, Fukumizu et al. (2007).
Generalized canonical analysis based on optimizing matrix correlations and a relation with IDIOSCAL, Kiers et al. (1991).
Determination of vector error correction models in high dimensions, Liang et al. (2019).
Partial Sparse Canonical Correlation Analysis (PSCCA) for Population Studies in Medical Imagine, Dhillon et al.
The Convex Algebraic Geometry of rank minimization, Parrilo, (2009).
Fitting method based on correlation maximization: Applications in space physics, Livadiotis et al. (2012).
Kernel Canonical Correlation Analysis, Welling.
Controlling Singular Values with Semidefinite Programming, Kovalsky et al. (2014).
Lecture 7: Analysis of Factors and Canonical Correlations, Thulin.
Guaranteed Minimum-Rank Solutions of Linear Matrix Equations via Nuclear Norm Minimization, Recht et al. (2008).
L1-Regularized Multiway Canonical Correlation Analysis for SSVEP-based BCI, Zhang et al. (2013).
Canonical Correlation Analysis - ETFs, Mathematica journal.
Stochastic PCA with L1 and L2 Regularization, Mianji (2018).
Stochastic PCA with L1 and L2 Regularization - supplement, Mianji (2018).
Iterative Reweighted Algorithms for Matrix Rank Minimization, Mohan et al.
A Rank Minimization Heuristic with Application to Minimum Order System Approximation, Fazel et al.
Ridge-Penalty Regularization for Kernel-CCA, Rieter et al.
Canonical Correlation Forests, Rainforth, (2015).
Constrained Singular Value Decomposition, Zollman et al. (2009).
Large-Scale Convex Minimization with a Low-Rank Constraint, Shalev-Schwartz et al. (2011).
Sparse Canonical Correlation Analysis: New Formulation and Algorithm, Chu et al.
Sparse CCA via Precision Adjusted Iterative Thresholding, Choi et al. (2017).
Generalised Canonical Regression, Estrella, (2007).
Statistics Methods and Applications - Book, Lewicki.
Structured Sparse Canonical Correlation Analysis, Chen et al. (2012).
A singular value thresholding algorithm for matrix completion, Cai et al.
The Geometry Of Kernel Canonical Correlation Analysis,Kuss et al. (2003).
Variants of Canonical Correlation Analysis, Choi (2016).
Variable selection and interpretation in canonical correlation analysis, Al-Khandari (1997).
On the Variance, Reliability and Importance of Canonical Variates, Momirovic.
The asymptotic distribution of canonical correlations and variates in cointegrated models, Anderson (2000).
Canonical correlation analysis based on sparse penalty and through rank-1 matrix approximation, El Bey et al. (2015).
Canonical Correlation Analysis - Slides, Stieiger.
Comparison of Penalty Functions for Sparse Canonical Correlation Analysis, Chalise et al. (2012).
Use of Smoothly Clipped Absolute Deviation (SCAD) Penalty on Sparse Canonical Correlation Analysis, Very short paper, some R.
Developing Long/Short ETF Strategies - CCA, Kinlay.
Identifying Small Mean Reverting Portfolios, Aspremont (2008).
Sparse Canonical Correlation Analysis, Hardoon (2010).
A kernel method for canonical correlation analysis, Akaho, (2007).
Avoiding Backtesting Overfitting by Covariance-Penalties: An Empirical Investigation of the Ordinary and Total Least Squares cases,Koshiyama et al. (2019).
Deep Canonical Correlation Analysis, Andrew et al. (2013).
Deep Generalized Canonical Correlation Analysis, Benton et al. (2017).
Multiview Canonical Correlation Analysis, Rupnik (2016).
Multiview Canonical Correlation Analysis - Phd Thesis, Rupnik (2016).
Generative Adversarial Networks for FinancialTrading Strategies Fine-Tuning and Combination, Koshiyama (2019).
Temporal Kernel CCA and its Application in MultimodalNeuronal Data Analysis, Bierman et al. (2009)

Sample Efficient

Compressed Conditional Mean Embeddings for Model-Based Reinforcement Learning, Lever et al. (2016).
ACCME: Actively Compressed Conditional Mean Embeddings for Model-Based Reinforcement Learning, Stafford et al. (2018).
Kernel Mean Embedding of Distributions: A Review and Beyond, Muandet et al. (2016).
Learning via Hilbert Space Embedding of Distributions - PhD Thesis, Le Song (2008).
Kernel Embeddings of Conditional Distributions, Le Song et al, (2013).
Modelling Policies in MDPs in Reproducing Kernel Hilbert Space, Lever et al (2015).
Kernel-Based Reinforcement Learning, Ormoneit et al. (2002).
Practical Kernel-Based Reinforcement Learning, Barreto et al. (2014).
Reinforcement Learning using Kernel-Based Stochastic Factorization, Barreto et al. (2011).
Conditional mean embeddings as regressors, Grunewalder et al. (2012).
Modelling transition dynamics in mdps with rkhs embeddings, Grunewalder et al. (2012).
Kernel-Based Reinforcement Learning on Representative States, Kveton et al. (2012).
Least-Squares Policy Iteration, Lagoudakis et al. (2003).
An Analysis of Linear Models, Linear Value-Function Approximation, and Feature Selection for Reinforcement Learning, Parr et al. (2008).
Pseudo-MDPs and Factored Linear Action Models, Yao et al. (2008).
Approximate policy iteration: A survey and some new methods, Bertsekas (2012).
Continuous Deep Q-Learning with Model-based Acceleration,Gu et al. (2016).
Learning of Non-Parametric Control Policies with High-Dimensional State Features, Van Hoof et al. (2015).
Learning Transition Dynamics in MDPs with Online Regression and Greedy Feature Selection, Lever et al. (2015).

sijia1120 / deep-hedging Goto Github PK

deep-hedging's Introduction

Keep static for the time being - shifting to a survey paper on RL

Table of contents

Deliverables for transfer on PhD

Quote

Problems

Key Ideas

Books

Deep Hedging

RL

Machine-Learning-Asset-Allocation

DL Online and Bandits

Classic Portfolio Selection Materials

Machine Learning based Portfolio Selection

Canonical Correlation Analysis

Sample Efficient

deep-hedging's People

Contributors

Stargazers

Recommend Projects

Recommend Topics

Recommend Org