jankrepl / deepdow
Portfolio optimization with deep learning.
Home Page: https://deepdow.readthedocs.io
License: Apache License 2.0
And make sure all operations inside propagate it. There are problems with torch.eye inside of run.
Would be cool to create an allocator that just learns a single weight for each asset. To make sure all the weights sum up to one, one can consider only positive weights and then divide them by their sum. A sketch is below.
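A minimal sketch of such an allocator (the class name and the use of softplus are assumptions, not deepdow API): positivity is enforced first, then the weights are divided by their sum so they add up to one.

import torch
import torch.nn as nn

class SingleWeightAllocator(nn.Module):
    """Learn a single raw weight per asset, independent of the input features."""

    def __init__(self, n_assets):
        super().__init__()
        self.raw = nn.Parameter(torch.zeros(n_assets))

    def forward(self, x):
        # x: (n_samples, n_channels, lookback, n_assets); only the batch size is used
        positive = nn.functional.softplus(self.raw)   # force w > 0
        w = positive / positive.sum()                 # force sum(w) == 1
        return w.unsqueeze(0).repeat(x.shape[0], 1)   # (n_samples, n_assets)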
For deterministic benchmarks, metrics can just be copied from the previous step rather than recomputed.
Apart from generating iid sequences, one can do a lot of different things. One just needs to be careful not to pull in too many external dependencies.
Additionally, Markowitz benchmarks.
Unfortunately, installing Zipline is a nightmare, and it only supports Python 3.5. So maybe investigate other open-source backtesters.
These days I have been using your deepdow package to do some experiments on portfolio optimization. Thanks for your great work!
But I have a problem here. I want to add a turnover rate constraint to the optimization. To achieve that, every time the network runs during training I have to keep the weights that have been calculated, so that in the next round I can make sure the new weights won't be too far from the previous ones.
So I want to ask whether there is some way I can save the weights each time the network calculates them during training.
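One way to do this without touching deepdow internals is a plain PyTorch forward hook; the sketch below (variable names assumed) records the weights produced at every forward pass, so the previous allocation is available for a turnover penalty.

weight_history = []

def save_weights(module, inputs, output):
    # `output` is the weight tensor the network returns, shape (batch_size, n_assets)
    weight_history.append(output.detach().clone())

handle = network.register_forward_hook(save_weights)
# ... train as usual; weight_history[-1] holds the most recent weights ...
# handle.remove()  # detach the hook when done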
raw_to_Xy appears to handle regular gaps in the data (e.g. weekends) but cannot handle irregular gaps such as holidays.
When fed trading data similar to the example at https://deepdow.readthedocs.io/en/latest/source/data_loading.html but covering an entire trading year, it gets out of sync on every holiday, e.g. a Monday that would typically trade but does not because of a holiday such as Jan 20, 2020.
The result is that the assertion assert timestamps[0] == raw_df.index[lookback] fails.
This, and likely other data formatting issues, causes an error when executing history = run.launch(30), namely RuntimeError: mat1 and mat2 shapes cannot be multiplied.
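One possible workaround (a sketch, not a deepdow feature): reindex the raw DataFrame onto a holiday-aware business-day calendar before calling raw_to_Xy, so that days like Jan 20, 2020 are not expected to exist.

from pandas.tseries.holiday import USFederalHolidayCalendar
from pandas.tseries.offsets import CustomBusinessDay

us_bday = CustomBusinessDay(calendar=USFederalHolidayCalendar())
raw_df = raw_df.asfreq(us_bday)  # raw_df as in the data_loading docs example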
Some ideas below:
- visualize model: would use some helper function that inputs a network and a dataloader and returns a DataFrame (see the sketch below)
- weight_image
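A sketch of what the helper could look like (the function name and the batch structure (X, y, timestamps, asset_names) are assumptions about the dataloader):

import pandas as pd
import torch

def weights_as_dataframe(network, dataloader):
    network.eval()
    frames = []
    with torch.no_grad():
        for X, y, timestamps, asset_names in dataloader:
            w = network(X)  # (batch_size, n_assets)
            frames.append(pd.DataFrame(w.cpu().numpy(), index=timestamps, columns=asset_names))
    return pd.concat(frames)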
Currently, deepdow.utils.raw_to_Xy does a lot of magic inside and outputs only the bare minimum for training:
- X
- timestamps
- y
- asset_names
- indicators
Would be nice to have some debug mode that returns more.
Currently one cannot just do -SomeLoss(). Of course, it could be hacked by doing (-1) * SomeLoss(). We want to implement the first syntax via the __neg__ dunder (unary minus).
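A minimal sketch of how the unary minus could be implemented (class names are illustrative, not deepdow's actual loss API):

class NegatedLoss:
    def __init__(self, loss):
        self.loss = loss

    def __call__(self, weights, y):
        return -self.loss(weights, y)

class SomeLoss:
    def __call__(self, weights, y):
        return (weights ** 2).mean()  # placeholder loss

    def __neg__(self):
        return NegatedLoss(self)

With this, -SomeLoss() returns a NegatedLoss whose value is the sign-flipped original.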
Complete preprocessor
Implement some dunders...
Would be nice to show how deepdow is able to directly learn, or have a network predict, any of the input variables of deepdow.layers.NumericalMarkowitz.
It might be a good idea to use real data (i.e. yfinance), however one needs to be careful about the example running too long (both CI and readthedocs need to run it).
I'm able to get the out-of-the-box examples to execute successfully (getting_started and iid) when using the generated data, but when using different toy datasets I get an AssertionError in the cvxpy module.
/opt/conda/lib/python3.6/site-packages/cvxpy/cvxcore/python/canonInterface.py in nonzero_csc_matrix(A)
162 # this function returns (rows, cols) corresponding to nonzero entries in
163 # A; an entry that is explicitly set to zero is treated as nonzero
--> 164 assert not np.isnan(A.data).any()
165
166 # scipy drops rows, cols with explicit zeros; use nan as a sentinel
AssertionError:
Steps to reproduce:
from deepdow.nn import BachelierNet

# X is the toy dataset tensor of shape (n_samples, n_channels, lookback, n_assets)
n_channels = X.shape[1]
lookback = X.shape[2]
n_assets = X.shape[3]
max_weight = 0.5
hidden_size = 32
network = BachelierNet(n_channels, n_assets, hidden_size=hidden_size, max_weight=max_weight)
print(network)
The same error occurs even when reducing channels to 1, increasing the number of samples, and keeping lookback, gap, horizon small (5, 0, 1).
Using torch.distributions: probably not possible, since we would encounter zero gradients.
In the SoftmaxAllocator the nonnegativity constraint w >= 0 is missing.
4 options:
Currently, it is not clear how to contribute to the project.
It would be nice to have a benchmark that is just some predefined portfolio. One would construct it by passing all the weights.
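A sketch of such a benchmark, assuming deepdow.benchmarks.Benchmark is the base class and that benchmarks are callables mapping the feature tensor to weights:

import torch
from deepdow.benchmarks import Benchmark

class FixedWeights(Benchmark):
    def __init__(self, weights):
        self.weights = torch.as_tensor(weights, dtype=torch.float32)  # (n_assets,)

    def __call__(self, x):
        # x: (n_samples, n_channels, lookback, n_assets)
        return self.weights.repeat(x.shape[0], 1)  # (n_samples, n_assets)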
Design a new network with a fixed input size - major speedups probably.
Show that sliding the window by one does not change the allocation much because of the underlying 1D convolutions.
The same idea as SoftmaxAllocator, but with additional features that are quite useful for portfolio optimization.
Inspired by https://locuslab.github.io/2019-10-28-cvxpylayers/; however, implementing it via cvxpylayers (as written in the blog) is not the most efficient way to do it.
Currently, we implement one explainability algorithm in deepdow.explain.gradient_wrt_input. The problem is that we do not restrict the values the input can have. One solution would be to implement some projection/clipping logic that takes place after each optimizer step and thus forces the values to be in a given range (sketched below).
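A sketch of the projection idea (the stand-in network, bounds, and objective are all assumptions): after every optimizer step on the input, clip it back into the allowed range.

import torch

network = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Softmax(dim=1))  # stand-in for a trained network
x = torch.zeros(1, 3, 10, 5, requires_grad=True)  # (n_samples, n_channels, lookback, n_assets)
optimizer = torch.optim.Adam([x], lr=1e-2)

for _ in range(50):
    optimizer.zero_grad()
    loss = -network(x)[0, 0]  # e.g. maximize the first output
    loss.backward()
    optimizer.step()
    with torch.no_grad():
        x.clamp_(-0.1, 0.1)  # the projection/clipping step keeping the input in range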
Would be nice for the network to do translations along the time dimension (possibly independently for each asset): https://pytorch.org/docs/stable/nn.functional.html#grid-sample
Scaling is potentially also useful; see the sketch below.
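A sketch of a time translation via grid_sample (the shift value is arbitrary); a single affine theta shifts all assets by the same amount, while per-asset shifts would require building a custom grid instead:

import torch
import torch.nn.functional as F

x = torch.randn(2, 1, 20, 5)  # (n_samples, n_channels, lookback, n_assets)
shift = 0.2  # translation along the lookback axis, in normalized [-1, 1] coordinates
theta = torch.tensor([[1.0, 0.0, 0.0],
                      [0.0, 1.0, shift]]).repeat(x.shape[0], 1, 1)
grid = F.affine_grid(theta, list(x.shape), align_corners=False)
x_shifted = F.grid_sample(x, grid, align_corners=False)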
Papers
Can be used for mlflow logging
With the convex optimization it can happen that the solver does not find a solution, which then results in weights not summing up to one (sometimes drastically different).
Possible solutions: one is sketched below.
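One possible solution (names assumed): detect rows whose weights do not sum to one and fall back to an equally weighted portfolio.

import torch

def sanitize_weights(w, atol=1e-4):
    # w: (n_samples, n_assets) produced by the allocator
    sums = w.sum(dim=1)
    bad = ~torch.isclose(sums, torch.ones_like(sums), atol=atol)
    w = w.clone()
    w[bad] = 1.0 / w.shape[1]  # fall back to 1/N weights where the solver failed
    return w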
Currently all metrics input and output torch.Tensors... limiting!? Or at least give the user an option to do otherwise.
Rather than reinventing the wheel, one could just use torchvision transforms: https://pytorch.org/docs/stable/torchvision/transforms.html
- Compose (already recreated in deepdow)
- RandomApply - apply all transforms with some probability
- RandomChoice - apply exactly one, chosen at random
- RandomOrder - apply all, but in random order
- 1D warping - affine would be a special case; one could in theory have any increasing function (derivative > 0)
- RandomAffine - scaling and translation along the y axis (lookback) could be a brilliant augmentation for deepdow tensors
- RandomHorizontalFlip - flipping the time flow, probably super confusing if one wants to pick up mean reversion
- Normalize - a must, together with some helper function that computes means and stds on the training set (see the sketch below); however, it still assumes that the time series is stationary
- RandomErasing - similar to the current Dropout, however it erases contiguous regions
Additionally, torchvision might also be helpful in other tasks (see #39).
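A sketch of the helper mentioned under Normalize (function name assumed): compute per-channel statistics on the training set and use them as a Normalize-style transform; note it inherits the stationarity assumption.

import torch

def channel_stats(X_train):
    # X_train: (n_samples, n_channels, lookback, n_assets)
    means = X_train.mean(dim=(0, 2, 3), keepdim=True)
    stds = X_train.std(dim=(0, 2, 3), keepdim=True)
    return means, stds

means, stds = channel_stats(torch.randn(100, 2, 30, 10))
normalize = lambda x: (x - means) / stds  # apply to any tensor with matching channels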
The clear downside is introducing yet another dependency. Additionally, one might argue that it is better to go all the way and use imgaug, albumentations, ...
Other nonvision augmentations:
Assert the Python version via python_requires. It should correspond to what is tested (.travis.yml).
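For example, in setup.py (the exact lower bound is an assumption and should mirror .travis.yml):

from setuptools import setup

setup(
    name="deepdow",
    python_requires=">=3.6",
    # ...
)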
Currently, there are a lot of typos, poorly written or unfinished sentences, etc...
Write a torch.utils.data.Dataset subclass such that it randomly subsamples both samples and assets; it seems like cvxpylayers does not scale well.
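A generic sketch (not deepdow's API) of the asset-subsampling part; random subsampling of samples can then be handled by a standard torch.utils.data.RandomSampler or SubsetRandomSampler on top of it.

import torch
from torch.utils.data import Dataset

class RandomSubsampleDataset(Dataset):
    def __init__(self, X, y, n_assets_sub):
        self.X, self.y = X, y  # both shaped (n_samples, ..., n_assets)
        self.n_assets_sub = n_assets_sub

    def __len__(self):
        return len(self.X)

    def __getitem__(self, ix):
        keep = torch.randperm(self.X.shape[-1])[: self.n_assets_sub]
        return self.X[ix][..., keep], self.y[ix][..., keep]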