paper

Continuous-Time Mean-Variance Portfolio Selection: A Reinforcement Learning Framework

Haoran Wang, Xun Yu Zhou

Continuous-Time Mean-Variance Portfolio Selection: A Reinforcement Learning Framework

Name: Continuous-Time Mean-Variance Portfolio Selection: A Reinforcement Learning Framework
Author: Haoran Wang, Xun Yu Zhou

Haoran Wang, Xun Yu Zhou

paper2019-04-25English

Start Reading

deep learning portfolioarxiv

Description

We approach the continuous-time mean-variance (MV) portfolio selection with reinforcement learning (RL). The problem is to achieve the best tradeoff between exploration and exploitation, and is formulated as an entropy-regularized, relaxed stochastic control problem. We prove that the optimal feedback policy for this problem must be Gaussian, with time-decaying variance. We then establish connections between the entropy-regularized MV and the classical MV, including the solvability equivalence a...