Regret-Optimized Portfolio Enhancement through Deep Reinforcement Learning and Future Looking Rewards

Daniil Karzanov, Rubén Garzón, Mikhail Terekhov, Caglar Gulcehre, Thomas Raffinot

Paper | 2025-02-04 | English

deep learning portfolio | arxiv

Description

This paper introduces a novel agent-based approach for enhancing existing portfolio strategies using Proximal Policy Optimization (PPO). Rather than focusing solely on traditional portfolio construction, our approach aims to improve an already high-performing strategy through dynamic rebalancing driven by PPO and Oracle agents. Our target is to enhance the traditional 60/40 benchmark (60% stocks, 40% bonds) by employing the Regret-based Sharpe reward function. To address the impact of transactio...
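To make the idea of a regret measured in Sharpe terms more concrete, the sketch below compares a fixed 60/40 allocation with the best allocation in hindsight over a short window. It is only an illustration under stated assumptions: the description does not give the paper's reward formula, so the function names (`sharpe`, `portfolio_returns`, `sharpe_regret`), the zero risk-free rate, the candidate weight grid, and the sample returns are all hypothetical, and transaction costs are ignored.

```python
import math


def sharpe(returns):
    """Per-period Sharpe ratio, assuming a zero risk-free rate.

    Returns 0.0 when the series has no volatility.
    """
    n = len(returns)
    mean = sum(returns) / n
    variance = sum((r - mean) ** 2 for r in returns) / n
    std = math.sqrt(variance)
    return mean / std if std > 0 else 0.0


def portfolio_returns(stock, bond, w_stock):
    """Returns of a two-asset portfolio held at a constant stock weight."""
    return [w_stock * s + (1.0 - w_stock) * b for s, b in zip(stock, bond)]


def sharpe_regret(stock, bond, w_agent, candidates):
    """Gap between the best candidate's Sharpe ratio and the agent's.

    Assumption: the 'oracle' is the best fixed weight in hindsight among
    `candidates`; this is an illustrative stand-in, not the paper's definition.
    """
    best = max(sharpe(portfolio_returns(stock, bond, w)) for w in candidates)
    return best - sharpe(portfolio_returns(stock, bond, w_agent))


# Made-up per-period returns, for illustration only.
stock = [0.02, -0.01, 0.03, 0.01]
bond = [0.005, 0.004, 0.006, 0.005]
candidates = [0.0, 0.2, 0.4, 0.6, 0.8, 1.0]

# Regret of the 60/40 benchmark relative to the best candidate in hindsight.
benchmark_regret = sharpe_regret(stock, bond, 0.6, candidates)
print(benchmark_regret)
```

By construction the regret is never negative when the agent's weight is one of the candidates, and it is zero for the candidate that achieves the best Sharpe ratio; a learning agent would be rewarded for driving this gap down.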