Skip to main content

Home Hot Groups Market Me

Benchmarking Batch Deep Reinforcement Learning Algorithms | Arena Library | Arena

Rankings Groups Feed Market Hot

Home
Library
Benchmarking Batch Deep Reinforcement Learning Algorithms

paper

Benchmarking Batch Deep Reinforcement Learning Algorithms

Scott Fujimoto, Edoardo Conti, Mohammad Ghavamzadeh, Joelle Pineau

Benchmarking Batch Deep Reinforcement Learning Algorithms

Scott Fujimoto, Edoardo Conti, Mohammad Ghavamzadeh, Joelle Pineau

paper2019-10-03English

deep learning portfolioarxiv

Description

Widely-used deep reinforcement learning algorithms have been shown to fail in the batch setting--learning from a fixed data set without interaction with the environment. Following this result, there have been several papers showing reasonable performances under a variety of environments and batch settings. In this paper, we benchmark the performance of recent off-policy and batch reinforcement learning algorithms under unified settings on the Atari domain, with data generated by a single partial...

Similar Books

Quantitative mode stability for the wave equation on the Kerr-Newman spacetime

Quantitative mode stability for the wave equation on the Kerr-Newman spacetime

Risk-Aware Objective-Based Forecasting in Inertia Management

Haipeng Zhang, Ran Li, Yan Chen, Zhongda Chu, Mingyang Sun

Risk-Aware Objective-Based Forecasting in Inertia Management

Chainalysis: Geography of Cryptocurrency 2023

Chainalysis: Geography of Cryptocurrency 2023

Periodicity in Cryptocurrency Volatility and Liquidity

Peter Reinhard Hansen, Chan Kim, Wade Kimbrough

Periodicity in Cryptocurrency Volatility and Liquidity

Impact of Geometric Uncertainty on the Computation of Abdominal Aortic Aneurysm Wall Strain

Saeideh Sekhavat, Mostafa Jamshidian, Adam Wittek, Karol Miller

Impact of Geometric Uncertainty on the Computation of Abdominal Aortic Aneurysm Wall Strain

Simulation-based Bayesian inference with ameliorative learned summary statistics -- Part I

Getachew K. Befekadu

Simulation-based Bayesian inference with ameliorative learned summary statistics -- Part I

Home Hot Groups Market Me