On-Policy Robot Imitation Learning from a Converging Supervisor

Ashwin Balakrishna, Brijen Thananjeyan, Jonathan Lee, Felix Li, Arsh Zahed

paper | 2019-07-08 | English

machine learning finance arxiv

Description

Existing on-policy imitation learning algorithms, such as DAgger, assume access to a fixed supervisor. However, there are many settings where the supervisor may evolve during policy learning, such as a human performing a novel task or an improving algorithmic controller. We formalize imitation learning from a "converging supervisor" and provide sublinear static and dynamic regret guarantees against the best policy in hindsight with labels from the converged supervisor, even when labels during le...

Similar Books

Quantitative mode stability for the wave equation on the Kerr-Newman spacetime

Risk-Aware Objective-Based Forecasting in Inertia Management

Haipeng Zhang, Ran Li, Yan Chen, Zhongda Chu, Mingyang Sun

Chainalysis: Geography of Cryptocurrency 2023

Periodicity in Cryptocurrency Volatility and Liquidity

Peter Reinhard Hansen, Chan Kim, Wade Kimbrough

Impact of Geometric Uncertainty on the Computation of Abdominal Aortic Aneurysm Wall Strain

Saeideh Sekhavat, Mostafa Jamshidian, Adam Wittek, Karol Miller

Simulation-based Bayesian inference with ameliorative learned summary statistics -- Part I

Getachew K. Befekadu
