Skip to main content

Home Hot Groups Market Me

Disentangling Exploration from Exploitation | Arena Library | Arena

Rankings Groups Feed Market Hot

Home
Library
Disentangling Exploration from Exploitation

paper

Disentangling Exploration from Exploitation

Alessandro Lizzeri, Eran Shmaya, Leeat Yariv

Disentangling Exploration from Exploitation

Alessandro Lizzeri, Eran Shmaya, Leeat Yariv

paper2024-04-29English

Description

Starting from Robbins (1952), the literature on experimentation via multi-armed bandits has wed exploration and exploitation. Nonetheless, in many applications, agents' exploration and exploitation need not be intertwined: a policymaker may assess new policies different than the status quo; an investor may evaluate projects outside her portfolio. We characterize the optimal experimentation policy when exploration and exploitation are disentangled in the case of Poisson bandits, allowing for general news structures. The optimal policy features complete learning asymptotically, exhibits lots of persistence, but cannot be identified by an index a la Gittins. Disentanglement is particularly valuable for intermediate parameter values.

Similar Books

Quantitative mode stability for the wave equation on the Kerr-Newman spacetime

Quantitative mode stability for the wave equation on the Kerr-Newman spacetime

Risk-Aware Objective-Based Forecasting in Inertia Management

Haipeng Zhang, Ran Li, Yan Chen, Zhongda Chu, Mingyang Sun

Risk-Aware Objective-Based Forecasting in Inertia Management

Chainalysis: Geography of Cryptocurrency 2023

Chainalysis: Geography of Cryptocurrency 2023

Periodicity in Cryptocurrency Volatility and Liquidity

Peter Reinhard Hansen, Chan Kim, Wade Kimbrough

Periodicity in Cryptocurrency Volatility and Liquidity

Impact of Geometric Uncertainty on the Computation of Abdominal Aortic Aneurysm Wall Strain

Saeideh Sekhavat, Mostafa Jamshidian, Adam Wittek, Karol Miller

Impact of Geometric Uncertainty on the Computation of Abdominal Aortic Aneurysm Wall Strain

Simulation-based Bayesian inference with ameliorative learned summary statistics -- Part I

Getachew K. Befekadu

Simulation-based Bayesian inference with ameliorative learned summary statistics -- Part I

Home Hot Groups Market Me