paper

Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective

Fabian Falck, Ziyu Wang, Chris Holmes

Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective

Name: Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective
Author: Fabian Falck, Ziyu Wang, Chris Holmes

Fabian Falck, Ziyu Wang, Chris Holmes

Paper2024-06-02English

Start Reading

machine learning financearxiv

Description

In-context learning (ICL) has emerged as a particularly remarkable characteristic of Large Language Models (LLM): given a pretrained LLM and an observed dataset, LLMs can make predictions for new data points from the same distribution without fine-tuning. Numerous works have postulated ICL as approximately Bayesian inference, rendering this a natural hypothesis. In this work, we analyse this hypothesis from a new angle through the martingale property, a fundamental requirement of a Bayesian lear...