Optimal Attention Temperature Enhances In-Context Learning under Distribution Shift | Arena Library | Arena