Self-supervised Learning of Contextualized Local Visual Embeddings | Arena Library | Arena