Investigating Data Pruning for Pretraining Biological Foundation Models at Scale Library | Arena