Towards Robust Evaluation of STEM Education: Leveraging MLLMs in Project-Based Learning | Arena Library | Arena