PARROT: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs
Yusuf Çelebi, Özay Ezerceli, Mahmoud El Hussieni