Systemic risk measures such as CoVaR, CoES and MES are widely-used in finance, macroeconomics and by regulatory bodies. Despite their importance, we show that they fail to be elicitable and identifiable. This renders forecast comparison and validation, commonly summarised as `backtesting', impossible. The novel notion of \emph{multi-objective elicitability} solves this problem. Specifically, we propose Diebold--Mariano type tests utilising two-dimensional scores equipped with the lexicographic o...