Evans, J., D'Souza, J., & Auer, S. Large Language Models as Evaluators for Scientific Synthesis [Data set]. https://aclanthology.org/2024.konvens-main.1/