Arshaan Nazir, Thadaka Kalyan Chakravarthy, David Amore Cecchini, Thadaka Kalyan Chakravarthy, Rakshit Khajuria, Prikshit Sharma, Ali Tarik Mirik, Veysel Kocaman, & David Talby. (2024). LangTest: A comprehensive evaluation library for custom LLM and NLP models. Software Impacts, 19(100619). https://doi.org/10.1016/j.simpa.2024.100619